Category Archives: Asked & Answered

Map Challenge!

Asked & Answered: 10

I have always been a map nerd.  When I was a kid, I drew street maps of imaginary cities with improbably curvy roadways and myriad underpasses.  One of my fictional streets would invariably be named “Standard Avenue” or “Standard Road,” being that my father worked at Rockwell-Standard and my imagination had its limits.  When I was a little older, I biked down to city hall and bought a street map of our town (for one dollar), tacked it up on my bedroom wall, and then went about exploring all the streets I could pedal to on my mono-speed bicycle. 

I also remember inventing a map game to play with the younger kids who lived next door.  I appropriated a fold-out world map from one of our National Geographic magazines, drew a grid on the map to create “spaces” for tokens to move about, and then marked various squares on the map as “targets.”  The game required players to draw “target” cards and navigate from one target to the next via the roll of the dice.  I think I made up other special rules or cards to introduce an element of chance/excitement into an otherwise mundane undertaking, but hey, I was like, eleven years old?

Anyway, map-nerdiness is the only thing I likely have in common with Jeopardy! champ Ken Jennings, who in fact wrote a book titled “Maphead” which I really must read one day. I don’t know whether “Maphead” deals with challenges like the following, but here goes.

I recently came across a post heralding the longest straight overland line in the continental U.S, which I call the Nandor Line in honor of its discoverer.  The Nandor Line, 2,802 miles long, extends from Ocean Creek in Washington State to Ocean Drive in Jupiter, Florida.  In theory, one could walk the Nandor Line without crossing an ocean, gulf or foreign land.  (Here, we must stipulate that Idaho does not constitute a foreign land.)

As best I can tell, the Nandor Line passes through 12 states: Washington, Idaho, Montana, Wyoming, Nebraska, Kansas, Missouri, Arkansas, Mississippi, Alabama, Georgia, Florida. But there are close calls with a number of other states.  Naturally, this begs the question, what overland line passes over the greatest number of U.S. states, regardless of its length?

I have an answer… but I am not certain if it is the answer.  As with all scientific endeavors, my answer will be the answer until someone else comes up with a better one.  Here is how I came up with mine.

I decided that my solution — as well as those of any challengers — should be expressed in terms of a direct flight from Airport A to Airport B.  This allows the proposed path to be accurately mapped and checked via the website GPS Visualizer.

I also decided that flying over the Great Lakes is allowed, as long as the flight path does not cross the border with Canada.  Sticklers for full-overland solutions may object, but then how would they propose we cross the Mississippi River?

Anyway, here’s my solution (click to zoom): New Bedford, MA (EWB) to Fullerton, CA (FUL), a 2597-mile flight over 17 U.S. states: Massachusetts, Rhode Island, Connecticut, New York, Pennsylvania, Ohio, Indiana, Illinois, Iowa, Missouri, Nebraska, Kansas, Colorado, Utah, Arizona, Nevada, California. 

Although the illustration may suggest otherwise, the flight path is “straight” insofar as it represents the shortest distance between the two airports.  Several state flyovers are very close calls, and the flight path over Lake Erie comes within yards of the Canadian border.  Nonetheless!  I challenge like-minded map nerds to come up with an 18-state solution, or a 17-state solution that is strictly overland (excepting rivers).

More in  Asked & Answered | Read 7 comments | Subscribe

Asked & Answered 9.0

Coronavirus Chart - Buncombe County - June 2020We have been in isolation mode for the coronavirus for so long that maintaining distance in public has pretty much become an automated, fear-induced behavior.  This is sad.  Like everyone else, I want to get out and go to restaurants and live life normally, but the case numbers here have not been encouraging.  I have been keeping a chart (at right) which shows we are averaging 7 new positive tests a day.

Is there some “magic number” of new cases per day that would make me feel comfortable increasing my degree of exposure to the world?  There must be some number, otherwise it wouldn’t be worth my while tracking this data.  So, how do I translate the rate of positive test results in my area to my personal risk level of going out and about?

Having found no clear answer to this on the internet, I thought I would play Dr. Fauci and guesstimate it myself, given the data available to me, plus some assumptions.  Here goes.

• • •

At its most basic level, the formula may be expressed as chance of infection per contact = chance the other person is infected times chance of transmission upon contact.

Let’s start with the chance that the contacted person is infected.  Here I define contact as passing through a person’s airspace for some length of time.  I will ignore transmission via objects for now, as I use gloves, hand-washing and object-cleaning protocols to minimize that risk factor — for me.

My chance of contacting an infected person on a given outing depends on the number of contacts I make and the percentage of infected people among them.  This is a good place to introduce my framework and assumptions, starting with the chart below:

Populations and sub-populations with respect to Coronavirus InfectionMy first assumption is that I only come in contact with the local population (P), defined as those who live in my county.  This population consists of two groups, those infected with the virus (I) and those not infected (N).  The infected group is further divided into three sub-groups: asymptomatic (IA); symptomatic but untested (IS); and those who have tested positive (IT) and are considered contagious.  Those who test negative and those who have recovered from the virus are included in group N.

For our purposes, I assume that infected people are evenly distributed around the county and may travel anywhere within its borders.  Whether I meet an infected person depends on their propensity to travel, which may differ among the various sub-groups.  If we define the baseline (i.e., non-infected group) propensity to travel as fN = 1.0, then we can assign (by guesswork) travel factors fA , fS  and fT to the infected sub-groups, where each f-factor is between 0 and 1.  For instance, we might suppose that those who have tested positive and are still contagious would have a low propensity to travel — so fT might be 0.1, say.

This framework leads to Equation 1, the chance CI that a random contact is infected:

(1)      CI = ( fA IA + fS IS + fT IT ) / P

Now, we know P, and county health officials provide a weekly update on the number of positive tests, but that’s about all we know.  We have to deduce the sizes of the sub-groups IA, IS and IT and the travel factors fA , fS  and fT using official estimates and semi-informed guesswork.  Let’s consider each of these in turn.

First, we estimate the number of people in the IT (tested positive) group as IT = d Q , where d is the duration of a typical infection (also the quarantine time following a positive test) and Q is the rate of new positive test results per day.  We assume that the positive test rate is at steady-state, i.e., Q is constant.  For simplicity, we also assume that those who test positive are tested on the first day of their infections.  So, if Q = 10 new positive test results per day and d = 21 days, then the number of people currently in the IT sub-group is 210.

But there are plenty of stories about symptomatic people who do not bother to get tested and who only slightly modify their behaviors (the Isub-group).  I have found few clues as to the size of this group, and none of the testing statistics are relevant.  The best I can do is take a wild guess at the fraction of infected people who develop symptoms but go untested. Based on human nature (see hurricanes), I would bet that at least a third of symptomatic people “ride it out” without ever seeking care or getting tested, at least in this county.

So the size of the symptomatic group IS = k( I + IT ), where kis the fraction of those symptomatic and infected who go untested.  This may be rearranged as IS = IT k/ (1 – kS ).

Finally, the infected-but-asymptomatic group IA .  There has been much debate about how many infected people are asymptomatic and how infectious they are.  The CDC estimates that 35 percent of infected people are asymptomatic — I have seen figures as high as 80%. For our purposes, we will assume that asymptomatic people also never get tested, and that their propensity to travel is the same as the non-infected group.

The relevant formula here is IA = ( IS + IT  ) k/ (1 – kA ), where kis the fraction of infected people who are asymptomatic.

Given all the above, we are now ready to plug some numbers, guesses and estimates into our equations:

  • Local population (P) = 262,000
  • Local positive test rate (Q) = 7 positive results/day, where we live, as of now
  • Average duration of infection (d) = 21 days, more or less
  • Fraction of symptomatic infected people who go untested (kS ) = 0.33 (?)
  • Fraction of infected people who are asymptomatic (kA ) = 0.35 (per CDC)
  • Relative propensity of asymptomatic-infected people to travel (fA ) = 1.0
  • Relative propensity of symptomatic-infected people to travel (fS ) = 0.7 (?)
  • Relative propensity of tested-positive people to travel (fT ) = 0.1 (?)

This works out to 147 tested-positive people (IT ), 72 symptomatic-infected people (IS ) and 118 asymptomatic-infected people (IA ) in my county at the moment.  So the chance CI that an individual contact is infected, given my assumptions, is…

One of every 1427 contacts.

This is equivalent to selecting one American at random and finding out that she lives in Boise, Idaho.

Extending this scenario, if I were to contact 10 people during an outing here, the chance that at least one of those contacts is infected would be about 1 in 140.  This is roughly the odds of drawing a straight or better in a five-card poker hand.

What about large gatherings, say, a restaurant with 50 patrons?  According to my model, the odds in my county that at least one of them is infected would be about 15 to 1.  So, this gives you a sense of how the risk increases with the number of contacts one makes.

• • •

Transmission factor vs time for coronavirus (an example)But contact is not the same as transmission.  The likelihood of transmission depends on the degree of exposure (i.e., how much virus is being released) and the time of exposure, as well as the overall effectiveness of masks, distancing and the like.  We can quantify this via the transmission factor τ = 1 – (1-x) t where x is the transmission risk per minute of exposure and t is the length of exposure in minutes.  (The chart at right shows τ vs time for x = 0.4).  τ = 1 represents 100% certainty of virus transmission during a given contact.

As with propensity to travel, the different infected subgroups may vary in their ability to infect others.  For example. asymptomatic victims are thought to shed less virus than those who are coughing and sneezing.  On the other hand, knowing that one is interacting with a positive-test victim may lead both parties to take more precautions.  So, the x factors could look something like this in practice:

  • Transmission risk per minute, asymptomatic-infected people (xA ) = 0.1 (?)
  • Transmission risk per minute, untested symptomatic-infected people (xS ) = 0.3 (?)
  • Transmission risk per minute, tested-positive people (xT ) = 0.2 (?)

These guesswork values imply, for instance, that 5 minutes of contact with a symptomatic untested victim is 83% certain to lead to transmission in our current personal-protection environment.  Is this true?  I have no idea.  But again, by building a model and plugging in numbers that sound sort of reasonable, one can at least get an order-of-magnitude sense of the risk.

So, adding transmission risk to Equation 1 produces the model in Equation 2, where CX is the chance of transmission from a random contact:

(2)      CX = (τA  fA IA + τS  fS IS + τT fT IT ) / P

And, the chance of transmission C from an outing with n random contacts (assuming the same exposure time for each) is then…

(3)      C (n) = 1 - (1 - CX ) n

Here then is a chart summarizing my estimated odds of being infected from an outing in my county at the current time, given n random contacts and exposure time t per contact (plus all the other assumptions above):

Local odds of becoming infected with Coronavirus, as a function of exposure time per contact and number of contactsThese are admittedly rough estimates, but the salient point here is that these odds are not a million-to-one, but nor are they three-to-one.  If they were either, one’s rational response would be much clearer.  As it is, these figures call for deliberation, which we have not been given much opportunity to do — health officials are naturally reluctant to express our risks this way.  Epidemiologists are not paid to make back-of-the-envelope calculations — that’s why I had to do it.

• • •

Now, if you’re not happy with my figures, you can play Dr. Fauci yourself.  I have created a risk calculator that will let you enter figures for your own location and estimate your odds of becoming infected.  And you can change all the parameters of the model as you see fit.  The odds are updated instantly when you change one of the entries.

If you play around with the calculator a bit, you will see how the odds depend heavily on the assumed fraction of infected people who are asymptomatic (kA).  The more people who are out and about, unaware that they are infected but still able to infect others, the greater your risk of becoming infected during an outing.*

Now it’s time for the disclaimers.  I remind readers that this is a static model, a snapshot of a person’s contact risk given the current local rate of new cases and enough assumptions to fill an F-150 pickup.  It is not a dynamic model — it does not predict trends and it does not take the weekly rise and fall in local rates into account.  It is obvious from my model, however, that if one wants to lessen her risk of infection, she should make fewer outings, limit her number of contacts and the time she spends with them, and adopt measures to minimize the risk of transmission when exposed to those contacts.

But what if the odds are 1000:1?  Well, it will be Person #1000 who takes that chance and does his part to keep the pandemic alive.

Which is what responsible health officials have been telling us all along.


I ran some numbers for the Tulsa, Oklahoma, Trump rally scheduled for June 20, 2020.  Tulsa County has 620,000 residents and its daily new case rate is 120 per day and rising.  After adjusting some of my figures, given that attendees will be temperature-checked and the more symptomatic may be turned away, I still come up with odds of transmission of about 44:1.  I assume that each attendee will share an airspace for 60 minutes with his 8 nearest neighbors.  If 19,000 people attend, then about 430 cases, and perhaps 5 deaths, may stem from the rally. “A very small percentage,” Trump said.  He doesn’t care.


* Robert Redfield, CDC Director:  “Of those of us that get symptomatic, it appears that we’re shedding significant virus in our oropharyngeal compartment, probably up to 48 hours before we show symptoms This helps explain how rapidly this virus continues to spread across the country because we have asymptomatic transmitters.”
More in  Asked & Answered | Read 6 comments | Subscribe

Asked and Answered 8.0

You’ve been there.  You’re driving on the expressway at a reasonable pace, a bit faster than some drivers, slower than others.  You move into the left lane to pass a slightly slower car.  Just about the time you draw even, you glance up at your rear-view mirror and see a set of fierce headlights bearing down and closing in on you by the second.  You think, where did this guy come from?  Barely ten seconds later he’s riding your tail, making sure you know in no uncertain terms that you’re in his way, so get your ass moving already!

At this point, you make one of two choices, depending on the kind of driver you are and your mood that day.  You either finish your pass without changing speed, no matter how much the guy behind you tries to intimidate you, or, you decide this person is bad news and you speed up to get out of his way and let him go roaring by — which he will.

Doesn’t it seem like aggressive drivers are everywhere?  It makes you wonder what kind of life experiences create such angry, impatient, bullying people.  And can there really be that many of them out there?

To explore this situation, we’re going to do a little thought experiment here at the stay-at-home office of Asked and Answered.  Here’s the setup.  You and 999 other drivers are going to take a 60-mile trip on the same stretch of interstate highway.  The posted speed limit is 60 mph, but most of you drive some other speed.  In our scenario (see diagram), 30% of drivers drive at 60 mph; 30% drive at 65 mph; 30% drive at 70 mph; and the remaining 10%, the most aggressive ones, drive at 80 mph.  We will assume that the four types of drivers enter the on-ramp in random order and at a steady rate of 4 cars per minute.

Now, decide which type of driver fits you best (denoted by silver, blue, green or red) and then answer the question: what will your driving experience be like?  How many vehicles will you encounter and of what type?

To answer this, I originally thought that I would have to write a computer simulation of the problem and keep track of hundreds of cars as they navigated 60 miles of interstate. But then I stumbled upon time-space diagrams in traffic engineering texts.  Simply put, such diagrams capture how a vehicle (or any number of them) covers a stretch of road based on the vehicle’s speed profile.  This concept is the key to the ignition, if you will.

The chart at right is a simplified example of a time-space diagram.  Travel time is on the horizontal axis and total distance traveled in that time is on the vertical axis.  Each of the colored lines (refer to color scheme above) represents a trip made by one of the drivers in our scenario.  In this example, one driver of each speed-type drove the 60-mile trip, with the drivers starting out 5 minutes apart from each other.  The silver car started first, followed by green, blue and finally red.

We will assume each driver maintained constant speed — otherwise these would be curves instead of lines.  Now, consider the trip-line of the red car, the 80-mph driver (Red) who started at the 15-minute mark and finished his trip at the 60-minute mark.  Whenever two trip-lines cross, it means one car passes another.  Here, we see that Red encountered and then passed Blue around Mile 25.  Red then caught up with Silver at Mile 60, the end of the trip for both drivers.  And on this trip, Red and Green never saw each other.

One more example before we turn to the original question.  The chart at right shows the trip-lines for 40 cars on a 60-mile trip.  Four of every five drivers (silver) drive 60 mph; the fifth (pink) drives 90 mph.  From the line crossings, we see that the typical pink car passes 10 silver cars, while a silver car encounters 2 or 3 pink ones.

What we learn from this example is that a driver’s perception of the other types of drivers on the road is distorted by the relative speeds of the drivers.  Based on his encounters with other cars during his trip, a Silver driver could easily conclude that most drivers are Pinks.  Conversely, a Pink driver might think that over 90% of drivers are Silvers.

This brings us to the formula I derived for the expected number of times a given driver will encounter drivers of other types during a constant-speed trip.  The formula is:


EIJ = the expected number of encounters Driver I will have with drivers of type J
fJ   = the fraction of drivers of type J in the general population
C   = the average number of cars per hour (of all types) passing a given point
D   = the distance of the trip in miles
SI    = the (constant) speed in miles/hour of a driver of type I
SJ   = the (constant) speed in miles/hour of a driver of type J
SΔ  = the absolute difference in the speeds of Drivers I and J


Now let’s return to the question posed at the start.  We defined 4 types of drivers based on their speeds: 30% (Silver) drive 60 mph, the next 30% (Blue) do 65 mph, 30% (Green) do 70 mph, and 10% (Red) do 80 mph.  In the figure below, the actual driver population is shown in the middle, along with each driver’s perception of the population based on the sample of cars — including her own — that she encountered during the trip.

A few things to note here.  First, based on her encounters on the road, every driver thinks that her own driver-type is decisively in the minority.  This is because she will neither pass or be passed by anyone driving at the same speed.  While she may follow another car of similar speed for many miles, her own vehicle and the one she sees right in front of her would be the extent of her experience with like drivers.

A second observation is that every type of driver over-estimates the proportion of both speedsters and slowpokes (unless you happen to be one of those types).  In our scenario, both Silver and Blue drivers believe that more than 25% of drivers are Reds, when the actual figure is 10%.  Similarly, both Green and Red drivers believe that Silvers comprise 45% or more of the driving population.  As is evident from the formula, the greater the speed difference, the more encounters one is likely to have with a given type of driver.  Remember this the next time you complain about all the crazies on the road.

I would be remiss here if I didn’t mention something about the real-world distribution of driving speeds.  The most recent data I could find is from a 2015 federal survey of traffic speeds on various classes of roads, compared to the speed limits on those roads.  The data for limited-access highways suggest that 30% of drivers do not speed (Silver), 25% exceed the speed limit by up to 5 mph (Blue), 25% exceed it by no more than 10 mph (Green), and 20% drive more than 10 mph over the limit (Red).

So the hypothetical driver pool that I presented in my original question is not much of a departure from reality.  The main difference is that, in real life, there is a continuum of driving speeds, and most drivers do not maintain a fixed speed for a whole trip.  However, I think my general observations still hold:  relative speed skews a driver’s perspective of the driver pool.  The greater the speed difference, the more prevalent that type of driver appears to be to you.

Thanks for reading.  I trust all your questions have now been answered.  Except of course, the most important one:  what makes aggressive drivers be that way?

More in  Asked & Answered | Read 3 comments | Subscribe