Fall 96 - Considerations for Data Quality and Uncertainty in Testing of Bicycle Aerodynamics

By: Michael J. Flanagan, Ph.D.

Reproduced from: Cycling Science Fall '96

Testing of bicycle aerodynamics in wind tunnel facilities presents a unique set of challenges generally not found in traditional aerodynamic testing. For a typical aircraft test, a scale-model would normally be used and the test results would be scaled and manipulated based upon a nondimensional flowfield parameter such as Reynolds number - a term that relates velocity, model scale, and the viscosity of air. Drag, the result of the contributions of many elements such as shape factors, skin friction, boundary-layer growth and transition, and shock waves, is highly dependent upon Reynolds number. It is nearly impossible to simulate the full-scale environment for typical aerodynamic testing (it would cost too much money and require a power grid larger than the TVA); hence, some "skilled" manipulation of the data is required - an effort that introduces significant uncertainty in the accuracy of the test results.

Figure 1

For this test effort, it was possible to simulate the full-scale environment. As the test article was full-scale and the wind speed was matched precisely, duplicating the real-life flowfield was simple. With no post-test manipulation of the data necessary, it was believed that highly accurate drag measurements would be possible. Little did we know that, in our attempt to measure highly accurate drag levels associated with bicycle aerodynamics, we would uncover a Pandora's box of data quality issues related to the very nature of cycling dynamics. The nature of this testing did bring to light a peculiar point of view in the age -old quest of accuracy versus precision. Simply put, even though a test can be organized and run utilizing state-of-the-art equipment giving precise answers to 0.05 lb, the flowfield can be so complex given such variables as wheel rotation, pedaling speed (cadence), and rider posture that accurate measurements are impossible. Confused? Don't be! This is the age-old question of accuracy versus precision. The answer you get is very precise - unfortunately, it is precisely the wrong answer!

Since determining highly accurate drag levels was the primary focus of this test, it was necessary to clearly understand the level of measurement uncertainty of the test data. The term measurement uncertainty relates to how far the measured answer is from the "true answer." Simply put, we can think of the measurement uncertainty as the sum of the precision and bias errors. The "state-of-the-art" data acquisition system, model mount, and "rolling-road" simulator enabled us to measure very precise answers; but the basic answers were "biased" by such "errors" introduced by rider position, wind-chill (January is a cold month to sit in a wind tunnel at 30 mph), pedal speed, and wheel rotation ... to name a few.

To date, a significant portion of the known experimental data comes from a technique referred to as "coast-down testing."1 This method involves the derivation of bicycle (and rider) drag from the velocity achieved at the end of a ramp of known geometry. Typical drag levels at 30 mph are of the order of 6.5 to 8.0 lb for a rider on a racing bicycle in a racing posture. 2 This particular speed, bicycle, and posture conditions are chosen to provide a degree of consistency for presenting test results. A drag reduction on the order of 10 percent would save an estimated 2 minutes total time during a typical 60-minute race. As races are won by mere seconds, serious cyclists would gain an advantage with drag reductions on the order of 1 percent, or 0.08 lb. Proponents of the coast-down technique criticize the lack of adequate simulation during static testing of bicycles in wind tunnels. Coast-down testing inherently produces time-averaged results of the bicycle trajectory (the same type of results that would result from static testing within a wind tunnel). The final velocities derived from coast-down runs are affected by rider position, pedaling technique, and the mechanical condition of the bicycle, as well as environmental factors. Despite these influences, the uncertainty of the drag levels associated with these tests have been quoted to within approximately 0.02 lb. 3

EXPERIMENTAL SET UP

Four rider postures were considered as the controlling variables for this test effort. Two typical postures, seenleft, reflect a broad sampling of bicycle riders and uses. These body postures range from those used by recreational riders to those used by competitive racing cyclists. The four postures are described as: (1) general touring pose,(not shown) (2) drop-touring pose, (3) racing pose, and (4) long-distance racing pose (not shown) (similar to those popular with current triathletes).

Posture 2

Posture 3

The balance assembly used to measure axial force and hence drag force is mounted in the seat post directly beneath the rider. This was done not only in an attempt to minimize the effect of pitching moment on the balance but also to constrain the bicycle (and rider) in pitch (up and down) and yaw (left and right). The bicycle and rider, suspended within the test section, are integrated with the centerline sting-support in the wind tunnel. The heart of this balance is a one-component load cell utilizing linear bearings to transmit all axial loads to a compression load cell.

Figure 3

The roller system, submerged beneath a false floor, enables the rider to power the bicycle wheels at a speed that matches the tunnel airspeed and simulates true motion of the rider and bicycle environment. This assembly consists of a single roller for the front wheel and a dual-roller assembly for the rear wheel. The rear wheel of the bicycle transfers power to the dual rollers. A belt drive connecting one rear roller to the front roller enables the front roller to spin the front wheel at the same speed in order to match wheel rotational velocities. A simple speedometer connected to the front wheel hub enables the rider to match wheel rotational speed, and there-fore bicycle velocities, to the tunnel velocity. The entire test set-up is shown above in Figure 3.

Part of the process to minimize the uncertainty involves the proper determination of interference effects, also called tares. These values cannot be overlooked as they can cause the drag levels to vary by as much as 100 percent! Consideration must be given to the effect of the floor contact on the overall drag level. As the cycle wheels "bridge," or interfere with, the drag balance at the three roller-contact points, the interference effect must be measured and subtracted (or added) to the final measurements. Essentially, every data point must be measured several times with the wind off in order to determine these tare values. The wheels are in contact with a nonmeasurement surface. The extent of that contact is dependent upon a complicated relationship between tire deflection, sting deflection due to model weight, and center-of-gravity position, as well as the load-cell deflection. Not all of the individual tares act in the same direction. Careful attention to sting deflection and tire deformation is necessary to ensure that the tires contact the rollers, yet do not hinder axial movement of the bicycle due to drag.

EXPERIMENTAL RESULTS

Test results give time-averaged drag levels on the order of 10.2 lb. for a rider in racing posture 3, at 30 mph. This drag level increases to about 12.9 lb when the rider moves to a more vertical position, one that is typical of a posture for a recreational cyclist. These levels are higher than the drag levels quoted in all three references by approximately 20 percent. This variation is understandable considering the rider accounts for almost 70 percent of the total drag and the rider for this study was of muscular build - not typical of the average racing cyclist. While steady-state test results are similar to those found in previous works, in truth, the basic nature of this drag data is highly dynamic - a fact that will introduce bias errors during time-averaging. These time-dependent trends were not evident in the test results from the "coast-down" technique.

Figure 4

The graph, Figure 5, presents a comparison between the drag levels while the rider is pedaling at a cadence of 75 rpm versus not pedaling (stationary). Notice the dynamic characteristics of the data. The mean (average) value of the data at a cadence of 75 rpm is approximately 10.2 lb with a corresponding root-mean-square (RMS) drag value, a bias error, of approximately 0.64 lb. These data for posture 3, at a wind speed of 30 mph, graphically demonstrate the effect of pedal cadence. The difference between the mean drag values of the two curves, pedaling versus stationary, is approximately 4.0 lb. In addition, the results from coast-down testing fall between these traces. The second plot, in Figure 6, presents a comparison of drag levels for postures 1 and 3. Notice how the data are quite similar in dynamic content. The difference in average value is due to the rider position (exposed frontal area). It is particularly significant to note that the difference between the mean drag values for a rider pedaling and for the pedals stationary is independent of body posture. This drag value difference seems to be a constant value of approximately 4.0 lb.

Figure 5

Figure 6

MEASUREMENT UNCERTAINTY

For the -25 to +25 lb range load cell used in this study, the manufacturer quoted accuracy is 0.2 percent full-scale, or about 0.05 lb. Precise calibration within the range of 0 to 15 lb provides for an expected precision of 0.03 lb. Careful attention to the calibration, the data acquisition system, and electrical interference (noise) provides for an overall precision error that is approximately 0.05 lb. These values can generally be determined prior to the test. Determining the bias error however, is not so simple and generally must be done after the test has been completed.

Data repeatability is also a significant problem. Subtle variations in body position while trying to maintain a specific posture led to variations in the mean drag values by as much as 1.4 lb. Additionally, pedal cadence had a strong influence on RMS drag value. This was seen when the author, at a cadence of 55 rpm, managed to produce an RMS drag value of almost 1.5 lb (almost three times the value at a cadence of 75 rpm). Finally, the windchill factor was not an insignificant problem.

Determining the bias error generally requires a detailed statistical analysis of the test data. Allow me to digress momentarily to describe some important basics of statistical analysis (understand, of course, this paper was written by a rocket scientist). When I talk about statistical analysis I refer to identifying trends buried within the data. These trends are evident from four statistical terms named: mean (average) value; variance (standard deviation or RMS), skewness (symmetry); and kurtosis (deviation from a Gaussian "bell" shape). Drag levels are calculated using the first two statistical moments, the mean and RMS values. These two terms provide a good approximation of the average value and the relative variation within the data - the lower the RMS, the lower the overall error. In order for these values to have any physical significance, the mathematicians who derived these relationships required that the flowfield would be ergodic. Okay, so what does it mean when a flowfield is ergodic? Well, ergodicity is a property whereby, for all probability, all data statistics can be calculated from a single data-trace by time-averaging. More simply put, the average of 1000 data points taken during one run would be the same as the average of 10 data points from each of 100 runs. This concept of ergodicity is crucial to wind tunnel testing as it allows scientists to accurately draw conclusions from a small set of data. If each data point needed to be repeated, say 10 times, the cost of testing, would be prohibitive.

Figure 7

This concept of ergodicity requires that the data be stationary, specifically, that the data exhibit no time-dependence in any of the statistical moments. Figure 7 presents a plot that is representative of the RMS drag value (second statistical moment) versus time. For this particular run, 20 seconds of data were taken corresponding to approximately 10,000 data points. The result of this run is one "test point" - the average of all 10,000 data points for this run. The data exhibit a trend to converge to approximately 0.8 lb - the value is actually meaningless. The very fact that this plot shows ANY time dependence, as opposed to being a straight line, is VERY significant. This plot shows that the flowfield is NOT ergodic. Consequently, the entire data set is suspect from an accuracy standpoint. The concept of ergodicity requires, regardless of the particular ensemble (data set) analyzed, that there will be no time-dependence in the ensemble - that all test points will yield identical results. In order to achieve an accurate result, repeat runs would be necessary to achieve some type of ensemble average. This ensemble average would provide a more accurate estimation of the "true" drag level. How many repeat runs are necessary for this one test point? The only way to know for sure would be to make perhaps a dozen or more and plot the average value from those runs.

One can now understand the problem encountered. If our original test matrix involved several model configurations at several velocities, 100 test points could be involved - say five days of testing at $4000/day. Now, if we are required to repeat each test point 10-15 times to achieve our desired accuracy goals, we now need 50-75 days and $300,000.

CONCLUSIONS

The most significant trends apparent within the test data are the dynamic influence of wheel rotation and the rider pedaling. Additionally, the effect of subtle changes in body position during the run is significant. While the dynamic trends are of interest, the most useful information to the rider is some type of accurate, time-averaged drag value. Accurate time-averaging, however, does require the assumption of flowfield ergodicity. As the flowfield surrounding the cycle and rider within the wind tunnel is clearly NOT ergodic, "real" drag values are meaningless for short-duration testing. In order to achieve meaningful drag levels, the test plan must allow for sufficient repeat runs for each test point. What are the reasonable limits to data quality? For this test, the customer wished to know drag levels to 0.06 lb or approximately the predicted measurement uncertainty level. This was not possible after the fact, as the flowfield was clearly not ergodic and the test plan and funding did not allow for any additional testing. Some of the information obtained was useful. In particular, specific effects during a given run, such as the effect of head-up versus head-down, provided a repeatable increment from run to run. So, even though the absolute level of the drag was never determined, a highly accurate measurement could be made of the effect of handlebar droop, head-up versus head-down, and pedaling versus stationary. In addition, as the effect of pedaling was the primary reason the flowfield was not ergodic, all data taken while the rider was stationary data were valid.

For this particular technique, the sources for uncertainty are neither the instrumentation nor the data acquisition system. Rather, the close-coupled nature of the balance and test vehicle system creates a flowfield environment that is not ergodic. The presence of the rider and the dynamics due to the pedaling required to achieve proper flowfield simulation force the assumption of ergodicity to be invalid. This raises serious doubt about the certainty of the measured drag levels. Only through repetition of one particular test condition will sufficient data be available for an ensemble average. This will provide a more accurate estimation of the "true" drag level.

References
1. Schondorf, P. (1982) Design Examples - Coast Down Tests. In Proceedings from the First International Human-Powered Vehicle Scientific Symposium.
2. Phone conversation with John Cobb of Bicycle Sports; Shreveport, LA.
3. Kyle, C.R. (1982) Improving the Bicycle Racing System. In Proceedings from the Second International Human -Powered Vehicle Scientific Symposium.

Michael J. Flanagan, Ph.D.
University of Dayton Research Institute
Dayton, OH 45469-0117
email: flanaginj@elwood.udayton.edu