Examination of Witnesses (Questions 19640
- 19659)
19640. What has been available to you, you have
looked at?
(Dr Hunt) Yes.
19641. With that in mind, can you take us through
the sources of uncertainty with the Findwave model in mind, please,
starting with model assumptions?
(Dr Hunt) Yes, in general I have been interested
in trying to understand why it is that predictive models have
uncertainty built into them. It is then a question of how can
we make them better, and if we understand the sources of uncertainty
then we can move towards improving those models. I have divided
the areas of uncertainty into seven areas which you see listed
before you. I can go through them very quickly one by one. At
some stage or another, before you begin to write software, to
make predictions you have to decide which laws of physics you
are going to use. There are various things we need to do.[18]
For instance, we need to understand that elasticity is a linear
phenomenon, by which I mean that if you push on something with
twice the force you will get twice the deflection. Interestingly,
some of the simplest problems are not linear. The example I like
to use most often is that of a tuning fork, which, once hit, you
cannot hear until you place it onto a table, and when you do place
it onto a table it is much, much louder than it is in free air.
The only way you can explain that is using a non-linear model.
I am not saying this is at all relevant to the exact process of
modelling vibrations on railways, but it is an example whereby
a non-linearity is present and if you inadvertently ignore it
that can lead to a significant error. Damping models are very
important. Damping is all about how quickly things die away. If
I hit a piece of metal, it rings like a bell; if I do the same
to a similar sized piece of rubber, it is dead. Rubber is a highly-damped
material and a piece of metal is a very low-damped material. In
a construction of a tunnel in soil, we have a mixture of all sorts
of materials and all sorts of different damping characteristics.
Understanding the damping in detail is essential to get an accurate
model. As a body of scientists and engineers, we do not understand
damping. The next slide refers to the excitation model. We need
to know about the way in which the wheel which is not perfectly
round moves along the rail which is not perfectly smooth. The
models that we have which are presented in many conferences and
many journals go a long way to understanding the excitation model
but they are certainly incomplete and there are errors which arise
from that. Next Slide please.[19]
All computer programmes are inherently inaccurate because you
have to make some kind of assumption about the real world to fit
them into computerised language. For example, if I wanted to find
a circle, you have to do what Archimedes did when he was trying
to get a value for pi: you divide your circle into perhaps
six points to make a hexagon or eight points to make an octagon.
How many points do you need around a circle before you can really
call it a circle? How many points do you need around that circle
before, for instance, the value of pi would be correct?
That is an example of what I have called here the discretized
models needing to be checked for convergence. Also, we know by
using any software that we do find bugs and those bugs need to
be found out and sometimes they can be hard to find. Also we sometimes
find that when we want to run a computer model quickly there is
a limit to the amount of time and resources that we have to cut
our model smaller or so on and that leads to errors. That is what
I have called the truncation error. The slide headed "Shoe-horning"
the picture on the left shows a train running through a tunnel
and the tunnel is not perfectly circular and the ground around
is not perfectly flat.[20]
There are roots from trees and there are inclined layers and perhaps
there is a river nearby, yet the computer model will not have
all these things and I have to use my judgment to fit the things
that I am trying to model into the model. What kind of assumptions,
what kind of judgment might I use to deal with these things. Different
people will make different assumptions and therefore they will
get different answers. My guess is that two different people using
Findwave, for example, or my own Pipe-in-Pipe model will get different
answers, even given the same measured data for soil parameters
and so on. Next slide please.[21]
The next area is data gathering. We know we have to try to figure
out what the soil properties are, what the tunnel characteristics
are, what the foundation characteristics are, where the water
table is, the geometry of the buildings and so on and so forth.
We have to know how accurate that data is because, as the adage
goes, "garbage in garbage out". I am not saying we are
putting garbage in but we are putting data in which has some sort
of error associated with it and, inevitably, if there is uncertainty
in the data provided to the model, there must be uncertainty coming
out. We also assume, for instance, that if we measure properties
at the ground in one place they will be the same as they might
be were they measured somewhere else. Perhaps, also, if we measure
them on the Monday they will be the same as if they were measured
on a Tuesday, or if they were measured in the summer they will
be the same as they would be in the winter. There are some things
we simply cannot measure. What are the soil properties directly
underneath the tunnel. It turns out that this is a very important
part of the model. It is very hard to obtain data for that kind
of place.[22]
I have mentioned the excitation model before. There are so many
things that we do not truly understand about the excitation model.
We can presume that we understand what is going on but I have
listed here a few things: unsprung masses, vehicle speed and so
on, but we find that if we take the same train running over the
same piece of track at the same speed that we will get different
measured sound levels or measured vibration levels. Why is that?
Perhaps it is because the train is not following exactly the same
track line on the rails or perhaps it is because the interaction
between the wheel on the rail is not exactly what we think it
might be. But there is some uncertainty there.[23]
The measurement point is a clear source of uncertainty. If we
take a room such as the one we are in here, we know that this
room is probably going to be different from the one next door.
What kind of detail do I need in my model to be sure that I have
got to within one or two or three or four or five decibels, a
good answer for prediction of sound levels? Likewise, whereabouts
in the room am I going to make my measurements? We all know that
if you go to a venue to listen to music, there are some seats
from which you hear better than others. The nature of rooms is
that they are variable. One thing which is shown in the colour-coded
diagram is the dB level variation you might get in the ground.
The white circle in the middle represents the tunnel. The scale
is given in metres. This represents a cross-section through the
ground 40 metres wide and 40 metres high, so 20 metres up and
down from the tunnel, and 20 metres left and right. You can imagine
that somewhere where near the arrows are pointing represents the
position where the foundations for the building might go. The
difference in colours, depending on where the arrow heads are,
go from yellow to orange-ish to light blue and dark blue. There
is a variation there of plus or minus 10dB from place to place.
This diagram shows the effect of what kind of variations you might
get with small, but reasonable within the levels of uncertainty,
changes of soil properties. You can see then, depending on exactly
where you think the foundations are for the building, that you
might find the building will act as a receiver for vibration much
more effectively in one particular case but less effectively in
another. Do we really have enough detail in the model and our
knowledge of the building to be sure?[24]
Then I turn to validation. We really need to be sure that whatever
model we use has been used often enough to demonstrate to us all
the range of predictions. We do not just want to see the best
predictions, we want to see the worst ones as well. If we are
trying to forecast the weather we need to know that sometimes
the weather forecast is really bad and sometimes the weather forecast
is really good, so that we get some feel for how much we should
trust the weather forecast. My feeling is that, generally speakingand
it is not just on Findwave but all models of which I speakwe
are not in a position to have followed through the validation
process sufficiently to have confidence in making accurate predictions.
I know that the world of the weather forecasting is very, very
far down the track on that. You may not know that the weather
forecast that we get is an amalgam of several collaborating or
communicating communities of weather forecasters and they all
have to agree on the weather forecast for the next 24 hours and
when they do agree our weather forecast is robust. When they disagree,
we get what we call unsettled weather or uncertain forecasts anyway.
You can see this little quotation at the bottom from the world
of Microscale Meteorologists. These are the ones that look at
pollution dispersion. If you have a chimney which is spewing out
horrible stuff, we want to know that pollution levels are modelled.
I think it is very nice that they consider that models are "only
of use if their quality (fitness for purpose) has been quantified,
documented and communicated to potential users. It may not be
appropriate to talk of a valid model but only of a model that
has agreed upon regions of applicability and quantified levels
of performance" I just do not think we have those agreed
upon limits of applicability and we do not have quantified levels
of performance. I have given two examples that Findwave used in
published literature on the next slide.[25]
The one on the left was published at a conference I attended in
September 2004 and the one on the right Findwave has used against
the same set of measured data published as part of the validation
process for Crossrail. The next slide shows an enlargement of
both graphs.[26]
The red line is a prediction made by Findwave for the Crossrail
validation and the blue line is a prediction made by Findwave
for the conference in Buxton a few weeks earlier. The black dotted
line is the measured data. It is in the nature of things that
when you have the measured data in front of you it is very easy
to say, "I can see how I can make my prediction better because
I can adjust my data" and indeed my prediction would be better.
What has happened, perfectly sensibly, is that the loss factor,
which is a very important parameter which can be used to change
predictions by a large amount, has been adjusted to get better
predictions. However, the question I would ask is: How confident
are we that in our present case, when the measured data is obtained,
that the model will actually agree very well. When we have the
measured data available to us, it is very tempting to re-run the
model and to get the better agreement. In this case, you can see
the model differences where I have marked 8dB there.
19642. Chairman: Are you talking about
the model used for the whole of the Crossrail project or just
for a particular part of it?
(Dr Hunt) I am talking about models in general,
of which Findwave is one, of which my own Pipe-in-Pipe model is
another. I am talking about areas of uncertainty inherent in the
process of modelling. Therefore, yes, indeed, it does apply to
the whole project. Just to sum up then with this slide here.[27]
Let us suppose we are going to talk in quanta of 5dB because I
do not think anyone is inclined to use a finer scale, but that
plus and minus 5dB is reckoned to be really good and the best
model that we might like to think of. I feel in my own opinion
that there are six areas of uncertainties about, let us leave
out the computer model itself, I do not know whether Findwave
has been checked to make sure there is no bugs in it, and I am
not familiar enough with Findwave, but let us look at things which
are not to do with Findwave but to do with all models. I would
say it is not at all unreasonable to suppose the various assumptions
each have a 5dB uncertainty to them. How do all these add up?
It does not make sense to add them all up. You could end up with
plus or minus 25dB but we do know that models are better than
that, so this is why we have to go through a validation process.
A validation process needs to be robust, transparent and include
the best with the worst. I have no doubt that in a few years'
time as a community we will have gone through that validation
process and have a much better idea of what is a true and representative
area to apply. In the meantime, I think 5dB plus and minus is
unreasonably tight. If you look at measurements and modelling,
you will find that it is pretty much everywhere, the best you
can get is plus and minus 10dB. That is why I had this plus and
minus 10dB figure which I like to use as my figure.
19643. Mr Binley: As a complete layman
whose only concern is trying to deal with this matter, you have
opened up a whole new world to me. I want to relate the world
that you have opened up to what we are talking about with regard
to this particular issue before us. Are we talking about NC25
as a standard which is being applied and lowering it to NC20?
How does your margin of error relate to that? Is it a direct transference
and you are saying plus or minus ten points on that scale?
(Dr Hunt) Yes. I do not
think there is much point in going back to one of the slides we
saw earlier this morning, but if you imagine a slide that had
those little vertical bars, which were 5dB in length, I would
expect those bars, in my judgment, to be plus or minus 10dB which
is in fact what we see. That has been published in and around
this particular case. You will find that, indeed, the data goes
up and down, it is data which in fact Rupert Taylor has just republished
in the last few days. There is a prediction line and the data
goes up and down and around that plus or minus 8 or 10dB.
19644. It becomes pretty useless in those terms?
(Dr Hunt) It does not become
useless at all. There are standards whichI have to say
the setting of those standards and the meeting of those standards
is not my subject, I am into the modelling processneed
to be agreed upon and we need to strive for those standards and
achieving those standards is the ultimate goal. We need to know
what they are. NC25 and NC20 are standards and, likewise, octave
bands or third octave bands are agreed procedures by which we
might go towards meeting those standards. Ultimately, the question
is what do we do if we exceed those standards? Essentially, if
we exceed those standards, we are, by an objective nature, saying
this is too noisy. We are going to use predictive models to help
guide us to design a good railway so that we can in all probability
meet the standards but because there are errors and predictions
always have errors
19645. How do we judge potential error rates
of that kind with the sorts of very, very fine figures that we
hear are necessary with regard to the process, the matter we are
discussing today presents to us? How do we relate those two things
and come to some sort of sensible judgment?
(Dr Hunt) We heard earlier
that the best studios are not built above railways so what is
happening is here we have got a railway which is being built under
the studio, so the predicted models are saying that the vibration
and sound levels will be below NC25 by a sufficient amount to
satisfy those in the recording studio; they may not be. This is
a very sensitive site and it is awkward. Ultimately, it may turn
out that it is nice and quiet and the studio can operate perfectly
normally but it may turn out that it cannot.
19646. Mr Hopkins: Your summary of uncertainties,
the possibility or probability of all of them pointing in one
direction is unlikely, it is a possibility, and, in fact, some
of them may counter each other, some may be plus and some may
be a negative, so one could take an extreme case but that might
be very rare. You have also had the opportunity of looking at
a large number of existing situations where there are tunnels
with continuously welded track, soundproofed, under London Clay
or sand or whatever, and one can look at empirical evidence from
elsewhere across the country, not just in London, to get a normal
distribution of noise effects. You are talking about the possibility
of one end of that normal distribution, an extreme end of that
noise distribution. Anything is possible. We might walk out and
be run over tomorrow, but it is an unlikely possibility, most
of us will survive. I think there are some probabilities to all
of this and it seems to me that you are saying it could be an
extreme case and therefore it might be difficult.
(Dr Hunt) I can say really
that if you look through the various published papers and what
people are presenting at conferences, what you will find on the
internet, what Mr Thornely-Taylor has published and so on, the
plus and minus 10dB looks pretty much the norm and my worry, of
course, is that people only publish their best results. If you
run a model and you get a prediction which is whatever it is and
then you have got lots of measured data and say, "I have
done something wrong here", then you say, "I am so stupid.
I forget to put the decimal point in here", so you run it
again and you get a better figure and it is good, but in the absence
of measured data, would you run the model again? I think what
you are suggesting should be done and it has not been done, and
that is to do lots a predictions, a priori predictions, before
doing measurements or at least without the measurements being
known to the people doing the predictions. That is very rarely
done, because usually what happens is the person who is writing
the paper to be presented in the ninth c International Workshop
on Railway Noise which is coming this September, those people
will have the measured data and the model predictions side by
side and will produce a nice presentation. There you will get
the plus or minus 10dB. My worry is if plus or minus 10dB is present
in the best of models, best of predictions, I think that really
is where my reason for saying plus or minus 10dB comes from. I
should also say my 10dB also comes from my own experience of running
my own predictive model which has been cross-validated with other
models, and you can very easily get plus or minus 10dB there.
You made the point earlier that sometimes these things add up
and sometimes they do not. There is a graph which I am not sure
whether it is going to be presented later on, but it shows a prediction
with a peak of 80 Hz and a couple of troughs on either side, whereas
the measurement does not seem to show those peaks and that is
entirely consistent with the fact that you get strong excitation
from around 80 Hz. It is also possible, as I showed in the previous
slide with those coloured plots, that you can get peaks and troughs
at different frequencies on either side and the two things can
cancel out or they can reinforce. It is a close-run thing. It
could be up by 10dB or it could be down which is precisely what
we see all the time.
19647. I have some knowledge of statistics as
a student and in use of my daily life, but in politics we are
used to using statistics dishonestly! Except for me because I
am always truthful but we have seen what can be done with statistics.
My point is still that even if it is this plus or minus 10dB for
each of these factors
(Dr Hunt) I am not saying
that it is.
19648. Even if it was, the probability of them
all pointing in one direction is two times, two times, two times,
two times two, or something.
(Dr Hunt) I am not saying
that the models are out by 25dB, I am saying that plus and minus
10dB is a good working figure.
19649. My final point is are you saying that
there is no objective data anywhere about the vibrations from
trains in tunnels affecting ground buildings?
(Dr Hunt) I am saying there
is no objective statistical data of the types you are quite rightly
suggesting there should be showing predictions made over a large
number of sites by a large number of different people and collecting
the data, adding it together, and saying, "All right, this
is the mean area and these are the outliers", no, not available,
very sadly not available.
19650. Mr Newberry: Dr Hunt, what I would
like to do is on your summary of uncertainties if you are able
to indicate under the various headings which you have identified
what it is specifically about the Findwave model in terms of assumptions
and shoehorning et cetera, what it is that concerns you about
the assertion that the model can be accurate to plus or minus
5dB?[28]
(Dr Hunt) In model assumptions,
for instance underneath the railway we have a rail pad, this is
the pad which goes underneath the rail it is made of rubber. It
has got core particles in it or whatever, it might even have dimples
on it, and it is well known by the manufacturers of these pads
that these are non-linear. In fact, they are designed to be non-linear
in many cases. The manufacturers themselves cannot provide dynamic
data for these rail pads which is particularly accurate. Then
if we are talking about ballast mats or under-sleeper pads, this
is an under-sleeper pad (indicating) again this is a material
which has non-linear characteristics. I would wonder whether the
non-linearity has been included properly. Ballast itself is a
highly non-linear material. There are various models which have
been used to predict the performance of ballast. The ones I have
seen published are okay, but in my opinion I think they miss the
point. The way in which the ballast functions is different from
the way in which it supposed to function and that is a subject
for discussion.
19651. Pausing there, Dr Hunt, this is under
the subject of linearity, and you have looked at various materials
which are not linear.
(Dr Hunt) They are not linear,
and the soil itself underneath the tunnel is not linear.
19652. Is the Findwave model a linear model?
(Dr Hunt) It is not a linear
model. I know that. It has, for instance, frequency-varying damping
and frequency-varying loss factorin fact, the very use
of the loss factor itself is a non-linear model. So it is not
a linear model, but without looking in detail at the model I cannot
say what is inside it but I would be extremely surprised if it
did include the kinds of non-linearities I am talking about because
the data that is being put into the model, as far as has been
published, does not make any reference to the non-linearities
of which I speak.
19653. Given that you are dealing with a model
that is non-linear and dealing with linear properties, what does
that lead you to conclude in terms of the accuracy, in this context,
of plus or minus 5dB?
(Dr Hunt) I suppose I should
say that one of the biggest sources of error here is in modelling
of damping, and that I notice that the Findwave model uses a frequency-varying
loss factor and that it uses the same loss factor both for compression
modulus as it does for shear modulus.
19654. What does that mean?
(Dr Hunt) That means that
waves that travel through the ground travel as pressure waves.
For instance, if I clap my hands, that wave travels through the
air as a pressure pulse but solid materials can resist shear.
So that if I were to push sideways on a block of rubber it will
resist motion, whereas if I push sideways on water and air, water
and air then these will not resist the pressure. Within soil and
within materials generally you can have two types of waves that
travel, a compression wave and a shear wave. In soils, it is pretty
well recognised that the way in which pressure waves travel through
the soil, the damping associated with it does not have a sliding
motion, any sideways motion, of the grains of soil, whereas with
shear motion there is a sliding motion. So it is perfectly reasonable
to assume that the loss factor in shear will be greater than the
loss factor in compression. I am pretty certain that that will
be the case, but in order to make those measurements and distinguish
between the two requires some pretty careful measurements, but
I know that Findwave does not distinguish between the two. It
may turn out that there is no need to distinguish between the
two, but I have not seen any evidence published or otherwise to
say that it is not important. In my view it is important to distinguish
between the two, and not distinguishing between the two will lead
to errors of, again, perhaps 5dB or more.
19655. So that is looking at soil.
(Dr Hunt) That is looking
at soil.
19656. There is an uncertainty because of those
two factors you have identified, and your perusal of the model
has not explained either at all or satisfactorily why one option
as opposed to another is being pursued.
(Dr Hunt) That is right.
It is interesting, perhaps, to say that a change of loss factor
by going from (if I get this from my memory right) 0.1 to 0.15
will give you a 6 or 9dB difference in vibration level at the
surface of the ground for a 15-metre deep tunnel. That 9dB is
already more than the 5dB that we are hoping to get from Findwave.
That change in loss factor from 0.1 to 0.15 is completely within
tolerance that is used by Findwave and which is expressed in the
data that we have had in front of us.
19657. What is the next factor?
(Dr Hunt) I do not want
to say anything about the computer model. The Shoe-horning: I
suspect that for London it is probably not too bad; we have got
fairly level horizontal layers, we have got a fairly stable water
table; the tunnel is fairly level. I say "fairly", there
are always going to be small effectsthe tunnel is ever
so slightly curved. What kind of influence will that have? If
it is half a dB then we need to know that. The tunnel is not going
to be perfectly, perfectly uniform in wall thickness and the ribs
in the tunnel and detail of tunnel constructionwe have
to make assumptions there. I think we have to bear in mind that
the area around Soho is full of other tunnels, other foundations
and other buildings with piles, and so forth. Do they need to
be included in the model, or do we make assumptions about the
error associated with ignoring them? The error associated with
ignoring these things, which is the shoe-horning effect, for 5dB
is a perfectly sensible and, perhaps, rather conservative estimate.
Without going through the statistical process who knows?
19658. Could I ask you, finally, just on the
question of validation, is Findwave, so far as you can see, a
properly validated model such that one can predict accuracy to
5dB?
(Dr Hunt) I do not believe
there is any model which is sufficiently validated to achieve
that kind of accuracy, so the answer is, in my opinion, no. My
own model is also not validated to that kind of accuracy.
19659. You have appeared at conferences, of
course you have, and I think you have also appeared at conferences
with the Promoters' noise witness. Is that right?
(Dr Hunt) Yes.
18 Committee Ref: A221, Model Assumptions (WESTCC-9305A-005). Back
19
Committee Ref: A221, Computer Model Correctness and Convergence
(WESTCC-9305A-006). Back
20
Committee Ref: A221, Shoe-horning (WESTCC-9305A-007). Back
21
Committee Ref: A221, Data Gathering (WESTCC-9305A-008). Back
22
Committee Ref: A221, Excitation (WESTCC-9305A-009). Back
23
Committee Ref: A221, Measurement Point (WESTCC-9305A-010). Back
24
Committee Ref: A221, Validation (WESTCC-9305A-011). Back
25
Committee Ref: A221, Findwave Validation Graphs (WESTCC-9305A-012). Back
26
Committee Ref: A221, Findwave Validation Graph (WESTCC-9305A-013). Back
27
Committee Ref: A221, Summary of Uncertainties (WESTCC-9305A-014). Back
28
Committee Ref: A221, Summary of Uncertainties (WESTCC-9305A-014). Back
|