Loading [MathJax]/jax/output/SVG/jax.js

Monthly Archive: May 2015

Mass-Varying Systems

Last week, the problem of the conveyor belt accumulating some material at a steady rate r was examined. The motor driving the belt kept it moving at a constant velocity. The power expended by the motor was double the power needed to accelerate the material when it hits the belt. The natural question is, why is it exactly double.

This question provoked a lot of discussion in ‘The Physics Teacher’, starting with an article by Wu, which tried to give a straightforward answer to why the value is exactly 2 and also where did the other half of the energy go. For the most part, the argument by Wu, while physically intuitive, is lacking in logical rigor.

The best argument was given by Marcelo Alonso who starts with the varying-mass form of Newton’s law

m˙v+˙m(vu)=Fext.

In this equation, m is the system mass, which, in the case of the conveyor problem, is the mass of the belt and the material on it at any given time. The vector v is the velocity of the system. The quantities ˙m=r and u are the rate of change of the system mass (more on this later) and the velocity of the mass that either enters of leaves the system, respectively. Justification for this equation will be given below.

The power imparted by the external force Fext is

P=Fextv=m˙vv+˙m(vvuv).

For the conveyor belt problem, the material enters with a very small velocity u that is perpendicular to the system velocity v so the last term vanishes. Since the motor keeps v constant it can be moved into and out of any total time derivative. The motor power is now

P=˙m(vv)=ddt(mvv)=ddt(2KE).

Viewed in this light, the factor of two is not so much mysterious as accidental. It is a consequence of the special circumstances that uv is zero and ˙v=0. If the material were introduced to the belt in some other fashion, for example, thrown horizontally on the belt, the ratio of the power provided to the time rate of change of the materials kinetic energy would have been different and its value wouldn’t attract special attention.

All told this is a mostly satisfactory explanation. The only problem is that the terse response Alonso provided in the form of a letter to the editor didn’t give enough details on the varying-mass form of Newton’s law was derived.

Surprisingly, these details are difficult to find in most textbooks. Modern presentations don’t seem to care too much about varying mass situations although they are important in many applications. Systems in motion that suffer a mass change are almost everywhere in modern life. Examples range from cars that consume gasoline as they move, to ice accumulation on or water evaporation from industrial surfaces, to rocket motion and jet propulsion.

So in the interest of completeness, I derive the varying-mass form of Newton’s equations here.

There are two cases to consider: the system mass increases as material is accreted and the system mass decreases as material is ejected. The specific mechanisms of accretion or ejection are not important for what follows.

I will start with the accreting case just before a bit of matter δm is introduced into the system mass. The initial momentum of the system and lump is

pi=mv+δmu.

Just after this small lump has combined with the system, the final momentum is

pf=(m+δm)(v+δv).

The change in momentum is the difference between these two quantities

Δp=pfpi=(m+δm)(v+δv)mvδmu,

which simplifies to

Δp=mδv+δmvδmu.

This change is momentum is caused by the impulse due to the external force Fext

Δp=Fextδt

acting over the short time δt.

Equating the terms gives

mδv+δmvδmu=Fextδt,

which becomes

mδvδt+δmδt(vu)=Fext

when dividing by the small time δt. Before this equation is taken to the limit and derivatives materialize out of deltas, let’s look at the ejection case.

For ejected mass, the system starts with the momentum

pi=mv.

After the lump with mass δm is ejected with velocity u, the final momentum takes the form

pf=(mδm)(v+δv)+δmu.

Following the same process above, the analogous equation

mδvδtδmδt(vu)=Fext

results.

The two equations look similar with only a small change in sign in front of the δmδt to hint at the physical difference of mass inflowing in the first and outflowing in the second.

Sign differences can always be problematic and more so when it comes to ‘in’ versus ‘out’. Confusion can easily insert itself in such cases. As an example where careful track of sign conventions is important, consider the presentation of mass-varying systems in Halliday and Resnick. They present the mass ejection case as a preliminary to a discussion of rocket propulsion but they arrive at an equation with the opposite sign from the one derived here. They get this difference by explicitly setting the sign of dm/dt opposite to that of δm/δt.

A small change in one’s frame of mind and a careful attention to the difference between δm/δt and dm/dt is all that is needed to reconcile these differences. To go from the simple ratios of deltas to actual derivatives, note that by the initial construction δm was a positive quantity, a small bit of matter. The time increment δt is also positive. So the ratio of these two quantities is also positive. But the rate of change of the system mass m can be either positive or negative. Despite the notational similarity between δm/δt and dm/dt, the mass m being addressed is not the same. The mass in the derivative expression is the system mass which can gain or lose mass and so dm can be positive or negative.

For the accreted mass case, dm/dt=r>0 and the appropriate limit is

mdvdt+dmdt(vu)=Fext,

or, with an eye towards combining the two cases,

mdvdt+|dmdt|(vu)=Fext.

For the ejected mass case, dm/dt=r<0 and the appropriate limit is mdvdt+|dmdt|(vu)=Fext, with the sign on the second term now changing due to the inclusion of the absolute value.

The Conveyor Belt

The conservation of energy, which is so innocently applied in elementary applications, can be actually quite complicated when the system in question has mass flowing into or out of it. One of the most popular textbook questions for provoking thought on this topic is the conveyor belt.

Two well-known texts, ‘Mechanics’ 3rd ed., by Symon and ‘Physics’ by Halliday and Resnick, cover the conveyor belt problem, although in slightly different ways. Both start from a common setup of a belt moving at a constant velocity v onto which mass is dropped at a rate r=dmdt from a hopper above. The motor powering the conveyor applies a varying force F so that the constant velocity is maintained as the material mass m(t) grows. For completeness, the mass of the belt is assigned a value M. Since the problem is one-dimensional, the explicit vector notation will be dropped in what follows.

The derivation starts with the mechanical momentum of the belt, defined as
p(t)=[m(t)+M]v.

The time-varying motor force needed to maintain the constant velocity has to match the change in the momentum and thus
F(t)=dpdt=dm(t)dtv=rv.

The power supplied by the motor is

P=F(t)v=v2dm(t)dt=v2r,

which can be manipulated into a more familiar form since both v and M are constant, to yield

P=ddt(mv2)=ddt[(m+M)v2]=2ddt[KEsys].

In words, the motor supplies power that is twice is large as the change in the kinetic energy of the system. Halliday and Resnick also derive this two-to-one relationship. Both texts then ask where the excess half of the power going?

Of course, these authors expect that the student would infer where the energy has gone. His argument would start with the notion that the law of conservation of energy is cherished and believed to be correct in all cases. If half the supplied energy is easily found in the form of kinetic energy of the belt plus material the rest must be ‘hidden’ in a less obvious place. This missing other half must have been converted into some other form. Continuing on in this vein, the student would then conclude that a force is needed to accelerate each bit of mass introduced onto the belt and that that force can only be due to friction between the belt and the material so introduced. Once friction is introduced, it is a short and easy step to conclude that the extra energy is converted into the internal energy of the belt or the material.

Since this is the obvious path for such a venerable problem, it is hard to believe that there is any controversy surrounding this conclusion. But as Mu-Shiang Wu points out in ‘Note on a Conveyor-belt problem’, The Physics Teacher (TPT) 23, 220 (1986), the student is often puzzled why the missing energy happens to be half of the supplied power. By his presentation in that article, Wu also implies that the student is not satisfied with the broad and general conclusion that missing half is dumped into heat, he also wants to see the explicit mechanism. Wu’s argument to explain this mechanism goes something like this.

Follow a bit of mass δm as it falls from the hopper. Relative to an observer riding along the belt, this chunk is moving with velocity v. Since this bit of mass must come to rest with respect to the belt over some period of time δt there must a be a frictional force Ff that arrests the mass’s leftward motion. Wu posits the form of this frictional force to be

Ff=μN=μδmg,

where μ is the coefficient of kinetic (or sliding) friction, and N is the normal force supplied by the belt. This kinetic friction force results in a constant acceleration a=μg. Using the standard the kinematic relations for constant acceleration

xf=xi+viδt+1/2aδt2

and

vf=vi+aδt

and eliminating the time δt it takes for the friction force to bring the chunk of mass to a stop (i.e. vf=0) leads to the total distance traveled as

D=v22a.

The work done by the frictional force is then

δW=FfD=12δmv2,

and the power exerted is

Pf=δWδt=12δmδtv2=ddt(12mv2).

At this point, Wu stops with the statement

“It turns out that regardless of whether we assume 1 sec or 1/100 sec for the acceleration time, the thermal power developed by frictional forces between the belt and the sand is always exactly half of the supplied power.”

Overall, I am suspicious of Wu’s argument. It has the attractive feature of having a explicit mechanism for the force that brings the material to rest on the conveyor belt but the use of the co-moving frame is done using a bit of a cheat. To demonstrate the roots of my suspicion, let me modify Wu’s argument starting just after the use of the constant acceleration kinematic equations. The total displacement (not distance) traveled is

D=(xfxi)=v22a.

The work done by the frictional force is then (note the sign)

δW=xfxiFfdx=Ff(xfx0)=12δmv2.

Thus the chunk of mass loses energy in this frame and the power loss is

δWδt=12δmδtv2

or (taking the limit in the usual casual physicist style)

Pf=ddt(12mv2).

The power lost by the object is exactly one half of the total power supplied by the belt. This loss is assumed to go into heat so the energy balance is satisfied in a hand-waving way but there is this pesky problem associated with the two different frames. So I don’t think the puzzle is satisfied.

In the intervening years (1987-1990) after Wu’s initial article was published a number of other author’s published notes, critiques, and alternative ways of thinking about the conveyor belt problem. Judging by the different points-of-view expressed, the original unanswered question by Symon and Halliday and Resnick seems to have assumed a manifest truth that is not as obvious once one digs in as it is on the surface. Almost none of the arguments I’ve read in TPT have swayed me except for a letter by Marcel Alonso in response to Wu’s original article.

Next week I’ll cover Alonso’s argument and some details about variable mass systems.

A Flux Transport Example

One of the more confusing things when I was learning vector calculus \& classical field theory was the derivation of the flux transport theorem. The theorem relates the change in flux Φ due to a vector field F through a moving surface S to certain integrals involving the time rate of change and the divergence of the field over the surface along with a contour integral around the surface’s boundary. Using the notation of ‘Introduction to Vector Analysis, 4th ed.’ by Davis and Snider, the flux transport theorem is given by:

dΦdt=SdS[Ft+(F)v]+SdF×v,

where dS is the outward normal to a differential portion of surface area, v is the velocity at each point on the surface (recall it is moving), and d is a differential line element along the boundary of the surface S.

Davis and Snider have some nice homework problems but I wanted an example where the surface changed size as well as moved in space. The example I concocted is one with a rectangular shaped surface whose four vertices are given by:
A[t,t,1],
B[t,t,1],
C[t,t,1],
and
B[t,t,1].
Note that I represent () the coordinates of the points by row arrays simply for typographical convenience, although whether they are rows or columns doesn’t matter.

The surface, which moves uniformly in the x-direction, is canted 45 degrees with respect to the y-z plane and has a time varying area of 4t. The surface is parametrized by two parameters u and v ranging from 0 to 1 such that any point on the surface (including the boundary) is given by

R(u,v)[t,(2u1)t,2v1]u,v[0,1].

In this parameterization, the outward normal to a differential patch is

dS=Ru×Rvdudv[4t,0,0]dudv,

from which we immediately get the total area as

Area(S)=(u,v)|dS|=10du10dv4t=4t

as expected.

The form of the vector field is

F(r)[xy2,yz2zx2]

where r[x,y,z].

Since the form of the surface and the vector field are fairly simple, the flux through the surface due to F

Φ[F,S]=SF(R(u,v))dS

can be explicitly computed as
Φ[F,S]=10du10dv4t4(2u1)2=4t43.

The time derivative is then easily obtained as

dΦ[F,S]dt=16t33.

To use the flux transport theorem, we need to compute div(F) and Ft and then evaluate these terms across the expanse of the surface S. Likewise we also need to compute F×v, where v is the velocity of the surface and then evaluate this terms along the its boundary.

The divergence of the field is given by

div(F)=x2+y2+z2,

which, in the uv parametrization becomes

div(F)=t2+(2u1)2t2+(2v1)2.

The time derivative of the field is

Ft=0

since the vector field F has no explicit time dependence. All of the time rate of change is due to the surface moving within the field.

The surface integral in the flux transport equation, formally given by

I0=SdS[div(F(R(u,v)))v+Ft(R(u,v))]

in the uv parametrization, evaluates to

I0=16t33+4t3.

The velocity of any point on the surface is obtained by taking a time derivative with respect to R

v=R(u,v)t[1,2u1,0].

The integrand of the line integral is

F×v[zx2(2u1),zx2,xy2(2u1)yz2],

which, in the uv parametrization, becomes

F×v[(2v1)(2u1)t2,(2v1)t2,(2u1)3t3(2u1)(2v1)2t].

There are four distinct lines or legs making up the boundary of the surface. These are given by

L1(u)=A+u(BA),
L2(v)=B+v(CB),
L3(u)=C+u(DC),

and

L4(v)=D+v(AD)

and there is a corresponding integral for each.

The evaluation of each is a bit tedious (particularly in taking care to make sure that u or v take on the appropriate value for the leg being traversed) but straightforward leading to
I1=2t310du=2t3,
I2=2t310dv2t10dv(2v1)2=2(3t3t)3,
I3=2t310du=2t3,
and

I4=2t310dv2t10dv(2v1)2=2t323t,
respectively.

The total line integral around the boundary is then given as the sum of these terms

Ic=43t.

Adding this result to the result from surface integral gives

Itot=16t33,

which is the same result as before.

While this example is a bit contrived, it does offer a simple combination of both movement and growth that seems absent within the literature. Furthermore, when worked in detail, each piece drives home the content of the flux transport theorem.