Fermat’s principles of least time:light travels between two points alone the path that requires the least time, as compared to other nearby paths.
Snell’s Law:
n1sin(θ1)=n2sin(θ2)
The refraction index of a medium n is defined as n(r)=C/v, where C is the speed of light in vacuum and v is the speed of light in the new medium. Then, speed of light in a medium is equal to v=C/n(r), where n(x,r,z)=n(r) can be a continuous function. The time required to travel a distance ds at speed v is given by:
dt=speeddistance=vds
The total time required to travel from a to b is given by:
t=∫dt=t∫C/n(r)d=C1∫n(r)ds
💡
Fermat’s principles of least time:light travels between two points alone the path that requires the least time, as compared to other nearby paths.
One question raises from Fermat’s principle of least time is: howdo light rays know which path is closest ?The equation is answered by a method called the variation of calculus.
3.2: The Calculus of Variation
From the above section, we know that the time required to travel from points A to B is given by:
t=C1∫n(r)ds
In the integral, ds can be expressed in terms of dx and dy, giving us ds=dx2+dy2. The refraction index n(r)=n(x) because we are integrating alone that x coordinate. Then, the integral becomes:
The picture explains the integral D(f), the distance from A to B alone f, where x coordinates varies from x1 to x2. Because we are integrating alone x coordinates (we use x coordinates to define/constrain the path), n(r)=n(x)
In this case, the integrand depends both on y’ and dx, so the integral is rewritten as:
t=C1∫n(x,y)1+y′2dx=∫F(x,y(x),y′(x))dx
where
C1n(x,y)1+y′2=F(x,y(x),y′(x))
Example: Light Path Between Two Points in Glass
Consider first a very simple special case. Suppose the index of refraction n is constant throughout a sheet of glass, and that the endpoint of a light ray at x=x0, y=y0 is directly across the sheet from the beginning point at x=0, y=y0. Then the time to penetrate the sheet is what?
3.21: Euler’s Method Derivation Part 1
More generally, Euler and Lagrange considered some arbitrary integrals in the form:
I=F(x,y(x),y′(x))
and wanted to solve paths y(x) that maximum and minimized. In other words, they wanted to find the stationary points where I′=0.
Suppose we have a function f(x1,x2,⋯)=f(xi) of independent variables i=1,2,3,⋯,N, and we are asked to find the stationary points. At stationary points, f’(xi)=0, so f(xi) must be a constant. (This might sounds confusing at the first point. Consider functions f(xi)=3xandf(xi)=3, their derivatives f’(xi)=3=0andf’(xi)=0f(xi) must be a constant. Thus, f(xi) must be a constant for f’(xi)=0). We represent a horizontal shift in f(xi) as f(xi+δxi). When reaching a stationary point, the function is a constant, so f(xi)≈f(xi+δxi), where δxi→0. Using Taylor expansion, f(xi+δxi) is equal to:
In order to the constant for f(xi+δxi)=f(xi), all other terms need to be 0. This is only possible if ∂f/∂xj=0. Thus, when y(x) reaches a stationary point,
∂xj∂f=0withj=1,2,⋯,N
3.22: Euler’s Equation Derivation Part 2
In order to find such paths, we introduce some new functions ϵ and η. Suppose yˉ(x) is a path for stationary points and y(x) is a path nearby. The path yˉ(x) can be expressed using y(x) plus a small difference.
y(x)=yˉ(x)+ϵη
At stationary points, I’=0. Applying chain rule for multivariable functions, we integrate I with respect to ϵ, giving us:
This might be nuance, but we are getting used to y’=dy’dt. Note that in Euler’s equation, y’=dy/dx.
Example: The Straight Wire
Minimize the equation
s=∫ds=∫dx2+dy2=∫1+y′2dx
Solution
In the problem F=1+y′2. Because the integrand is independent of y(x), ∂F/∂y(x)=0. The Euler’s equation becomes:
0=dxd(∂y′(x)∂F)
Then
∂y′(x)∂F=21+y′22y′=1+y′2y′=A constant
The shortest distance on a plane between two points is a straight line (!),
3.3: Geodesics
One application of the calculus of variation is to find geodesics, which are the stationary (usually shortest) paths between two points on a given surface. Before diving into details, we will talk about cylindrical and spherical coordinates in Cartesian and polar forms.
3.31: Coordinates
Cylindrical Coordinates
Cartesin(rcosθ,rsinθ,z)
In polar form
Polars=(rθ,z)
Polards=(rdθ,z)
Polards=r2dθ2+z2
Spherical coordinates
By pythagorean theorem, the length of the red line is given by:
s=x2+y2+z2
where:
x=rsinθcosϕy=rsinθsinϕz=rcosϕ
For spherical coordinates, we divide a sphere into infinitesimal number of cubes.
s=rdθ
Example: Geodesics on a Sphere
Consider the problem of finding the shortest distance between two points on the surface of a sphere. The sphere has a radius of R. Find the geodesics on a sphere
Solution
There are two methods for representing the coordinates on the surface of a sphere. Both approaches give rise to the correct/same integral. The first approach is to use spherical coordinates, giving us:
x=Rcosθsinϕ,y=Rsinθsinϕ,z=Rcosϕ
Because we want to find ds, we differentiate x,y and z to find dx,dy, and dz using the chain rule for multivariable functions.
From the equation, we can see that F=F(θ,ϕ′,ϕ)=R1+ϕ′2sin2θ. Using Euler’s equation, we know that:
0=∂ϕ∂F−dθd(∂ϕ′∂F)
Because F is independent of θ, ∂F/∂ϕ=0, giving us
0=dθd(∂ϕ′∂F)
Then, ∂F/∂θ’=A constant=k. Differentiating F with respect to ϕ’, we get:
k=1+ϕ′2sin2θ1sin2θ
Solving for ϕ’
Using separable integration, ϕ is equal to:
ϕ=αsin−1q+β
3.4: Brachistochrone
3.41: A Cycloid
The section is not included in the textbook, but it will be beneficial to know some math on cycloids for our understanding of the brachistochrone problem.
Cycloids
Cups:
A graph of a cycloid.
In polar forms, a cycloid has equations of:
x=r(θ−sinθ)y=r(1−cosθ)
3.4.2: Time and Paths Taken for a Brachistochrone
The problem is to find the shape of a track between two given points, such that a small ball starting at rest at the upper point-and sliding without friction alone the track under the influence of gravity-arrives at the lower point in the shortest time.
Brachistochrone: The path between two points A and B which minimizes the time taking by a particle falling from A to B under the influence of gravity.
t=∫vds
Let point A be the origin of the coordinate system we define. Because the particle is dropped, it has an initial velocity and kinetic energy of zero. As the point is zero, gravitational potential energy is also zero. Thus, the initial, which is equal to the total, energy is zero. By energy conservation, we get:
E=21mv2−mgy=0
The distance ds is given by:
ds=dx2+dy2=1+y′2dx
Then, time t is equal to:
t=∫2gy1+y′2dxort=∫2gy1+x′2dy
Using Euler’s equation, t=t(y,x′,x) and F()=(1+x′2)/(2gy). Then,
0=∂x∂F−dyd∂x′∂F
Because F is independent of x, ∂F/∂x=0, giving us:
∂x′∂F=2gy11+x′2x′=k
Solving for x′, we get:
x′=±1−2k2gy2gk2y=±1/2gk2−yy≡a−yy
We choose the plus square root. Integrating over y, we get:
x=∫dx=∫a−yydy
Note that we integrate with respect to y because x′=x′(y) is a function of y, where y is the dependent variable. The integral is evaluated using substitution. Let y=asin2(θ/2)=a/2(1−cosθ), the integral is equal to:
x=2a(θ−sinθ)
Differentiating x with respect to θ, we get
x′=2a(1−cosθ)
Expressing dy in term of θ, we get:
dθdy=2a(sinθ)
dy=2a(sinθ)dθ
Using separable integration, y is equal to:
y=2a(1−cosθ)
Notice that both x and y represent equations for a cycloid, the curve traced by a point on a circle as it rolls alone a straight line without slipping. The quantity a and final angle θb can be determined from coordinates (xb,yb) of the final position.
A graph of a cycloid. Note that top points of a cycloid represents a half of a complete revolution, where θ=π. In order to see this, imaging a point on ground initially, when the points is at top, the circle has completed one half of a revolution, so θ=π.
Substitute x or y into (6), the time t required to fall to the final position is:
Notice that if (xb,yb)=(aπ/2,a), time t is equal to:
t=π2ga
Example: Path Traveled by Lights in Atmosphere
We return to where we began the chapter, with Fermat’s principle of stationary time, illustrated in Figure 3.9(a). Bringing to bear the calculus of variations, we can now find the path of a light ray in a medium like earth’s atmosphere, where the index of refraction n is a continuous function of position. If a ray of light from a star descends through the atmosphere, it encounters an increasing density and an increasing index of refraction. We might therefore expect the ray to bend continuously, entering the atmosphere at some angle θaand reaching the ground at a steeper angle θb. For simplicity, take the earth to be essentially flat over the horizontal range of the ray and assume the index of refraction n=n(y) only, where yis the vertical direction. The light travel time is then equal to what?
Solution
Recall that refraction index of a medium n(r)=c/v. Then, velocity of light in a medium is equal to v=c/n(r), and the infinitesimal distance ds=1+x′2dy. Time t is equal to:
t=∫vds=c1∫n(y)1+x′2dy
In order to minimized the integral, we use Euler’s equation, where F=F(y,x′,x)=n(y)1+y′2. Because F is independent of x, we get:
k=∂x′∂F=n(y)1+x′2x′
The expression can be simplified using trigonometry. We know that x′=dx/dy=tanθ, and 1+tanθ=secθ.(Note that dx=dx/dy. This is obtained when calculating ds=dx2+dy2, where we divide the expression under the square root by dy2 to get ds=1+(dx/dy)2dy=1+(x′)2dy) Then, k is equal to:
k=n(x)secθtanθ=cosθsinθcosθ=n(y)sin(θ)
Doesn’t this looks similar? We have derived Snell’s law!
k=n(y)sinθ
3.5 Several Dependent Variables
∂yi(x)∂F−dxd(∂yi′(x)∂F)=0
Example: Geodesics in Three Dimension
Using Euler’s equation for several dependent variable, find geodesics in three dimensions.
Solution
Distance ds is equal to:
ds=dx2+dy2+dx2=1+(dxdy)2+(dxdz)2=1+y′2+z′2
Let F=F(x,y’,y,z’,z). We choose x to be the independent variable and y and z as dependent variables. Using Euler’s equation, we get:
∂y∂F−dxd∂y′∂F=0∂z∂F−dxd∂z′∂F=0
Because F is independent of y and z, equations are reduced to:
dxd∂y′∂F=0dxd∂z′∂F=0
Differentiating F with respect of x’ and y’, we get:
k1=1+y′2+z′2y′,k2=1+y′2+z′2z′
where k1 and k2 are constants. The equation can be coupled by taking the sum of the squares of the two equations to show that the denominator of each equation or constant, or equivalently y’2+z’2=k, where k is a constnat.
The minimum path has a constant and positive slope in both and x−y and x−z planets, showing a stright line.
3.6: Mechanics From a Variation Principle
Recall from relativity that proper time (the reading in the frame moving with the clock) is equal to τ=Δtγ, where Δt is the reading in another frame (For instance, earth’s frame).
Δt=γτ
We want to find a general formulation of mechanics that is based entirely on a variational principle. Consider a relativistic particle, the proper time in the particle’s frame is τ, and the reading in earth’s frame is Δt.
I=∫dτ=∫γdt
Using space-time four dimensional coordinates (c,x,y,z), we know that
I=∫dτ=∫γdt=∫dt1−cx˙−cy˙−cz˙
Using variation of calculus for several calculus we derived in section 3.6, we get:
Because the F is independent of x,y and z, expression becomes:
dtd∂x′∂F=0,dtd∂y′∂F=0,dtd∂z′∂F=0
dtd(γx˙)=0,dtd(γy˙)=0,dtd(γz˙)=0
Will come back later after finishing chapter 2
3.7: Motion in a Uniform Field
3.7.1: Doppler Effect
Dropper effect: the change in frequency of a wave when a source and an observer are moving relativity to each other. Frequency increases when the observer and the source approach each other and decreases when moving away.
Relativistic Dropper Shift
For a source directly approaching or moving away from an observer, the doppler effect gives:
fob=v±vemvfem,fob=fem1±vwavevem1
fob=fem1−v/c1+v/c
Redshift: As an object moves away from us, the sound or light waves emitted the source are stretched out, making them have a lower pitch and move towards the red end of the electromagnetic spectrum (which has a longer wavelength and a smaller frequency). The lights from the object is known as redshift.
Blueshift: As an object moves toward from us, the sound or light waves emitted the source shrink out, making them have a higher pitch and move towards the blue end of the electromagnetic spectrum (which has a shorted wavelength and a larger frequency). The lights from the object is known as blueshift.
3.7.2: Principle of Equivalence
Consider two spaceships. One accelerating uniformly at a in gravity-free empty space and one standing at rest in a uniform gravity field. Let acceleration a of the first ship is adjusted to be equal, but opposite in direction, to the gravitational field g on the second ship.
An observer in the bow of the accelerating ship shies a laser beam at another observer in the stern of the ship. The ship is initially at rest and the laser emits monochromatic light of frequency fem in the laser’s rest frame. We assume that the distance traveled by the ship is very small while the bean is traveling (the assumption can be understood by considering the ship’s and the laser’s speed. The last has a speed of light C, which we assume, is largely greater than the ship’s speed. Thus, the distance traveled by the ship is negligible compare to the distance traveled by the laser).
Let the length of the ship be h. The time it takes for beam to reach the stern is t=h/c. During the time, the stern attains a velocity v=at=ah/c.
We can gain several insights from the equation. First, the frequency at the bottom (stern) is increased by a factor of gh/c2. In a time of t=1s, the bow emits t/period=tfem wavelengths of light. However, because fob>fem, observers at the bottom must collect theses waves in less than 1 second according to their own clocks (number of wavelengths of light remains unchanged for observers at the bottom and at the top, frequency fob>fem, so time decreases).
Becase tlow=tob<tem=thigh, it is natural for observers at the top to conclude that bottom clocks run slower than top clocks (as their clocks have a smaller reading). The must be true for all clocks at the top and bottom. Rearranging the equation, we get
Δthigh=Δtlow(1+gy/c2)
The equation shows the time difference for two clocks at rest, but at different altitudes in a uniform gravitational field.
Clock C travels with a nonzero speed by an arbitrary path between stationary ground ground clocks A and B. All clocks are synchronized initially. Clock B reads tf when C arrives, and the reading of clock C depends on the path it takes in moving from A to B.
Recall that the proper time for time dilation is:
τ=Δt1−c2v2
where v is the velocity of the moving object relative to the outside observer. Because clock C moves with a non-zero speed, by time dilation, we know that the reading of C is different from the reading of B. In an infinitesimal time dt according to the top clock, clock C advances by the proper time:
dτ=dt(1+gy/c2)1−v2/c2≈dt(1+gy/c2−v2/2c2)
τf=∫0tfdt(1+gy/c2−v2/2c2)
3.7.3: The Hafele–Keating Experiments
Time for eastward clock
Westward clock
Explanation
The opposite sign of the two results suggests that the rate at which time flows depends on the motion of the observer. The eastward clock is moving in the same direction as earth’s rotation, so its velocity relative to earth is greater than the westward clock. Thus, a greater time dilation effect. The westward clock is moving in the opposite direction as the earth’s rotation, so it has a smaller relative velocity and time dilation effect.
Previously we have derived that clock C advances by the proper time
τf=∫0tfdt(1+gy/c2−v2/2c2)
We can rewrite the integral in the form of energy, giving us:
We know that 1/2mv2=K is kinetic energy and mgy=U is potential energy. Then, the integral becomes:
τf=tf−∫0tf(K−U)dt
In order to find a path that extremeizes (We use the term “extremeize” because it is uncertain if the proper time τ is maximized or minimized. Euler’s equation only provides a path (a function) that maximizes or minimizes a function) the proper time τ, we apply Euler’s equation to the integral I, giving us:
I=∫0tf(K−U)dt
Let F=F(x˙,x,y˙,y,z˙,z) be the integrand, we get:
F=F(x˙,x,y˙,y,z˙,z)=21m(x˙2+y˙2+z˙2)−mgy
Since F is independent of x and z, we get:
0=dtd∂x′∂F,0=mg−dtd∂y′∂F,0=dtd∂z′∂F
giving us:
0=x˙x¨,0=g−y˙y¨,0=z˙z¨
Then,
0=x¨,y˙=g,0=z¨
When x¨,z¨=0, we recognize these differential equations as results of Newton’s equation of motion F=ma in a uniform gravitational field.
x=vxt,y=21gt2,z=0
Our goal in identifying a variational principle for the motion of a particle in a uniform gravitational field is successful. Without every using Newton’s law, we found the correct equations of motion. In the next section, we will derive F=−∇U using variational calculus.