Lecture 19 - Atoms and Light (2/26/96)

Seeds: Chapters 5, 6

Interference and Interferometry
- In the previous lecture, I mentioned that the resolution of a telescope was due to the wave packets of the photons being cut off by the telescope. I also showed examples of separate telescopes linked together as interferometers which gave the resolving power of a single telescope the size of the distance between the telescopes. These two things are related, and can be understood by considering the interference of different parts of the light or radio wave corresponding to a photon.
- The resolution of a telescope is determined by the ability of the telescope to interfere the parts of the wavefront of light coming from an angle to localize the direction.
- As an analogy, the way you can tell what direction a wave is coming from at the shore of the beach is to notice the timing of arrival of the peaks and troughs of the wave as they hit the beach. If you could look over a long stretch of the shore, you could easily see small differences between angles of arrival, because the equivalent differences in arrival times at large separations on the beach are proportionally large. If you had only a small section of beach to check, it would be hard to notice even large differences in angle.
- A telescope can be thought of as a wave-catcher as well as a photon bucket. (Again, the wave-particle duality.) In fact, the whole purpose of the parabolic surface of the primary mirror (or the curved surfaces of the objective lens) is to arrange that the various portions of an incoming plane-parallel photon wavefront reach the focus at the same time!
- A wavefront perpendicular to the optical axis of the telescope is arranged to arrive at the focus in phase, that is, all parts of the wavefront at the same phase of the sinusoid arrive at the same time, and add together. In the focal plane, but off the focus, wavefronts at different angles are arranged to arrive in phase. This is how an image is made.
- It is useful to define an aperture plane as a plane perpendicular to the optical axis of the telescope (at the top of the telescope - the aperture that is - though any such plane will do). All parts of the wave arriving simultaneously at the aperture are added up at the focus. Thus, it is easy to see maximum brightness at the focus occurs for on-axis waves.
- Note that if we tilt the incoming wavefronts at a slight angle, the various parts no longer add up at the focus in phase, but slightly out of phase. In fact, if we tilt it far enough, the wave striking the aperture plane at the intersection with the optical axis are exactly 180 degrees out of phase with the part of the wave striking the aperture at the edge of the telescope at the same time, and thus cancel out at the focus. When you add up the contributions for all the parts of the aperture, you get a greatly reduced brightness as the different phases interfere. At this angle in fact, you can see that all points along this cut of the aperture are paired with another point 180 degrees out of phase, so we would expect zero brightness at the focus.
- You can use the small angle formula to see that this occurs when the diameter times the angle (in radians) equals one wavelength. This is just our formula for the resolution! (See the diagram below).
- Since we are dealing with plane waves from an infinitely distant astronomical source, symmetry shows us that for this special angle we would get zero signal for a square telescope aperture in the same direction. It turns out that for a circular aperture you need to tilt a little bit farther to get exactly zero signal (because there is more in the center than the edge) - a factor of about 1.22 to be precise. For our purposes, resolution = wavelength/diameter will do just fine.
- For angles less than the critical angle, the brightness is diminished but not zero. Thus, a star offset from an on-axis star by such a small angle will be blurred together. This is what we mean by "resolution".
- You notice that when adding up the wavefront, we do it on a point by point basis in the aperture plane. We can interfere bits of the aperture with each other by placing a mask at the top of the telescope, for example two small holes at opposite edges. Our interference calculations would show in this case that waves tilted by one wavelength at the separation between them would add up in phase with full brightness, and angles tilted by one half wavelength would add up out of phase with zero brightness, with a smooth sinusoidal gradation in between. In fact, our response pattern of angles on the sky would follow a sine function! The resolution of our little interferometer is given by the width between zero response (or between the maximum responses) which is just our prevouse angle = wavelength/separation.
- There is in fact no particular reason that we need the rest of the aperture in this case - we can make our interferometer out of separate telescopes as long as we can arrange to have the waves from each arrive at the "focus" simultaneously to interfere.
- Two telescopes (or radio antennas) separated by some distance D make an interferometer of baseline D. The "focus" where the waves are combined is called a correlator, and it can be separate from either telescope.
- A whole bunch of telescopes linked together to form an interferometer is called an interferometer array. We can see that such an array has the equivalent resolution of the longest baseline (separation) in the array. The angular response from a single baseline is a sinusoid on the sky (with angular wavelength of photon wavelength/D), but by making an array with more and more different baseline lengths and orientations we approach the single telescope response shape (which is a circularly symmeteric pattern for a circular aperture).
- Though the resolution is of a telescope with the diameter of the longest baseline, the light gathering power is merely the sum of the individual telescope areas. You only get the photons you collect.
- The Very Large Array (VLA), shown in the previous lecture, is an array of 27 telescopes, each 25 meters in diameter, spread over the New Mexico plains with baselines up to 36 km. The wave signals from each antenna are piped to the central correlator using waveguides (special metal pipes) buried under the desert.

Detectors (a brief intro)
After light is collected and focused by the telescope, some sort of detector is needed at the focus to turn the light into information. Generally speaking, you can do one of two things: detect photons, or detect waves.

In the optical part of the spectrum, wavelengths are very small (100 - 1000 nm) while photons are energetic, so photon detection is the preferred method. Photons can be detected by inducing a chemical reaction on a photographic plate (like film in your 35mm camera), or by knocking electrons around in a small silicon wafer that looks much like a computer chip called a charge-coupled device (CCD). There are CCD's in most video cameras nowadays.

With the development of CCD's, photographic plates are becoming more and more rare (except for special applications like wide-field sky surveys). The images made on CCD's are digital, and can be easily computer-processed and stored. All of the Hubble Space Telescope images were taken with CCDs.

Because radio waves are large (0.1-1 meter), but radio photons are weak, it is generally much easier to detect radio waves than photons. Radio receivers work by causing the electric field in the wave to induce a current in a wire, which is then amplified by a transistor just like your home radio or TV (though using special devices that are much more powerful because the signals are very weak). Because the wave is detected, these receivers are well suited for interferometry.

Some radio detectors absorb the very weak photons and turn them into heat energy in special wafers of silicon cooled to very nearly absolute zero temperature. These are known as bolometers and have the advantage that they can detect radiation over a wide spectrum, since all the photons are turned into heat regardless of wavelength.

If it is desired to examine the individual wavelengths of light, then a dispersive element such as a prism is placed in front of the detector. This makes what is called a spectrograph. In the radio wavelengths, narrow-band filters are used instead of dispersive elements (because it is hard to disperse long-wavelength waves in a small space) to block out the unwanted wavelengths.

The Atom
Atoms are made up of subatomic particles: electrons, protons, and neutrons. Electrons, light particles that carry negative charge, were discovered by J.J. Thomson in 1897, and Ernest Rutherford discovered in 1911 that the positive charge in the atom was concentrated in a tiny nucleus. Later, the nucleus was shown to contain positive protons, and charge neutral neutrons.

In Rutherford's model of the atom, the nucleus is surrounded by a cloud of electrons constantly in motion.

Protons and neutrons are the most massive subatomic particles, though even they are tiny. Both have about the same mass: 1.67 x 10^-24 g.

Electrons are much less massive, with a mass of about 1/1836 of the proton mass.

Atoms are mostly empty space. The nucleus itself contains most of the mass, but is only about 10^-6 nm ( 10^-15 m) in size! The extent of the electron cloud is about 0.1 - 0.5 nm.

The electric forces between the oppositely charged electrons and protons hold the atom together. The protons and neutrons in the nucleus are held together by the strong nuclear force, which can overcome the mutual repulsion of the like-charged protons.

Even though atoms are mostly empty space, two atoms don't easily pass through each other because of the electric forces of the electron clouds. The solidity of matter is purely the doing of electric forces. When you press your hand against a tabletop, it is actually the electron clouds in the atoms of your hand pushing electrically against the electron clouds in the atoms of the table.

Why don't all atoms just repel each other and fly apart, like a gas? If you bring two atoms close enough together, with the right configuration of electron clouds in each, then the two separate atoms can merge, sharing electron clouds - this is called a molecule.

Molecules can fit together into solids (and to a lesser extent liquids) by very very weak inter-molecular forces left over in the outermost electron clouds. The physics of the attractive forces between atoms and molecules is what makes chemistry work.

Coulomb Forces and Electron Orbits
The electric force, as specified by Charles Coulomb (1736-1806) is given by the inverse-square law: F = C q_1 q_2 / r^2 where q_1 and q_2 are two electric charges, r the distance between them and C a constant having to do with the units of charge (and distance).

Notice any similarity to something we already know? Indeed, the electric Coulomb force has the same dependence as the gravitational force of Newton, with the charges standing in for masses.

One difference with gravity is that electric charge can have both positive and negative sign, so the electric force is either attractive or repulsive. Opposite charges attract - like charges repel.

The gravitational force, on the other hand, is strictly attractive. There are no negative masses, and thus there is no gravitational repulsion (anti-gravity).

Since the electric and gravitational forces look the same mathematically, then we might expect them to behave similarly physically. Thus, we would expect that electrons would orbit the nucleus much like a planet orbits the Sun!

Just like it takes energy to send a spaceship out into a higher orbit, and that if you take some of the energy of motion away from a spaceship (say through friction with the atmosphere) it will drop to a lower orbit, then so also does the energy of the electron orbit rise with increasing distance from the nucleus.

If we carry on our analogy with planets, then we might expect that any orbit at all could be obtained for an electron, as long as you gave it the proper energy.

The energy of an electron can be changed by emitting or absorbing a photon. This is, as we mentioned before, how the electromagnetic force is transmitted, through the intermediation of photons. If a photon is absorbed by the electron, then the outgoing electron carries the energy and momentum of both the incoming electron and photon. If an electron emits a photon, then its energy and momentum are distributed between the outgoing electron and photon.

Thus an electron in an atom can absorb a photon and jump to a higher orbit, or emit a photon and fall to a lower orbit. The change in orbit is given by the energy of the photon.

There is one (at least one) conceptual problem with this. It was known for quite a while that an accelerating electric charge radiates electromagnetic radiation, that is, photons. This is how we use an oscillating electric current to generate radio waves. This is also why very hot things emit light - the moving atoms jiggle around the electrons which emit photons. In any event, we would expect the orbiting electrons, accelerated by the centripetal accleration of the electrical force, to cause them to radiate photons and thus lose energy and spiral in toward the nucleus. This was a big puzzle in 20th century physics.

Quantization and Energy Levels
Physics is a series of a few themes played over and over.

It was mentioned before, in discussing the particle and wave nature of a photon and light, that particles such as electrons have a wave nature also. This was normally hard to detect, since the wavelength was inversely proportional to the momentum of the particle. Electrons can have wavelengths of less than a nanometer, and thus can make useful electron microscopes that can see very tiny things.

It is obvious that in the tiny orbits in atoms, that we must take into account the wave nature of the electron! That is, it is not an orbiting particle like a planet that we must work with, but our task is to fit an electron wave into a confined orbit in a way that makes sense.

According the the physicist de Broglie, particles correspond to waves with wavelength given by L = h / p (where p is the momentum, and h is Planck's constant) and frequency f = E / h (just like light).

What we have learned from dealing with light waves is that when you bring parts of a wave together, then unless you are careful, it will interfere with itself and cancel out. In fact, if you try drawing a sinusoidal wave on a circlular orbit, then unless you adjust the wavelength, you will arrive back at the starting point out of phase and as you draw the wave around and around the orbit, it will cancel out completely!

Thus, you can see graphically that the only allowed wave orbits are those with an integer number of wavelengths in the circumference (see drawing below). Then, the wave wraps around and around always in phase with itself - this sort of wave pattern is called a standing wave. This is how a plucked string generates a note of musical sound, it is allowed to oscillate only such that a standing wave of motion is produced.

Thus, we would expect that orbits are quantized in a special way so that an integer number of standing electron waves, say n, will fit on the circumference 2 Pi r = n where the integer n = 1, 2, 3, ... and so on. Thus we would expect only certain quantized energy levels to exist in an atom, and thus only photons of special energies corresponding to the differences between these levels to be absorbed and emitted by the electrons.

The different allowed orbits are roughly spherical (not circles) and are called electron shells. Even though they are spheres, not closed circles, our wave arguments still hold. The exact orbital pattern of an electron wave determines the ability of an atom to make molecules, because as you can see if it is concentrated into a circle, then the positive proton can make its attractive force felt in the perpendicular direction along the poles of the orbit! It also turn out that we can pack in more than a single electron in one shell. How many you can put in without them interfering with each other is an important property of atomic structure. We will discuss these topics more in the next lecture.

The physics of the actions of wave-particles, and of quantized states that arise when you work with them, is called quantum mechanics. This subject was pioneered in the early parts of this century, and has caused a revolution in our understanding of the nature of the Universe.

So, in our quantum atom, an electron can absorb a photon of specific energy and jump to a higher energy shell. However, it turns out that it will almost immediately emit a photon and jump back down to the original level, or one lower if possible, emitting a photon in the process.

You can see there is a minimum-energy orbit corresponding to a single wavelength on the orbit. If we number the shells outward from the nucleus by their integer number of waves, we would call this the n = 1 orbit.

Because rules of quantum mechanics allow us to only put a certain number of electrons in each orbit, there is an atomic state where all the lowest available orbits are filled. This is the lowest energy state of the atom, and is called the ground state.

If an atom is in the ground state, and one of its electrons absorbs a photon and jumps to a higher-energy orbit, then we say that this atom is now in an excited state. It almost immediately emits a photon of the same energy and de-excites back to the ground state. It would take a continuous bombardment by photons (or collisions) to keep atoms in an excited state for any length of time.

What are the energy levels, and thus the allowed wavelengths of emitted and absorbed photons, for a simple atom like hydrogen? It turns out that there is a simple formula, called the Rydberg formula, that gives this (see below).

Note that there is a finite energy difference between the lowest orbit and an electron orbiting at infinity. For hydrogen, this corresponds to a wavelength of 91.18 nm. If a photon of this wavelength or shorter is absorbed by an electron in the ground state, it will be knocked out of the atom altogether! Note that a jump from the n = 1 to the n = 2 levels corresponds to a wavelength of 121.6 nm.

In the next lecture, we will look in more detail at the energy level emitted spectrum of the hydrogen atom.

The Danish physicist Niels Bohr (1872-1962) devised a more or less correct vision of the atom, though without the notion of waves or quantum mechanics. This classical Bohr atom with quantized orbits is often still used as a toy model.

Next Lecture - Atomic Spectra

Interference and Interferometry

Interference of wavefront at telescope causes blurring (resolution):

Interference between wavefronts received at separate telescopes makes an interferometer work:

Detectors

The Atom

Structure of the atom - nucleus and electron "cloud" for hydrogen and helium:

Coulomb Forces and Electron Orbits

The Bohr model of the atom - electrons as orbiting "planets" in an atomic "solar system":

Quantization and Energy Levels

Packing the electron wave into orbit - standing waves and quantized electron orbitals:

Orbitals correspond to different electron energy levels:

Jumping between these levels can be initiated by absorption of a photon of the correct energy, but the atom quickly returns to its de-excited state emitting a photon of the same energy:

Go to Previous Lecture ---- Go to Next Lecture

Back to the Lecture Notes Index
Back to the ASTR001/Sec3 Page

Steven T. Myers - Last revised 29Feb96