⬡ General labs

Build, optimize, properties, conformers, IR — accessible investigations, no heavy compute.

Lab 1Polarity face-off▶

Why are some molecules polar and others not — and can you predict it from shape alone?

Why it matters. Polarity quietly controls almost everything you can observe about a substance: its boiling point, what it dissolves in ("like dissolves like"), whether it slips through a cell membrane, and how tightly it docks into a protein. The common trap is to assume that polar bonds always make a polar molecule — this lab shows that molecular shape can cancel the bond dipoles completely, so you need both pieces to predict polarity.

buildoptimize (xTB)dipole

You’ll learn to: predict whether a molecule is polar from its shape alone, distinguish a bond dipole from the net molecular dipole, and explain how symmetry makes polar bonds cancel.

Background 1 — bond dipoles come from electronegativity

A bond is polar when the two atoms pull on the shared electrons unequally. The driver is the electronegativity difference (ΔEN): O (3.44) and N (3.04) far outpull H (2.20), so O–H and N–H bonds carry sizeable dipoles, while the C–H bonds of methane (ΔEN ≈ 0.35) are weakly polar. Each polar bond is a bond dipole — a vector pointing toward the more electronegative atom. But a bond dipole is only half the story: the molecule’s overall dipole is the vector sum of all of them.

Background 2 — geometry decides whether dipoles cancel

Whether those vectors add or cancel is fixed by molecular geometry (VSEPR). In CO₂ the two C=O dipoles are equal and point in exactly opposite directions along a linear axis, so they cancel to zero. Methane’s four C–H dipoles point to the corners of a tetrahedron and sum to zero by symmetry. Water (bent, ≈104.5°) and ammonia (pyramidal) lack that cancelling symmetry, so their bond dipoles — reinforced by the lone-pair density — survive as a net molecular dipole.

Background 3 — why the net dipole matters

Net polarity governs bulk behaviour. Water’s ≈1.85 D dipole underlies hydrogen bonding, its 100 °C boiling point, and “like dissolves like” solubility; nonpolar CH₄ boils at −161 °C. In MoleBench you build each molecule, optimize the geometry with GFN2-xTB, then read the computed dipole moment. Remember the method is fast, approximate and gas-phase, so values are estimates — but the qualitative split (water/ammonia polar, methane/CO₂ zero) is robust and reproduces the textbook picture.

Molecules

water ammonia methane CO₂

Procedure

Build the first molecule and press Optimize & compute (panel 2) to get an accurate geometry and its dipole moment; record the value in debye.
Rotate the 3D model and judge its symmetry — do the polar bonds point in directions that cancel out, or do they reinforce each other?
Repeat for all four molecules, then sort them into polar and nonpolar.

Dipole (D)	water	ammonia	methane	CO₂
dipole
polar?

Expected finding

Water (~2 D) and ammonia (~1.6 D) are polar — bent / pyramidal so the bond dipoles don't cancel. Methane (0 D, tetrahedral) and CO₂ (0 D, linear) are nonpolar — symmetric, so the bond dipoles cancel even though the bonds themselves are polar. Polarity = bond polarity + molecular shape.

Questions

CO₂ contains two strongly polar C=O bonds, yet its measured dipole moment is 0 D. Explain how a molecule built entirely from polar bonds can be nonpolar overall.
Both water and CO₂ are triatomic with two identical polar bonds, but only water is polar. Which single property accounts for the difference, and what is it for each?
Predict before you compute. Without running anything, rank water, ammonia, methane and CO₂ from most to least polar using only electronegativity and shape — then check against the optimized dipoles.
Ammonia (≈1.6 D) has a smaller dipole than water (≈1.85 D) even though N is less electronegative than O and ammonia has more N–H bonds. Suggest two reasons the water dipole still comes out larger.
Quick vector check: water’s O–H bond dipole is roughly 1.5 D and the H–O–H angle is 104.5°. Estimate the molecular dipole as the vector sum of the two O–H dipoles and compare with the ≈1.85 D value. (Hint: the resultant is 2 · 1.5 · cos(104.5°/2).)
Apply it: tetrahedral CF₄ and trigonal-planar BF₃ both have very polar bonds. Predict their net dipoles and state the geometric reason, by analogy to methane and CO₂.

Worked answers & discussion

Q1. The molecular dipole is the vector sum of the bond dipoles, not the sum of their magnitudes. CO₂ is linear, so the two equal C=O dipoles point in exactly opposite directions (180° apart) and cancel completely — net 0 D — even though each bond is strongly polar.
Q2. Shape. CO₂ is linear (O–C–O = 180°) so dipoles cancel; water is bent (≈104.5°) so its two O–H dipoles add to a net vector bisecting the angle. Same number of polar bonds, opposite outcome — geometry is the deciding variable.
Q3. Most → least polar: water (≈1.85 D) > ammonia (≈1.6 D) > methane = CO₂ (0 D). Water/ammonia have asymmetric (bent/pyramidal) shapes that don’t cancel; methane (tetrahedral) and CO₂ (linear) are symmetric, so they cancel to zero. The optimized GFN2-xTB dipoles confirm this ordering.
Q4. (i) O is more electronegative than N, so each O–H bond dipole is larger than each N–H one; (ii) geometry: water’s two dipoles are bent at ≈104.5° and reinforce strongly, whereas ammonia’s three N–H dipoles are partly opposed by the more pyramidal arrangement, and its lone-pair contribution adds along a different axis, so the components add less efficiently.
Q5. Resultant = 2 · (1.5 D) · cos(52.25°) = 3.0 · 0.612 ≈ 1.84 D — essentially the measured 1.85 D. The simple vector model works because the lone-pair contribution roughly tracks the bond-dipole sum here. (Treat the input 1.5 D as an effective value; it is calibrated to give this answer.)
Q6. Both are nonpolar, net 0 D. CF₄ is tetrahedral like methane — the four C–F dipoles cancel by symmetry. BF₃ is trigonal planar — three B–F dipoles at 120° sum to zero. Polar bonds, symmetric shape ⇒ nonpolar molecule.
Take‑home. Polarity = bond polarity (from ΔEN) combined with molecular shape — symmetric geometries can cancel even strongly polar bonds, so you need both pieces to predict it.

Lab 2Drug-likeness showdown▶

Which of these molecules would make a viable oral drug by Lipinski's rule of five?

Why it matters. Most medicines are swallowed, so before a molecule can work it has to dissolve, survive the gut, and cross cell membranes into the blood — its oral bioavailability. Lipinski's "rule of five" is the quick filter medicinal chemists use to flag molecules unlikely to be absorbed, connecting a handful of cheap, instantly-computed descriptors to a make-or-break real-world property.

propertieslogP / TPSALipinski

You’ll learn to: apply Lipinski’s rule of five, read logP, TPSA and H-bond counts as absorption proxies, and explain why a highly soluble sugar still fails as an oral drug.

Background 1 — the rule of five

Lipinski’s rule of five flags molecules unlikely to be well absorbed when swallowed. A compound tends to have poor oral absorption if it violates two or more of: molecular weight > 500 Da; logP > 5; more than 5 H-bond donors (counted as N–H + O–H); more than 10 H-bond acceptors (N + O atoms). Every cutoff is a multiple of five — hence the name. It is a coarse filter, not a law: it predicts passive membrane permeability and solubility, the gate every oral drug must pass before it can reach its target.

Background 2 — logP, TPSA and the permeability balance

logP is the log of the octanol/water partition coefficient — a lipophilicity gauge. Too low and a drug won’t cross the greasy lipid bilayer; too high and it won’t dissolve in blood or will stick in fat. TPSA (topological polar surface area) sums the area of polar N/O atoms and their attached H’s; it correlates tightly with H-bonding. A TPSA above ≈140 Å² usually means poor passive permeability. Aspirin, ibuprofen and caffeine sit in the sweet spot; sucrose, drowning in –OH groups, does not.

Background 3 — why sugar fails the test

Sucrose (C₁₂H₂₂O₁₁, 342 Da) carries 8 O–H donors and 11 oxygen acceptors — smashing both the ≤5 donor and ≤10 acceptor limits — with a TPSA near 190 Å². That dense polar shell binds water beautifully (sugar is sweet and soluble) but cannot shed its hydration to slip through a membrane. MoleBench computes these descriptors instantly from the structure; no quantum calculation is needed, since they are 2D counts. The numbers are cheap, but the principle — permeability vs polarity — is exactly why table sugar is food, not medicine.

Molecules

aspirin ibuprofen caffeine sucrose

Procedure

Build each molecule — the Properties panel updates instantly, no calculation needed.
Read off the molecular weight, cLogP (lipophilicity), TPSA (polar surface area), and the hydrogen-bond donor / acceptor counts.
Check the Lipinski flag and decide which molecules pass and which would struggle to be absorbed.

	aspirin	ibuprofen	caffeine	sucrose
MW
cLogP
Lipinski pass?

Expected finding

The three small drugs pass the rule of five. Sucrose fails — too many H-bond donors/acceptors and a huge TPSA (it's a sugar, not membrane-permeable). The rule of five captures why table sugar isn't an oral drug.

Questions

State the four Lipinski criteria and their cutoffs. How many of the four must a molecule break before the rule predicts poor oral absorption?
Count the descriptors by hand for sucrose (SMILES OCC1OC(OC2(CO)OC(CO)C(O)C2O)C(O)C(O)C1O): how many H-bond donors (O–H) and how many acceptors (all N + O)? Which two rules does it violate, and why is that enough to fail?
Aspirin, ibuprofen and caffeine all pass. Pick the one with the highest logP and explain, from its structure, why it is the most lipophilic of the three.
Caffeine has zero H-bond donors yet several acceptors. Explain how a molecule can have many acceptors but no donors, and why that helps its membrane permeability.
TPSA estimate: sucrose has 8 hydroxyl groups (each ≈20.2 Å²) plus 3 ring/ether oxygens (each ≈9.2 Å²). Roughly sum its TPSA and compare with the ≈140 Å² permeability threshold.
Apply it: a candidate has MW 480, logP 4.8, 4 donors and 9 acceptors. Does it pass Lipinski? What does passing the rule guarantee — and what does it not guarantee — about the drug?

Worked answers & discussion

Q1. MW ≤ 500 Da; logP ≤ 5; H-bond donors (N–H + O–H) ≤ 5; H-bond acceptors (N + O) ≤ 10. A molecule is flagged as likely poorly absorbed only when it breaks two or more of the four.
Q2. Sucrose has 8 O–H donors and 11 oxygen acceptors (8 hydroxyl O + 2 ring O + 1 glycosidic O). It violates the donor rule (8 > 5) and the acceptor rule (11 > 10) — two violations, so it fails. Its MW (342) and logP (≈−3.7) are fine, but two strikes are enough.
Q3. Ibuprofen (logP ≈ 3.5) is the most lipophilic. It carries a large hydrophobic skeleton — an aromatic ring plus an isobutyl and a methyl group — with only a single polar carboxylic acid. Aspirin (≈1.2) adds an ester and acid; caffeine (≈−0.1) is studded with polar carbonyls and ring nitrogens, lowering logP.
Q4. Acceptors are lone-pair-bearing N and O atoms; donors require an H actually bonded to N or O. Caffeine’s nitrogens are all fully methylated or part of the ring (no N–H) and its oxygens are carbonyls (no O–H), so it accepts H-bonds but donates none. Fewer donors means less polar surface to desolvate, aiding passive permeability — part of why caffeine crosses the blood–brain barrier readily.
Q5. TPSA ≈ 8 × 20.2 + 3 × 9.2 = 161.6 + 27.6 ≈ 189 Å² (literature ≈ 190 Å²). That is far above the ≈140 Å² ceiling for good passive permeability — an independent confirmation that sucrose won’t cross membranes well.
Q6. MW 480 ≤ 500 (ok), logP 4.8 ≤ 5 (ok), 4 donors ≤ 5 (ok), 9 acceptors ≤ 10 (ok): zero violations, so it passes. Passing only means it is not flagged for poor passive absorption — it says nothing about potency, selectivity, metabolic stability, toxicity, or active transport. Many real drugs (and most injectables) legitimately break the rule.
Take‑home. The rule of five is a fast permeability/solubility filter, and sucrose fails it on H-bonding alone — capturing precisely why table sugar, for all its solubility, is not an oral drug.

Lab 3Anti vs gauche▶

What shape does a flexible molecule prefer — and can an internal hydrogen bond change the answer?

Why it matters. A flexible molecule isn't a single shape but a shifting population of conformers, and the one it spends most of its time in governs how it reacts and how it fits a binding site. Conformational analysis is a daily tool in drug design and biochemistry — and this lab delivers a classic surprise: an internal hydrogen bond can overrule the "obvious" steric preference.

conformer searchBoltzmann populations

You’ll learn to: rank a flexible molecule’s conformers by energy, interpret their Boltzmann populations, and recognise when an intramolecular hydrogen bond overrides the usual steric preference.

Background 1 — conformers and the torsion angle

Rotating about a single C–C bond costs little energy, so a flexible molecule is a shifting population of conformers — shapes that interconvert without breaking bonds. They are distinguished by the dihedral (torsion) angle of the four atoms across the bond. For butane’s C1–C2–C3–C4 chain, the key staggered forms are anti (dihedral 180°, methyls far apart) and gauche (±60°, methyls close). Eclipsed forms (0°, 120°) are energy maxima. Conformational analysis — finding which of these dominates — underpins how molecules react and how they fit binding sites.

Background 2 — sterics usually wins: butane and 1,2-dichloroethane

The default driver is steric strain. In anti-butane the two methyls are as far apart as possible, so it is the global minimum; the gauche form lies ≈0.9 kcal/mol higher because the methyls crowd. 1,2-Dichloroethane behaves the same way in the gas phase: the bulky, mutually repelling (and dipole-opposed) chlorines prefer anti. Boltzmann statistics then convert these energy gaps into populations: at 298 K a gap of ≈0.9 kcal/mol leaves the anti form roughly 70–80% populated, accounting for both gauche wells.

Background 3 — when an internal H-bond overrules sterics

Ethylene glycol (HOCH₂CH₂OH) breaks the pattern. Pure sterics would favour anti (O’s apart), but the gauche form lets the two oxygens approach so one O–H can donate to the other’s lone pair — an intramolecular hydrogen bond (O–H···O). That stabilisation (a few kcal/mol) outweighs the modest steric penalty, so gauche wins. In MoleBench you run a conformer search, optimize each with GFN2-xTB, and read off relative energies and Boltzmann populations. The method is approximate and gas-phase, so treat numbers as estimates — but it captures this H-bond surprise correctly.

Molecules

butane ethylene glycol 1,2-dichloroethane

Procedure

Build each molecule, open panel 5 and press Search conformers to generate and Boltzmann-rank the rotamers at room temperature.
Record how many distinct conformers were found and their relative energies and populations.
Inspect the lowest-energy conformer — is the key dihedral anti (~180°) or gauche (~60°)? For glycol, look for an O–H pointing back at the other oxygen.

	butane	glycol	1,2-DCE
# conformers
lowest = anti/gauche?

Expected finding

Butane prefers anti (sterics). Ethylene glycol surprisingly favours a gauche form — an internal O–H⋯O hydrogen bond stabilises it. The lab shows that the "obvious" anti preference can be overridden by intramolecular interactions.

Questions

Define anti and gauche in terms of the dihedral angle across the central C–C bond, and explain why eclipsed conformers are energy maxima rather than minima.
For butane, why is the anti conformer the global minimum? Identify the specific steric interaction that makes gauche ≈0.9 kcal/mol higher.
The central surprise: ethylene glycol favours gauche, the opposite of butane. Name the stabilising interaction responsible and explain why it is only available in the gauche geometry.
1,2-Dichloroethane prefers anti in the gas phase. Give two reasons (one steric, one electrostatic) why the chlorines avoid the gauche arrangement.
Boltzmann calculation: at 298 K (RT ≈ 0.59 kcal/mol), use population ratio = exp(−ΔE/RT) to estimate the gauche-to-anti ratio for a single gauche well in butane (ΔE ≈ 0.9 kcal/mol). Then account for there being two equivalent gauche wells.
Apply it: 2-fluoroethanol (FCH₂CH₂OH) also prefers gauche. Predict the stabilising interaction by analogy to ethylene glycol, and name one situation where this kind of internal H-bond matters in drug design.

Worked answers & discussion

Q1. Anti = dihedral 180° (the two end groups are antiperiplanar, maximally separated); gauche = ±60° (synclinal, end groups close). Both are staggered minima. Eclipsed forms (0°, 120°) put bonds directly behind one another, raising torsional (and steric) strain — they are the maxima the molecule rolls over between the staggered wells.
Q2. Anti places the two terminal methyls 180° apart, minimising steric clash and torsional strain — the global minimum. In gauche the methyls are only ≈60° apart and their van der Waals shells crowd (a methyl–methyl gauche interaction), costing ≈0.9 kcal/mol.
Q3. An intramolecular O–H···O hydrogen bond. Only in gauche do the two oxygens swing close enough for one hydroxyl H to reach the other oxygen’s lone pair; in anti they point away from each other and no H-bond can form. That ≈2–3 kcal/mol of H-bond stabilisation outweighs the small steric penalty, so gauche becomes the preferred conformer.
Q4. (i) Steric: Cl is large, so gauche crowds the two chlorine atoms; (ii) electrostatic/dipole: each C–Cl bond is strongly polar, and in anti the two bond dipoles oppose and partly cancel, lowering energy, whereas gauche aligns them and raises electrostatic repulsion. (In polar solvent the larger-dipole gauche form is stabilised and its population rises — a nice solvent-effect aside.)
Q5. Per well: exp(−0.9/0.59) = exp(−1.53) ≈ 0.22, so each gauche well is about 22% as populated as anti. With two equivalent gauche wells the total gauche:anti ratio is ≈ 2 × 0.22 = 0.44, i.e. anti ≈ 1/(1+0.44) ≈ 69%, gauche ≈ 31% — consistent with the “anti dominates” result.
Q6. An internal O–H···F hydrogen bond (the hydroxyl H to the fluorine lone pairs), available only in gauche — directly analogous to ethylene glycol. In drug design such intramolecular H-bonds can lock a bioactive conformation, mask polarity to boost membrane permeability, and reduce the entropic cost of binding.
Take‑home. Conformer preference is usually steric (anti wins, as in butane and 1,2-dichloroethane), but a well-placed intramolecular hydrogen bond can overrule sterics and make gauche the favoured shape — exactly what ethylene glycol demonstrates.

Lab 4The carbonyl stretch▶

Does the C=O infrared frequency depend on what's attached to the carbonyl?

Why it matters. Infrared spectroscopy identifies functional groups from the frequencies at which bonds vibrate, and the carbonyl (C=O) stretch near 1700 cm⁻¹ is the single most diagnostic band in all of organic chemistry. Its exact position is a sensitive reporter of the carbonyl's electronic environment — learning to read those shifts is a core skill for interpreting real IR spectra.

IR spectrumfunctional groups

You’ll learn to: relate a C=O stretching frequency to its force constant through Hooke’s law, and predict the direction a substituent shifts the carbonyl band.

Background 1 — a vibrating bond is a spring

A stretching bond behaves like two masses on a spring, so its frequency follows Hooke’s law: ν̅ ∝ √(k/μ), where k is the force constant (bond stiffness) and μ the reduced mass of the two atoms. The C=O double bond is stiff (k ≈ 12 N/cm, about twice a C–O single bond), which places its stretch high, near 1700 cm⁻¹ — clear of the crowded fingerprint region. Because the dipole changes sharply as it stretches, the band is also intense, making the carbonyl the single most diagnostic feature in an organic IR spectrum.

Background 2 — why the exact frequency shifts

Anything that changes the C=O force constant moves the band. Two electronic effects dominate. Inductive (σ) electron withdrawal by an attached atom strengthens the C=O, raising the frequency. Resonance/conjugation that donates lone-pair density into the C=O π* gives the bond partial single-bond character, weakening it and lowering the frequency. In an ester or amide these compete; the molecules here let you isolate trends: formaldehyde (H₂C=O, ≈1746 cm⁻¹) vs the alkyl-substituted acetaldehyde (≈1730) and acetone (≈1715), then acetic acid where the –OH enters the picture.

Background 3 — reading the trend in these four

Formaldehyde sits highest because it has only light H’s and no electron-donating alkyls. Replacing an H with a methyl (acetaldehyde, ≈1730) and then adding a second (acetone, ≈1715) lowers k stepwise via hyperconjugative donation. Acetic acid’s monomer absorbs near 1760 cm⁻¹, but in the liquid it forms hydrogen-bonded dimers that pull the band down to ≈1710. In MoleBench you optimize with GFN2-xTB and compute the IR spectrum; semi-empirical frequencies are gas-phase estimates and often need scaling, so trust the ordering and shifts more than absolute values.

Molecules

formaldehyde acetaldehyde acetone acetic acid

Procedure

Build each carbonyl compound, open panel 3 and press Compute IR + thermo.
In the stick spectrum, find the tall band between roughly 1700 and 1850 cm⁻¹ — that's the C=O stretch. Click that peak to animate the vibration in the 3D viewer and watch the C=O bond stretch and compress.
Record its position for all four and look for a trend as the groups attached to the carbonyl change. Click a few other peaks too — identify the C–H stretches (~3000 cm⁻¹) and the bends (lower) by watching how the atoms move.

	H₂C=O	CH₃CHO	acetone	AcOH
C=O stretch (cm⁻¹)

Expected finding

All four show a strong C=O band, but the exact frequency shifts with the neighbouring groups (alkyl substitution and the –OH of the acid change the bond's force constant). The lab demonstrates that the carbonyl is a recognisable IR "fingerprint" whose position carries chemical information.

Questions

Using Hooke’s law for a vibrating bond, explain why the C=O stretch (≈1700 cm⁻¹) appears at a much higher frequency than a C–O single bond (≈1100 cm⁻¹). Which factor — force constant or reduced mass — dominates?
Why is the carbonyl band not only well-positioned but also intense? Tie your answer to what an IR-active vibration requires.
Rank the four C=O stretches (formaldehyde, acetaldehyde, acetone, acetic acid monomer) from highest to lowest frequency and justify each step in terms of inductive vs hyperconjugative/resonance effects.
Acetic acid’s carbonyl band shifts from ≈1760 cm⁻¹ (monomer) to ≈1710 cm⁻¹ in the neat liquid. What structural change causes this, and why does it lower the frequency?
Hooke’s-law estimate: if a carbonyl’s force constant rises by 10% with no change in reduced mass, by roughly what percentage does its stretching frequency increase? (Use ν̅ ∝ √k.)
Apply it: an unknown shows a strong band at 1685 cm⁻¹, lower than a simple ketone. Suggest a structural feature (e.g. conjugation) that could explain the red shift, and predict whether an amide would absorb higher or lower than a ketone.

Worked answers & discussion

Q1. ν̅ ∝ √(k/μ). The C–O atoms are the same in both bonds, so μ is essentially identical; the difference is force constant. A C=O double bond is roughly twice as stiff (k ≈ 12 vs 5 N/cm) as a C–O single bond, and √2 ≈ 1.4 × higher frequency — pushing the stretch from ≈1100 up toward 1700 cm⁻¹.
Q2. An IR-active vibration must change the molecular dipole moment. The C=O bond is already very polar, and stretching it modulates that large dipole strongly, giving a big dμ/dr — hence a very intense band. (By contrast a symmetric nonpolar stretch, like C=C in ethylene, is weak or IR-silent.)
Q3. Highest → lowest: acetic acid monomer (≈1760) > formaldehyde (≈1746) > acetaldehyde (≈1730) > acetone (≈1715). Two effects combine. Going formaldehyde → acetaldehyde → acetone, each H replaced by an electron-donating methyl pushes density into the C=O (hyperconjugation/induction), softening the bond and lowering the stretch. The acid monomer sits above them all because the electronegative –OH withdraws density inductively and stiffens the C=O (higher k) — only when it pairs into the H-bonded dimer in the neat liquid does its band fall to ≈1710 (Q4). Rule of thumb: inductive withdrawal raises the stretch, alkyl donation lowers it.
Q4. In the neat liquid two acetic acid molecules pair into a cyclic hydrogen-bonded dimer (two O–H···O=C bridges). Accepting an H-bond at the carbonyl oxygen drains C=O π density and lengthens/weakens the bond, lowering k and dropping the stretch by ≈50 cm⁻¹ to ≈1710.
Q5. ν̅ ∝ √k, so a 10% rise in k gives √1.10 = 1.049, about a +4.9% increase in frequency — e.g. a 1715 cm⁻¹ band would move up by ≈84 cm⁻¹. Small force-constant changes produce modest but clearly measurable shifts.
Q6. A band at 1685 suggests conjugation (e.g. an α,β-unsaturated ketone or an aryl ketone): delocalising the C=O π into an adjacent π system gives partial single-bond character, lowering k and the frequency. An amide absorbs lower than a ketone (≈1650 cm⁻¹) because the nitrogen lone pair donates strongly into the C=O π*, weakening it (resonance dominates over induction).
Take‑home. The carbonyl is a recognisable ≈1700 cm⁻¹ IR fingerprint, and its exact position is a sensitive electronic reporter — alkyl donation, inductive withdrawal, conjugation and hydrogen bonding each shift the C=O force constant and move the band in a predictable direction.

Lab 5Bond order vs bond length▶

How does the carbon–carbon bond length change from single to double to triple?

Why it matters. Bond order and bond length are two faces of the same idea: each extra shared electron pair pulls the atoms closer and makes the bond stronger and stiffer. Putting numbers on this turns the abstract notion of "bond order" into something measurable, and the lengths you find here are the same ones used to recognise single, double and triple bonds in any structure.

optimize (xTB)📏 measure

You’ll learn to: correlate bond order with bond length and stiffness, and recall the characteristic single, double and triple C–C bond lengths.

Background 1 — bond order and the σ/π framework

Bond order is the number of shared electron pairs between two atoms: one for a single bond, two for a double, three for a triple. The first pair forms a σ bond (head-on overlap along the internuclear axis); each additional pair is a π bond from side-on p-orbital overlap. So ethane (C–C) is one σ; ethylene (C=C) is one σ + one π; acetylene (C≡C) is one σ + two π. Each extra shared pair adds binding electron density between the nuclei, pulling them closer and stiffening the bond.

Background 2 — more shared pairs, shorter and stronger bond

Two quantitative consequences follow. Length shortens: the experimental C–C values are ≈1.54 Å (ethane), ≈1.34 Å (ethylene) and ≈1.20 Å (acetylene). Strength rises: the bond dissociation/energy climbs from ≈83 (single) to ≈146 (double) to ≈200 kcal/mol (triple) — but note it is not simply additive, because the second and third π bonds overlap less efficiently than the σ. Hybridisation tracks this too: the carbons go sp³ → sp² → sp, and higher s-character holds bonding electrons closer to the nuclei, reinforcing the contraction.

Background 3 — measuring it in MoleBench

Here you build ethane, ethylene and acetylene, optimize each geometry with GFN2-xTB, then use the measure tool to read the C–C distance directly. Bond length is one of the properties semi-empirical methods reproduce best, so the computed values should land within a few hundredths of an Ångström of experiment and recover the ≈1.53 / 1.33 / 1.20 Å ladder. These are the same reference lengths chemists use in reverse: spotting a ≈1.20 Å carbon–carbon distance in a crystal structure immediately signals a triple bond.

Molecules

ethane ethylene acetylene

Procedure

Build each molecule and press Optimize & compute so the geometry is physically accurate, not just a sketch.
Switch on 📏 measure and click the two carbons to read the C–C distance.
Tabulate single vs double vs triple and note how much each extra bond shortens the link.

	ethane (C–C)	ethylene (C=C)	acetylene (C≡C)
length (Å)

Expected finding

Bond length shortens as bond order rises: single ≈ 1.53 Å, double ≈ 1.33 Å, triple ≈ 1.20 Å. More shared electron pairs pull the carbons closer.

Questions

Describe the σ/π composition of the C–C bond in ethane, ethylene and acetylene. How many σ and how many π bonds does each contain?
Explain the trend: why does adding electron pairs (single → double → triple) shorten the carbon–carbon bond? Refer to electron density between the nuclei and to changing hybridisation (sp³ → sp² → sp).
Bond energies are ≈83 (single), 146 (double), 200 kcal/mol (triple). Show that the increase is not simply additive, and explain why the second and third bonds add less than the first.
Quantitative check: from ethane (1.53 Å) to ethylene (1.33 Å) the bond shortens by 0.20 Å; from ethylene to acetylene (1.20 Å) by 0.13 Å. Express each as a percentage of the single-bond length, and comment on why the second contraction is smaller.
Acetylene is linear (H–C≡C–H, 180°) while ethylene is planar (120° angles). Predict the H–C–H/H–C–C angles in each from hybridisation, and relate this to the bond-order trend.
Apply it: a structure shows a carbon–carbon distance of 1.40 Å, between single and double. What bonding situation would give an intermediate length, and name a molecule where every C–C bond is ≈1.39 Å.

Worked answers & discussion

Q1. Ethane: 1 σ (bond order 1). Ethylene: 1 σ + 1 π (bond order 2). Acetylene: 1 σ + 2 π (bond order 3). The first shared pair is always the σ (axial overlap); each extra pair is a side-on π bond.
Q2. Each additional shared pair piles more bonding electron density directly between the two carbon nuclei; that density screens the nuclear repulsion and pulls the nuclei together, so the equilibrium distance shrinks. Hybridisation reinforces it: going sp³ (25% s) → sp² (33%) → sp (50%), the bonding orbitals gain s-character, which is held closer to the nucleus, contracting the bond further. The result is the 1.53 → 1.33 → 1.20 Å ladder.
Q3. Doubling bond order doesn’t double energy: 146 / 83 ≈ 1.76 (not 2×) and 200 / 83 ≈ 2.4 (not 3×). The first (σ) bond enjoys strong head-on overlap; the π bonds rely on weaker side-on p-overlap, so each adds less than the σ — the increments are ≈63 then ≈54 kcal/mol, diminishing.
Q4. First contraction: 0.20 / 1.53 = 13.1%; second: 0.13 / 1.53 = 8.5%. The second contraction is smaller because, as above, the third shared pair (second π bond) overlaps less effectively and contributes less binding density than the first π, so it draws the nuclei in by a smaller increment.
Q5. Acetylene carbons are sp-hybridised ⇒ linear, H–C–C = 180°; ethylene carbons are sp² ⇒ trigonal planar, H–C–H/H–C–C ≈ 120°. Higher s-character (sp) goes hand-in-hand with the shortest, strongest bond — geometry and bond order move together.
Q6. An intermediate length signals delocalised / partial double-bond character — resonance spreading π density so each bond is fractional in order. The classic example is benzene, where all six C–C bonds are equal at ≈1.39 Å, midway between single (1.54) and double (1.33).
Take‑home. Bond order and bond length are two faces of the same idea: each extra shared electron pair shortens, strengthens and stiffens the carbon–carbon bond, giving the diagnostic ≈1.53 / 1.33 / 1.20 Å ladder used to identify single, double and triple bonds anywhere.

Lab 6VSEPR — shapes & bond angles▶

Does adding lone pairs really squeeze the bond angle below the perfect tetrahedral 109.5°?

Why it matters. VSEPR is the first model students use to predict a molecule's 3D shape, and its central claim — that lone pairs repel more strongly than bonding pairs and so compress bond angles — is directly testable. Since shape decides polarity, reactivity and how molecules pack together, confirming VSEPR against measured angles builds trust in a model you'll lean on constantly.

optimize (xTB)📐 measure angleVSEPR

You’ll learn to: predict molecular shapes and bond angles with VSEPR, and explain why lone pairs compress angles below the ideal tetrahedral value.

Background 1 — electron domains set the shape

VSEPR (Valence‑Shell Electron‑Pair Repulsion) treats every electron domain — a bonding pair, a lone pair, or a whole multiple bond — as a cloud of negative charge that pushes the others as far apart as possible. Four domains around a central atom point to the corners of a tetrahedron at 109.5°; two domains go linear at 180°. Methane, ammonia and water all have four domains on the central atom, so all three start from the same tetrahedral parent geometry — the differences in their measured angles come entirely from what occupies those four positions.

Background 2 — why lone pairs squeeze harder

A lone pair is held by only one nucleus, so its cloud is fatter and sits closer to the central atom than a bonding pair, which is pulled out between two nuclei. The fatter lone pair therefore exerts stronger repulsion, and the order of repulsions is lp–lp > lp–bp > bp–bp. Each lone pair pushes the remaining bonds together, compressing the bond angle below 109.5°. Ammonia (one lone pair) drops to ~107°; water (two lone pairs) drops further to ~104.5° — a clean, additive trend you can measure directly.

Background 3 — method note & the linear case

CO₂ has no lone pairs on carbon: the central atom owns just two electron domains (the two double bonds), which repel to a straight 180° line. So the "compression" story only applies when lone pairs are actually present on the central atom. The GFN2‑xTB optimization used here is a fast semi‑empirical method, gas‑phase and approximate, but it reproduces these textbook angles to within about a degree — good enough to confirm the qualitative VSEPR ranking, which is the point of the lab rather than benchmark‑accuracy geometry.

Molecules

methane ammonia water CO₂

Procedure

Build each hydride and press Optimize & compute for a real geometry.
Switch on 📐 measure and click H, then the central atom, then another H to read the bond angle.
Count the lone pairs on the central atom and check whether the angle shrinks as more are added.

	methane	ammonia	water	CO₂
lone pairs on centre
bond angle (°)

Expected finding

The angle compresses as lone pairs are added: methane 109.5° (0 LP) → ammonia ~107° (1 LP) → water ~104.5° (2 LP). Lone pairs take more room than bonding pairs and push the bonds together. CO₂ is linear (180°) — no lone pairs on carbon. VSEPR, confirmed by measurement.

Questions

All three of methane, ammonia and water have four electron domains around the central atom, yet their bond angles differ. Explain in one sentence why they don’t all measure 109.5°.
Quantify the squeeze. Going methane (109.5°) → ammonia (~107°) → water (~104.5°), roughly how many degrees does each added lone pair cost? Is the trend additive?
Rank the three pairwise repulsions (lp–lp, lp–bp, bp–bp) from strongest to weakest, and use that ranking to explain why water’s angle falls below ammonia’s.
CO₂ is linear at 180° even though each C=O bond is polar. Why does carbon adopt a linear shape here while the other three central atoms are bent or pyramidal?
Apply it. Predict the H–X–H bond angle in the hydronium ion H₃O⁺ (one lone pair on O) and in H₂S relative to water. Should they be larger or smaller than 109.5°, and why?
Why does a 1–2° error from a fast gas‑phase method like GFN2‑xTB still let you confirm VSEPR, even though it’s not benchmark‑accurate?

Worked answers & discussion

Q1. Four domains give the same tetrahedral parent geometry, but the domains aren’t identical: methane has four bonding pairs, ammonia has three bonds + one lone pair, water has two bonds + two lone pairs. Lone pairs repel more strongly than bonding pairs, so they compress the H–X–H angle below 109.5°.
Q2. 109.5° → 107° is about −2.5° for the first lone pair; 107° → 104.5° is another ~−2.5° for the second — so each added lone pair costs roughly 2–3°, and the trend is close to additive (each lone pair contributes a similar extra squeeze).
Q3. lp–lp > lp–bp > bp–bp. A lone pair sits closer to the nucleus and is fatter, so it repels harder. Water has two lone pairs (introducing strong lp–lp repulsion) versus ammonia’s one, so the bonds in water are pushed together more — 104.5° < 107°.
Q4. Carbon in CO₂ has no lone pairs — only two electron domains (the two C=O double bonds). Two domains repel to a straight line (180°). The other three central atoms each carry four domains including lone pairs, forcing a bent (water) or pyramidal (ammonia) shape. Linearity isn’t about bond polarity; it’s about the domain count on the central atom.
Q5. Simple VSEPR treats H₃O⁺ like ammonia — one lone pair on O — so it predicts ~107°, just below the tetrahedral 109.5°. The measured angle is actually a touch larger (≈111–113°): the +1 charge contracts the oxygen lone pair, so it repels the O–H bonds less than nitrogen’s neutral lone pair does. That is the useful refinement — charge tunes lone-pair repulsion on top of the basic domain count, so VSEPR gets the direction right (below tetrahedral) but a cation flattens more than the naive rule expects. H₂S also has two lone pairs like water but a larger, more diffuse central atom; its angle drops well below water’s to ~92° because the longer S–H bonds ease bp–bp repulsion and the bonds use nearly pure p orbitals.
Q6. The lab tests a ranking (does the angle fall as lone pairs are added?), not an absolute value. A consistent ~1–2° error shifts all the numbers together without scrambling the order, so the trend 109.5 > 107 > 104.5 survives — enough to confirm VSEPR qualitatively.
Take‑home. Count electron domains to get the parent shape, then let lone pairs — which repel more than bonding pairs — compress the bond angle by a few degrees each, exactly as the measured methane > ammonia > water sequence shows.

Lab 7Which alkene is most stable?▶

For three isomers with the same formula (C₄H₈), does substitution and cis/trans geometry change the energy?

Why it matters. Which alkene forms is the heart of elimination chemistry (Zaitsev's rule) and of why one isomer dominates at equilibrium. Because these three molecules share a formula, their total energies can be compared directly — a clean, rigorous way to rank stability and put a number on ideas like "more substituted is more stable" and "trans beats cis."

optimize (xTB)relative energyisomers

You’ll learn to: rank constitutional/geometric isomers by relative energy, and explain substitution and cis/trans stability effects (Zaitsev) quantitatively.

Background 1 — why isomers can be compared directly

1‑butene, cis‑2‑butene and trans‑2‑butene are constitutional / geometric isomers: same formula C₄H₈, same atoms, same bonds counted by type. Because they contain exactly the same nuclei and electrons, their computed total electronic energies are on one common scale, so the difference in energy is the difference in stability — no reference reaction needed. The lowest‑energy isomer is the thermodynamically most stable one and the one favoured at equilibrium. This is what makes alkene isomers such a clean test of the rules below.

Background 2 — more substituted means more stable

The C=C π bond is stabilised by neighbouring C–H bonds through hyperconjugation — electron density from adjacent σ(C–H) bonds delocalises into the empty π* orbital. An internal (disubstituted) double bond, as in 2‑butene, has more alkyl groups flanking it than the terminal (monosubstituted) double bond of 1‑butene, so it enjoys more hyperconjugation plus favourable alkyl‑group inductive donation. This is the structural basis of Zaitsev’s rule: elimination preferentially gives the more‑substituted, more‑stable alkene, and you are measuring exactly that preference here.

Background 3 — trans beats cis

Among the two 2‑butenes the difference is geometry. In the cis (Z) isomer the two methyl groups sit on the same side of the double bond and crowd each other, producing steric strain (van der Waals repulsion). In the trans (E) isomer the methyls are on opposite sides and stay clear, so trans is lower in energy by roughly 1–1.5 kcal/mol experimentally. GFN2‑xTB is fast, approximate and gas‑phase, but it captures both effects — substitution and cis/trans — well enough to reproduce the correct stability ordering.

Molecules

1-butene cis-2-butene trans-2-butene

Procedure

Build each isomer and press Optimize & compute; record the total energy in hartree (Eh).
Since all three are C₄H₈, subtract to get the energy differences, then multiply by 627.5 to convert to kcal/mol.
Rank them from most to least stable and connect the order to substitution and cis/trans strain.

	1-butene	cis-2-butene	trans-2-butene
energy (Eh)
ΔE vs lowest (kcal/mol)

Expected finding

trans-2-butene is lowest — the more-substituted (internal) double bond beats the terminal one, and trans beats cis because the two methyls avoid steric clash. Order: trans-2-butene < cis-2-butene < 1-butene. This is the "more substituted alkenes are more stable" (Zaitsev) rule, measured.

Questions

Why can the total energies of these three molecules be compared directly, when in most labs you need a reference reaction to compare two different molecules?
Name the two electronic/steric factors that together explain the ordering trans‑2‑butene < cis‑2‑butene < 1‑butene. Which factor separates the two 2‑butenes from 1‑butene, and which one separates trans from cis?
Put a number on it. If your optimization places cis‑2‑butene about 1 kcal/mol above trans‑2‑butene, use ΔG = −RT ln K at 298 K (RT ≈ 0.593 kcal/mol) to estimate the trans:cis ratio at equilibrium.
Explain hyperconjugation in one sentence and state how it makes an internal double bond more stable than a terminal one.
Apply Zaitsev. Dehydration (−H₂O) of 2‑butanol can give 1‑butene or 2‑butene. Based on this lab, which alkene should dominate as the major product, and which geometric isomer of it?
GFN2‑xTB is a fast, gas‑phase, approximate method. Why is it still trustworthy for ranking these isomers even if the absolute energy gaps are off by a few tenths of a kcal/mol?

Worked answers & discussion

Q1. They are isomers — identical atoms and electron count (C₄H₈) — so their total electronic energies live on the same absolute scale. The energy difference between them is the stability difference; nothing cancels out incompletely the way it would when comparing molecules of different formula.
Q2. Substitution (hyperconjugation + alkyl donation) makes the internal double bond of both 2‑butenes more stable than the terminal one in 1‑butene — that’s the big gap. Steric strain then separates the two 2‑butenes: cis crowds its two methyls, trans keeps them apart, so trans < cis.
Q3. K = e^−ΔG/RT = e^−1/0.593 = e^−1.69 ≈ 0.18, so cis:trans ≈ 0.18:1, i.e. trans:cis ≈ ~5.5:1. The ~1 kcal/mol gap already gives a clear majority of trans — small energy differences translate into sizeable population differences.
Q4. Hyperconjugation is the delocalisation of electron density from a filled adjacent σ(C–H) bond into the empty π* orbital of the double bond. An internal double bond is flanked by more C–H bonds (more alkyl substituents) than a terminal one, so it receives more hyperconjugative stabilisation and sits lower in energy.
Q5. By Zaitsev, the more‑substituted 2‑butene dominates over 1‑butene, and of the 2‑butenes the lower‑energy trans‑2‑butene is the major product — exactly the stability order measured in this lab.
Q6. The lab asks for a ranking, and systematic errors in a fast method tend to shift the isomers’ energies in the same direction, preserving their order. A few tenths of a kcal/mol of noise doesn’t flip the >1 kcal/mol gaps that set trans < cis < 1‑butene.
Take‑home. More‑substituted and less‑crowded wins: trans‑2‑butene is the most stable C₄H₈ isomer, putting a measured number on Zaitsev’s rule and the “trans beats cis” preference.

Lab 8Ring strain▶

Why is cyclopropane so reactive? Quantify the strain by comparing energy per CH₂.

Why it matters. Ring strain explains why cyclopropane behaves almost like an alkene and why six-membered rings are everywhere in nature — they are essentially strain-free. Because the rings are different sizes, you compare energy per CH₂ unit, a normalization trick worth having in your toolkit any time you want to compare molecules of different size on equal footing.

optimize (xTB)energy per CH₂ring strain

You’ll learn to: quantify ring strain using an energy-per-CH₂ normalisation, and connect stored strain to a ring’s reactivity.

Background 1 — angle strain and the tetrahedral ideal

An sp³ carbon “wants” bond angles of 109.5°. In a ring the internal angle is fixed by geometry: a flat triangle forces ~60°, a square ~90°, a pentagon ~108°, a hexagon ~120°. Squeezing the C–C–C angle far from 109.5° forces the bonding orbitals to overlap poorly — the bonds bend outward (“banana bonds”) and the stored cost is angle strain. Cyclopropane (~60°) and cyclobutane (~90°) are badly strained; cyclopentane and especially cyclohexane sit close to the ideal and pucker to relieve what little strain remains.

Background 2 — why normalise per CH₂

The rings differ in size, so their total energies aren’t comparable — a bigger ring is lower in energy simply because it has more atoms. The fix is to divide each molecule’s energy by its number of CH₂ units, giving energy per CH₂. A strain‑free CH₂ in a long alkane sets the baseline; any excess per CH₂ is the ring strain. Textbook total ring strains are roughly ~27 (C3), ~26 (C4), ~6 (C5) and ~0 kcal/mol (C6) — so per‑CH₂ strain falls steeply from the small rings to a near‑zero hexagon.

Background 3 — torsional strain and the chair

Angle strain isn’t the whole story. Flat small rings also suffer torsional (eclipsing) strain because their C–H bonds are forced near‑eclipsed; this is a large part of cyclopropane’s and cyclobutane’s cost. Cyclohexane escapes both by folding into the chair conformation, where every angle is ~111° and all bonds are perfectly staggered — essentially zero strain. That stored strain in the small rings is exactly why cyclopropane reacts almost like an alkene, opening readily. GFN2‑xTB is fast and approximate (gas‑phase), but it reproduces this per‑CH₂ ordering cleanly.

Molecules

cyclopropane cyclobutane cyclopentane cyclohexane

Procedure

Optimize & compute each cycloalkane (CₙH₂ₙ) and record its total energy.
Divide each energy by the number of carbons n to get a per-CH₂ value the rings can be compared on.
Rank the rings by per-CH₂ energy and identify which store the most strain.

	C₃ (n=3)	C₄ (n=4)	C₅ (n=5)	C₆ (n=6)
energy (Eh)
energy per CH₂ (Eh)

Expected finding

Energy per CH₂ is highest (least stable) for cyclopropane and cyclobutane — their bond angles are forced far from 109.5°, so they store angle strain. Cyclohexane is the most stable per CH₂ (the chair has ideal angles, ~0 strain). That stored strain is exactly why small rings open up so easily.

Questions

Why do you have to compare energy per CH₂ rather than the total energy of each ring? What would go wrong if you compared totals directly?
Name the two kinds of strain at work in cyclopropane and cyclobutane, and say which one a planar geometry makes unavoidable.
Do the arithmetic. Using total ring strains of ~27 (cyclopropane) and ~6 kcal/mol (cyclopentane), compute the strain per CH₂ for each (3 and 5 carbons). By what factor is cyclopropane more strained per CH₂?
Cyclohexane comes out essentially strain‑free. What conformation lets it achieve ~109.5° angles and fully staggered bonds at once?
Apply it. Stored strain is released when a ring opens. Predict which of cyclopropane or cyclohexane reacts more readily in a ring‑opening reaction, and explain why cyclopropane behaves “almost like an alkene.”
The C–C–C internal angle is forced to ~60° in cyclopropane. Why does forcing the angle so far below 109.5° cost energy at the level of orbital overlap?

Worked answers & discussion

Q1. Bigger rings have more atoms and therefore lower total energy automatically — comparing totals would just rank rings by size, not by strain. Dividing by the number of CH₂ units puts every ring on a per‑unit basis, so the excess above a strain‑free CH₂ isolates the ring strain itself.
Q2. Angle strain (C–C–C angles forced far from 109.5°) and torsional strain (eclipsed C–H bonds). A flat ring makes the torsional/eclipsing strain unavoidable, since the hydrogens are locked nearly eclipsed in the plane.
Q3. Cyclopropane: 27 ÷ 3 = 9 kcal/mol per CH₂. Cyclopentane: 6 ÷ 5 = 1.2 kcal/mol per CH₂. Cyclopropane is about 7.5× more strained per CH₂ — a huge difference, which is why it is the standout reactive ring.
Q4. The chair conformation: it puckers so every C–C–C angle is ~111° (essentially tetrahedral) and every adjacent C–H pair is perfectly staggered, killing both angle and torsional strain — hence ~0 strain.
Q5. Cyclopropane reacts far more readily. Opening its ring releases ~27 kcal/mol of stored strain, providing a strong thermodynamic push, and its bent “banana” bonds have high p character and poor overlap, making them weak and electron‑rich — so cyclopropane undergoes addition‑like ring‑opening much as an alkene’s π bond does. Cyclohexane, being strain‑free, has no such driving force.
Q6. sp³ hybrid orbitals point at ~109.5°. Forcing the C–C–C angle to ~60° means the orbitals can’t point directly at each other; they overlap off‑axis (bent “banana” bonds) with reduced overlap, giving weaker, higher‑energy bonds — that energy penalty is the angle strain.
Take‑home. Normalising to energy per CH₂ exposes ring strain that totals hide: it plummets from ~9 kcal/mol per CH₂ in cyclopropane to nearly zero in the strain‑free cyclohexane chair, which is precisely why small rings open so easily.

Lab 9Keto vs enol▶

A molecule can flip between two tautomers — which one wins, and can an internal H-bond tip the balance?

Why it matters. Tautomers are isomers that interconvert by shifting a single proton, and the keto/enol equilibrium sits underneath enolate chemistry, the behaviour of sugars, and even how DNA bases pair. This lab shows that a subtle stabilizing effect can flip the "obvious" answer — and teaches the honest lesson that fast methods don't always nail close energy contests.

optimize (xTB)tautomersrelative energy

You’ll learn to: compare tautomer energies, explain how conjugation and an internal H-bond stabilise an enol, and judge when a fast method is too close to call.

Background 1 — what tautomers are

Tautomers are constitutional isomers that interconvert by relocating a single proton together with a double bond. In the keto–enol pair, the keto form has a C=O and an α‑C–H; shifting that α hydrogen onto the oxygen converts it to the enol (C=C–O–H). Because only a proton moves, the two are a genuine equilibrium, not different molecules in the usual sense. For an ordinary ketone such as acetone the keto form is far more stable — the enol lies many kcal/mol higher, so acetone is essentially 100% keto.

Background 2 — why keto normally wins

The keto preference comes from bond energies: the keto form keeps a strong C=O π bond and a C–H bond, whereas the enol trades these for a weaker C=C π bond and an O–H bond. Summing typical bond strengths leaves the keto tautomer lower by roughly 10–15 kcal/mol for a simple ketone — an equilibrium so lopsided that the enol is a vanishing trace. This is the “obvious” answer, and it’s why most carbonyl compounds are drawn and behave as the keto form.

Background 3 — why acetylacetone breaks the rule (& a method caveat)

Acetylacetone (pentane‑2,4‑dione) is a 1,3‑diketone, and its enol is special. The enol places a C=C between two carbonyls, giving an extended conjugated π system, and it forms a six‑membered ring closed by a strong intramolecular O–H···O hydrogen bond. Together these nearly cancel the usual keto advantage, so the two tautomers come out only a few kcal/mol apart — and experimentally the enol is the major gas‑phase form. Caveat: when two states are nearly degenerate, a fast method like GFN2‑xTB can pick the wrong winner (it slightly favours keto here); the robust lesson is the unusual near‑degeneracy, not the sign of a sub‑kcal gap.

Molecules

acetylacetone (keto) acetylacetone (enol) acetone (keto only)

Procedure

Optimize the keto and the enol form of acetylacetone (they share the formula C₅H₈O₂) and record both total energies.
Convert the difference to kcal/mol (×627.5) to see how close the two tautomers really are.
Examine the enol's 3D shape and look for the internal O–H⋯O hydrogen bond that stabilises it.

	keto	enol
energy (Eh)
more stable?

Expected finding

For a normal ketone the keto form wins by a mile (the enol is many kcal/mol higher). Acetylacetone is special: its enol is stabilised by conjugation plus a strong intramolecular O–H⋯O hydrogen bond, so the two tautomers come out only a few kcal/mol apart — and experimentally the enol is actually the major gas-phase form. Note: which one the fast GFN2-xTB method places lower is sensitive to the method (it slightly favours keto here); the real lesson is the unusual near-degeneracy — special stabilisation has nearly erased the normal keto preference.

Questions

Define a tautomer in one sentence, and identify what moves and what changes when acetone’s keto form is converted to its enol.
Using bond energies, explain why the keto form of an ordinary ketone like acetone is more stable than its enol. Which bonds does the keto form keep that the enol gives up?
The special case. Name the two stabilising effects that make acetylacetone’s enol competitive with its keto form, and explain why neither is available to acetone.
Estimate a population. If acetone’s enol is ~12 kcal/mol above keto, use K = e^−ΔG/RT (RT ≈ 0.593 kcal/mol at 298 K) to estimate the fraction of molecules in the enol form. Is the enol detectable?
Your GFN2‑xTB run places acetylacetone’s keto form slightly below the enol, yet experiment says the enol dominates in the gas phase. What is the honest interpretation — is the lab “wrong,” and what should you actually take away?
Apply it. Keto–enol tautomerism underlies enolate chemistry and even DNA base pairing. Give one reason why a rare tautomer of a DNA base could matter biologically.

Worked answers & discussion

Q1. Tautomers are isomers that interconvert by relocating a single proton (with an accompanying double‑bond shift). Converting acetone keto → enol moves an α C–H hydrogen onto the carbonyl oxygen, turning C=O + C–H into C=C + O–H.
Q2. The keto form keeps a strong C=O π bond and a C–H bond; the enol trades these for a weaker C=C π bond and an O–H bond. The C=O / C–H combination is worth more, so keto sits ~10–15 kcal/mol lower for a simple ketone.
Q3. (1) Conjugation — the enol’s C=C lies between the two carbonyls of the 1,3‑diketone, giving an extended delocalised π system; and (2) a strong intramolecular O–H···O hydrogen bond closing a six‑membered ring. Acetone has only one carbonyl, so it can offer neither conjugation between two C=O groups nor an internal H‑bond.
Q4. K = e^−12/0.593 = e^−20.2 ≈ 1.7×10⁻⁹. So well under one molecule in 10⁸ is enol — a trace far too small to matter, confirming acetone is effectively all keto.
Q5. The lab isn’t “wrong” in its real lesson. When two states are within a couple of kcal/mol, a fast semi‑empirical method can’t reliably resolve which is lower — the sub‑kcal sign is below its accuracy. The take‑away is the near‑degeneracy itself: special stabilisation (conjugation + H‑bond) has almost erased the normal ~12 kcal/mol keto preference, bringing the tautomers to within a few kcal/mol — that’s the chemistry, and it’s an honest reminder that fast methods can miss close contests.
Q6. A rare enol/imino tautomer of a DNA base changes its hydrogen‑bonding pattern, so it can mispair (e.g. an unusual tautomer pairing with the wrong partner). If that happens during replication it can cause a point mutation — tautomeric shifts are one proposed source of spontaneous mutations.
Take‑home. Keto normally wins by a landslide, but acetylacetone’s conjugation‑plus‑intramolecular‑H‑bond stabilisation drags its enol to near‑degeneracy — a contest so close that a fast method can pick the wrong side, which is itself the lesson.

Lab 10Hot or cold? Reaction energetics▶

Will a reaction release or absorb heat, will it happen on its own — and can temperature flip the answer?

Why it matters. Three numbers decide a reaction's fate: ΔH (heat released or absorbed), ΔS (the change in disorder), and the one that actually rules spontaneity, ΔG = ΔH − TΔS. Their interplay explains why some reactions need heating, why others reverse when hot, and why an "uphill" reaction can still be driven by entropy. The equilibrium constant K = e^−ΔG/RT turns that energy into how far the reaction actually goes.

thermodynamics calculatorΔH · ΔS · ΔGequilibrium Kcrossover T

Tool: Reaction thermodynamics (Studio → Calculators) — use the example-reaction buttons

You’ll learn to: use ΔG = ΔH − TΔS to predict spontaneity, locate the crossover temperature, and connect ΔG to the equilibrium constant K.

Background 1 — the three numbers and ΔG = ΔH − TΔS

A reaction’s fate is set by three quantities. ΔH is the heat exchanged: negative means exothermic (heat released, bonds in products stronger), positive means endothermic. ΔS is the change in disorder: positive when the products are more spread out (more moles of gas, more freedom). Spontaneity is decided not by either alone but by the Gibbs free energy, ΔG = ΔH − TΔS. A reaction is spontaneous (forward‑favoured) only when ΔG < 0. Temperature enters only through the −TΔS term, which is why heat can help or hurt depending on the sign of ΔS.

Background 2 — reading the four sign combinations

The signs of ΔH and ΔS give four cases. ΔH<0, ΔS>0: spontaneous at all temperatures (both terms favour it). ΔH>0, ΔS<0: never spontaneous. The two interesting temperature‑dependent cases flip at the crossover temperature T = ΔH/ΔS (where ΔG = 0): an exothermic, entropy‑losing reaction (ΔH<0, ΔS<0, like Haber) is spontaneous only below T; an endothermic, entropy‑gaining reaction (ΔH>0, ΔS>0, like limestone decomposition) becomes spontaneous only above T. Reading the two signs together tells you instantly which case you’re in.

Background 3 — from ΔG to how far it goes (K), & a caveat

ΔG fixes the equilibrium constant through K = e^−ΔG/RT. Because K depends exponentially on ΔG, a strongly negative ΔG (like methane combustion, ΔG ≈ −818 kJ/mol) gives an astronomically large K (~10¹⁴³) — the reaction runs essentially to completion. A modestly negative ΔG gives an incomplete, balanced equilibrium; a positive ΔG gives K<1. Caveat: this calculator uses tabulated standard ΔH° and ΔS° (298 K) and assumes they don’t change with temperature — a good approximation for estimating crossover temperatures, but not an exact, fully temperature‑corrected treatment.

Procedure

Open the Reaction thermodynamics calculator and run methane combustion, the Haber process, and limestone decomposition (the preset buttons). For each, record ΔH, ΔS, ΔG and K at 298 K, and whether it is spontaneous.
For limestone (CaCO₃ → CaO + CO₂), read off the crossover temperature where ΔG = 0 — above it the reaction turns spontaneous. (This is the real temperature a lime kiln runs at.)
For the Haber process, raise the temperature to 800 K and watch ΔG and K change — explain why ammonia synthesis is run warm but not too hot.

	methane comb.	Haber	limestone
ΔH (kJ/mol)
ΔG₂₉₈ (kJ/mol)
spontaneous at 298 K?
crossover T (K)

Expected finding

Methane combustion: ΔH ≈ −890 kJ/mol (strongly exothermic), ΔG ≈ −818, spontaneous, K ≈ 10¹⁴³ (goes to completion). Haber (N₂+3H₂→2NH₃): exothermic but ΔS < 0 — spontaneous at 298 K (ΔG ≈ −33) but it turns non-spontaneous above ~460 K, so industry compromises (moderate heat for speed, high pressure to push K). Limestone: endothermic (ΔH ≈ +179) and non-spontaneous at room temperature, but ΔS > 0 means it turns spontaneous above ~1120 K (~850 °C) — exactly a lime-kiln temperature. The lesson: sign of ΔH and ΔS together, read through ΔG = ΔH − TΔS, predicts and tunes spontaneity.

Questions

Write out ΔG = ΔH − TΔS and explain why a reaction’s spontaneity is governed by ΔG rather than by ΔH alone.
For each of the four sign combinations of (ΔH, ΔS), state whether the reaction is spontaneous always, never, only at high T, or only at low T.
The Haber process (N₂ + 3H₂ → 2NH₃) is exothermic but has ΔS < 0. Explain why it is spontaneous at 298 K yet turns non‑spontaneous above ~460 K, and why industry still runs it hot.
Find the crossover. Limestone decomposition has ΔH ≈ +179 kJ/mol and ΔS ≈ +160 J/(mol·K). Compute the crossover temperature T = ΔH/ΔS (watch the units) and compare it to a lime‑kiln temperature.
Methane combustion gives ΔG ≈ −818 kJ/mol and K ≈ 10¹⁴³. Using K = e^−ΔG/RT, explain qualitatively why such a modest‑looking ΔG produces such an enormous K, and what that means for how far the reaction goes.
Apply it. An endothermic reaction with ΔS > 0 won’t go at room temperature. What single, free knob can you turn to make it spontaneous, and which thermodynamic term does that knob act on?

Worked answers & discussion

Q1. ΔG = ΔH − TΔS. A reaction can release heat (ΔH<0) yet still be forbidden if it orders the system enough (ΔS<0 with a large −TΔS penalty), or it can absorb heat yet proceed if it creates enough disorder. Only ΔG — which balances both — tells you the direction; spontaneous means ΔG<0.
Q2. (−,+): spontaneous at all temperatures. (+,−): never spontaneous. (−,−): spontaneous only at low T (below T=ΔH/ΔS). (+,+): spontaneous only at high T (above T=ΔH/ΔS).
Q3. Haber is ΔH<0, ΔS<0 (4 gas moles → 2). At 298 K the favourable −ΔH outweighs the −TΔS penalty, so ΔG≈−33 kJ/mol < 0. As T rises the −TΔS penalty grows until it overtakes ΔH at the crossover (~460 K), making ΔG>0. Industry still heats it because the room‑temperature rate is hopelessly slow; they accept a moderate temperature (a smaller K) for usable speed and then restore yield with high pressure, which by Le Chatelier pushes the 4→2‑mole equilibrium toward ammonia.
Q4. T = ΔH/ΔS = 179{,}000 J ÷ 160 J/(mol·K) ≈ 1119 K ≈ 850 °C. (Convert ΔH to joules to match ΔS.) That is exactly a lime‑kiln operating temperature — the endothermic, gas‑releasing decomposition only becomes spontaneous once you push above ~1120 K.
Q5. K = e^−ΔG/RT depends exponentially on ΔG. With ΔG = −818{,}000 J/mol and RT ≈ 2478 J/mol at 298 K, the exponent is ~+330, and e³³⁰ ≈ 10¹⁴³. Because ΔG sits in an exponent, even a “moderate” energy translates into an astronomically large K — the reaction goes essentially to completion.
Q6. Raise the temperature. For ΔS>0 the −TΔS term becomes more negative as T grows, eventually driving ΔG below zero (above T=ΔH/ΔS). Temperature acts through the entropy term — that’s why heating drives entropy‑favoured endothermic reactions.
Take‑home. Read the signs of ΔH and ΔS together through ΔG = ΔH − TΔS: they tell you whether a reaction goes, whether temperature flips it (at T = ΔH/ΔS), and — via K = e^−ΔG/RT — how far.

★ Advanced labs

Molecular orbitals, TD-DFT spectra, reactivity maps and torsional scans — real quantum chemistry. (Small molecules; these take a little longer to compute.)

Lab 11Conjugation, the gap & colour▶

Why do longer conjugated molecules absorb redder light? Tie the HOMO–LUMO gap to the UV-Vis spectrum.

Why it matters. Conjugation — a chain of alternating double bonds sharing their π electrons — shrinks the energy gap between the highest filled and lowest empty orbital, and that gap fixes the wavelength of light a molecule absorbs. It's the reason dyes and pigments are coloured, why carrots are orange, and how the chromophore in your retina detects light. This lab links an orbital number directly to a spectrum.

orbitals (gap)UV-Vis (TD-DFT)

You’ll learn to: link the HOMO–LUMO gap to absorption wavelength via ΔE = hc/λ, and explain why extending conjugation red-shifts a molecule’s colour.

Background 1 — conjugation spreads the π system

When double bonds alternate with single bonds, their p orbitals overlap continuously and the π electrons delocalise over the whole chain. Each new conjugated double bond adds another π and another π^* level, so a set of evenly spaced bonding and antibonding orbitals builds up. Crucially, the orbitals spread apart in energy: the highest HOMO rises and the lowest LUMO falls. Ethylene has one isolated π/π^* pair; 1,3-butadiene has two of each; benzene has a fully cyclic, aromatic six-electron π system. More conjugation always means a smaller frontier gap.

Background 2 — particle-in-a-box intuition

A clean way to see why is the particle-in-a-box model: treat the delocalised π electrons as confined to a box of length L (the conjugated chain). The allowed energies are E_n = n²h²/8mL², so levels scale as 1/L². Lengthen the box and every gap, including the HOMO→LUMO gap, shrinks. This toy model captures the headline trend without any heavy computation: a longer conjugated chain → a smaller ΔE → a lower-energy, longer-wavelength absorption. The same scaling underlies the dyes and the carotene in carrots.

Background 3 — from gap to colour (TD-DFT)

The absorbed photon energy obeys ΔE = hc/λ, so λ is inversely proportional to the gap. A smaller gap pushes absorption to longer λ (a red-shift); once it reaches the visible (≈400–700 nm), the molecule shows the complementary colour. The bare orbital-energy difference is only a rough estimate — it ignores electron–hole attraction and orbital relaxation — so MoleBench uses TD-DFT for a proper excited-state energy. Treat both as qualitative: the ordering ethylene > butadiene > benzene-region gaps is robust, but absolute nm values from a single functional carry real error.

Molecules

ethylene 1,3-butadiene benzene

Procedure

For each molecule, Compute orbitals (panel 4) and read the HOMO–LUMO gap in eV.
In panel 3, press UV-Vis (TD-DFT) and record the longest-wavelength (lowest-energy) absorption band.
Compare gap against wavelength across the series — as conjugation grows, which way does each move?

	ethylene	butadiene	benzene
HOMO–LUMO gap (eV)
longest λ (nm)

Expected finding

More conjugation → a smaller HOMO–LUMO gap → absorption shifts to longer wavelength (red-shift). Extend the idea: a long enough conjugated chain absorbs visible light — which is literally why carrots and dyes are coloured.

Questions

Rank ethylene, 1,3-butadiene and benzene by HOMO–LUMO gap, and explain in terms of how many π levels each builds and how far they spread.
Using ΔE = hc/λ, compute the wavelength absorbed by a molecule whose gap is 6.0 eV, and again for one whose gap is 2.5 eV. Which value lands in the visible, and what does that imply about colour? (Take hc ≈ 1240 eV·nm.)
The simple orbital-energy difference and the TD-DFT excitation energy disagree somewhat. Name two physical effects the bare gap leaves out that TD-DFT partly recovers.
In the particle-in-a-box picture, energy levels scale as 1/L². If you doubled the conjugated chain length, by roughly what factor would the HOMO–LUMO gap change, and which way would λ shift?
Predict & apply. β-carotene has eleven conjugated double bonds and looks orange. Using the trends from this lab, explain why it absorbs visible (blue) light while ethylene's absorption is far in the UV — and why we therefore see carotene as orange.
Benzene is cyclic and aromatic rather than a short open chain. Why is a simple linear box length only a loose proxy for its gap, and how does that caution your ranking in Q1?

Worked answers & discussion

Q1. Gap order is ethylene (largest) > 1,3-butadiene > benzene-region (smallest). Ethylene has a single π/π^* pair widely separated. Butadiene mixes two π orbitals into a bonding-pair whose HOMO is raised and two π^* whose LUMO is lowered, narrowing the gap. Adding conjugation keeps stacking levels and squeezing the frontier pair together — so more conjugation always means a smaller gap and a red-shift.
Q2. λ = hc/ΔE = 1240/ΔE(eV) nm. For 6.0 eV: λ ≈ 207 nm (deep UV). For 2.5 eV: λ ≈ 496 nm (visible, blue-green). Only the small-gap molecule absorbs visible light, so only it can be coloured; the wide-gap one is colourless because its absorption is buried in the UV.
Q3. The bare HOMO–LUMO difference omits (i) the electron–hole (excitonic) attraction that lowers the true excitation energy, and (ii) orbital relaxation — the orbitals readjust once an electron is promoted. TD-DFT recovers these by computing the actual excited state rather than subtracting two ground-state orbital energies.
Q4. Since E ∝ 1/L², doubling L drops every level (and the gap) by roughly a factor of 4. A four-fold smaller gap means λ ≈ 4× longer — a strong red-shift toward and into the visible.
Q5. β-carotene's eleven conjugated double bonds form a very long "box," so its gap is small enough to absorb visible blue light (≈450–480 nm); we see the complementary colour, orange. Ethylene's single, isolated π bond has a huge gap, so it absorbs only in the far UV and appears colourless. Same governing principle — longer conjugation, smaller gap, redder absorption — scaled up.
Q6. Benzene is a closed aromatic ring, not a linear chain, so its delocalisation pattern and degenerate frontier orbitals aren't captured by a simple 1D box length; the box model gives the right direction but not a reliable cyclic gap. So the "more conjugation, smaller gap" ordering holds qualitatively, but benzene's exact placement is best trusted from the TD-DFT result, not the box analogy.
Take‑home. Each added conjugated double bond shrinks the HOMO–LUMO gap, which by ΔE = hc/λ red-shifts absorption — the orbital-level origin of why long conjugated chains, from dyes to carotene, are coloured.

Lab 12Where does it react? (reactivity maps)▶

How do an electron-donating vs electron-withdrawing group change where a ring is attacked?

Why it matters. Electrophilic aromatic substitution — how benzene rings are functionalised to build dyes, drugs and polymers — is decided by where the ring is most electron-rich. Substituents either push electrons in (activating, directing to ortho/para) or pull them out (deactivating, meta). Property maps turn those arrow-pushing rules you memorised into something you can actually see and rank.

ESP mapLUMO mapionization mapreactivity indices

You’ll learn to: read ESP and LUMO maps to predict where an aromatic ring is attacked, and classify substituents as activating (o/p) or deactivating (m).

Background 1 — electrophilic aromatic substitution follows electron density

In electrophilic aromatic substitution (EAS) an electron-poor electrophile attacks the ring's π cloud, so the reaction goes fastest where the ring is most electron-rich. A substituent that donates electron density — phenol's –OH, whose oxygen lone pair conjugates into the ring — is activating and steers the electrophile to the ortho/para positions, where the donated density piles up. A substituent that withdraws density — nitrobenzene's –NO₂, which pulls π density out by resonance and induction — is deactivating and meta-directing, because ortho/para attack would place positive charge next to the electron-hungry group.

Background 2 — reading the electrostatic-potential and frontier maps

An ESP map paints the molecular surface by electrostatic potential: red = electron-rich (negative, nucleophilic), blue = electron-poor. Phenol's ring face glows redder than benzene's — especially at ortho/para — while nitrobenzene's ring is drained toward blue. Because the electrophile is captured by the ring's filled orbitals, the HOMO and the local ionization energy map also flag the reactive sites: a lower ionization energy / higher HOMO amplitude marks where electrons are most available to share. These maps make the arrow-pushing "activating vs deactivating" rule something you can actually see.

Background 3 — putting a number on it: the electrophilicity index ω

Conceptual DFT condenses reactivity into indices from the frontier energies. The chemical potential μ ≈ ½(E_HOMO+E_LUMO) and hardness η ≈ (E_LUMO−E_HOMO) combine into the electrophilicity index ω = μ²/2η — how strongly a molecule wants electrons. Electron-withdrawing –NO₂ raises nitrobenzene's ω (a better electrophile, poorer at EAS), while electron-donating –OH lowers phenol's ω. These indices are derived from approximate orbital energies, so trust the ranking, not the absolute numbers.

Molecules

benzene phenol nitrobenzene

Procedure

Compute orbitals for each arene, then view the ESP map and the ionization-energy map — red marks the electron-rich sites an electrophile will attack.
Run DFT to get the conceptual-DFT reactivity indices, and note the electrophilicity ω and hardness η.
Decide which ring is activated and which is deactivated, and where each would react.

	benzene	phenol (–OH)	nitrobenzene (–NO₂)
most electron-rich site
electrophilicity ω (eV)

Expected finding

Phenol's –OH (electron-donating) makes the ring more electron-rich (ortho/para) — it reacts faster with electrophiles. Nitrobenzene's –NO₂ (electron-withdrawing) drains the ring (higher ω, deactivated). The maps make "activating vs deactivating" visible.

Questions

On the ESP maps, which ring face is reddest and which is bluest across benzene, phenol and nitrobenzene? Connect each colour to whether the substituent donates or withdraws electron density.
Why does –OH direct ortho/para while –NO₂ directs meta? Use resonance to explain where the donated (or withdrawn) density lands and why the electrophile follows.
Rank benzene, phenol and nitrobenzene by expected rate of EAS, and state which are activated and which deactivated relative to benzene.
The electrophilicity index is ω = μ²/2η, with μ ≈ ½(E_HOMO+E_LUMO) and η ≈ (E_LUMO−E_HOMO). Predict which molecule has the highest ω and explain what that says about its appetite for electrons versus its EAS reactivity.
EAS is governed by the ring's filled orbitals, yet a LUMO map is also offered. Why is the HOMO / ionization map the more relevant guide for predicting where an electrophile attacks?
Predict & apply. Aniline (–NH₂) and benzaldehyde (–CHO) aren't in the set. From the principles here, predict each one's directing effect and whether it activates or deactivates the ring.

Worked answers & discussion

Q1. Phenol's ring is the reddest (most electron-rich) because the –OH oxygen lone pair donates into the π system; benzene is intermediate; nitrobenzene's ring is the bluest (drained) because –NO₂ withdraws density — its own oxygens carry the red. Red tracks donation, blue tracks withdrawal.
Q2. Drawing the resonance structures for –OH donation places the extra electron density (and the negative formal charge) specifically at the ortho and para carbons, so the electrophile is captured there. For –NO₂, the withdrawing resonance puts positive character at ortho/para; attack there would set a cation next to the electron-poor nitro — destabilising — so the least-bad site is meta. The electrophile follows the electron density.
Q3. Rate: phenol > benzene > nitrobenzene. Phenol is strongly activated (reacts faster than benzene); nitrobenzene is strongly deactivated (much slower).
Q4. Nitrobenzene has the highest ω — –NO₂ lowers both frontier orbitals and makes the molecule hungriest for electrons. Note the irony: high ω means it is a good electron acceptor, which is exactly why it is a poor substrate for EAS (an electrophile-seeking-electrons reaction). Phenol has the lowest ω and the highest EAS reactivity.
Q5. An incoming electrophile bonds to electrons the ring gives up, so the relevant frontier orbital is the HOMO; the local ionization-energy map shows where electrons are most loosely held and thus most available. The LUMO map matters for the opposite case — nucleophilic attack — so it is the secondary guide here.
Q6. Aniline's –NH₂ donates a nitrogen lone pair even more strongly than –OH: strongly activating, ortho/para-directing. Benzaldehyde's –CHO withdraws by resonance (C=O): deactivating, meta-directing — like nitrobenzene but milder.
Take‑home. EAS is steered by ring electron density: donating groups (–OH, –NH₂) make the ring redder, activated and ortho/para-directing, while withdrawing groups (–NO₂, –CHO) drain it bluer, raise ω and deactivate it toward meta.

Lab 13Bonds & lone pairs▶

Where are the bonding electrons and lone pairs really located? Count them for the first-row hydrides.

Why it matters. The molecular orbitals that quantum chemistry computes are spread over the whole molecule and don't look anything like the neat bonds and lone pairs you draw. Localized orbitals mathematically recover that familiar Lewis picture — they are the bridge between rigorous electronic structure and the dot diagrams of intro chemistry, and they show those drawings are more than a convenient fiction.

localized orbitalsorbital diagram

You’ll learn to: connect the delocalised orbitals quantum chemistry computes to the familiar Lewis bonds and lone pairs, and count them for the first-row hydrides.

Background 1 — canonical orbitals don't look like Lewis bonds

The molecular orbitals a quantum calculation hands you — the canonical orbitals, eigenfunctions of the Fock operator — are spread symmetrically over the whole molecule. Water's four occupied valence orbitals, for instance, are delocalised combinations with the molecule's full symmetry; none of them is a single "O–H bond" or a single "lone pair." This is correct physics, but it clashes with the intro-chemistry picture of two localized O–H bonds and two distinct lone pairs. The total electron density is identical either way — the question is only which orthogonal basis of orbitals you choose to describe it.

Background 2 — localization recovers the Lewis picture

Because any unitary mixing of the occupied orbitals leaves the total density and energy unchanged, you are free to rotate them into the most spatially compact, separated set. Localization schemes (Boys, Pipek–Mezey, Edmiston–Ruedenberg) do exactly this — Boys minimises the orbitals' spatial spread, Pipek–Mezey maximises atomic charge localization. The result is the familiar inventory: two-centre bonding orbitals sitting between bonded atoms and one-centre lone pairs on a single atom, plus an essentially untouched 1s core. Localized orbitals are the rigorous bridge from delocalised MOs to the dot diagrams of general chemistry.

Background 3 — counting pairs across the first-row hydrides

The central atom contributes its valence electrons to four electron pairs total — the octet. Carbon in methane has four bonding pairs and zero lone pairs; nitrogen in ammonia, three bonds and one lone pair; oxygen in water, two bonds and two lone pairs. That is precisely the VSEPR "four pairs around the central atom" that explains their shapes (tetrahedral → pyramidal → bent). Localization reproduces this exactly — though note real lone pairs are often two equivalent rabbit-ear orbitals rather than the textbook one-s/one-p split, and the count, not the precise shape, is the robust result.

Molecules

methane ammonia water

Procedure

Compute orbitals for each hydride, then click Localized (bonds) to transform them into bond and lone-pair orbitals.
Step through the localized orbitals and classify each as an X–H bond (spread over two atoms) or a lone pair (sitting on one atom).
Count the bonds and lone pairs and check they match the Lewis structure and VSEPR.

	methane	ammonia	water
# X–H bonds
# lone pairs

Expected finding

Methane = 4 bonds, 0 lone pairs; ammonia = 3 bonds, 1 lone pair; water = 2 bonds, 2 lone pairs (plus a 1s core each). The localized orbitals reproduce the VSEPR "4 electron pairs around the central atom" picture exactly.

Questions

For methane, ammonia and water, state the number of bonding pairs and lone pairs the localized orbitals reveal on the central atom, and confirm each totals four valence pairs.
The canonical and localized orbitals describe the same molecule, yet look completely different. What is conserved when you switch from one set to the other, and why does that make the localization legitimate?
Methane, ammonia and water all have four electron pairs around the central atom but different shapes (tetrahedral, pyramidal, bent). Explain how the lone-pair count from this lab accounts for the shape change, linking back to VSEPR.
Each central atom also has a low-lying, nearly spherical orbital that doesn't participate in bonding. Identify it and explain why localization barely touches it.
The textbook draws water's two lone pairs as one s-rich and one p-type orbital, but a Boys localization often gives two equivalent "rabbit-ear" lone pairs. Which feature — the precise lone-pair shapes or the count of two — is the robust, method-independent result, and why?
Predict & apply. Using the same counting, predict the bonds and lone pairs a localization would give for hydrogen fluoride (HF) and for the hydronium ion (H₃O⁺), and the shape each implies.

Worked answers & discussion

Q1. Methane: 4 bonding pairs, 0 lone pairs. Ammonia: 3 bonding pairs, 1 lone pair. Water: 2 bonding pairs, 2 lone pairs. Each central atom carries exactly four valence pairs — a filled octet — plus a separate 1s core.
Q2. What is conserved is the total electron density (and the total energy): localized orbitals are just a unitary rotation of the same occupied space, so they integrate to identical observables. Because any orthonormal mixing of filled orbitals is physically equivalent, choosing the most compact, separated set is mathematically legitimate — it changes the picture, not the physics.
Q3. Replacing a bonding pair with a lone pair removes one ligand and lets the remaining bonds bend: 4 bonds → tetrahedral methane; 3 bonds + 1 LP → pyramidal ammonia; 2 bonds + 2 LP → bent water. The lone-pair count is exactly the VSEPR input, and lone-pair repulsion further compresses the angles (109.5° → ~107° → ~104.5°).
Q4. It is the 1s core orbital — a tight, spherical shell of inner electrons already localized on the single central atom. Since it is essentially atomic and not shared, localization has almost nothing to rotate it with, so it comes through unchanged.
Q5. The robust, method-independent result is the count of two lone pairs (and the total electron density). The precise shapes — equivalent rabbit-ears versus an s/p split — depend on the localization criterion (Boys vs the canonical view) and are interconvertible, so they are a matter of representation, not physics.
Q6. HF: 1 bonding pair + 3 lone pairs on F (linear, trivially). H₃O⁺: 3 bonding pairs + 1 lone pair on O — isoelectronic with ammonia, so pyramidal. Both keep four valence pairs around the central atom.
Take‑home. Localized orbitals rotate the delocalised MOs — at constant total density — into exactly the bonds and lone pairs of the Lewis/VSEPR picture: 4/0, 3/1 and 2/2 for methane, ammonia and water.

Lab 14Torsional barriers▶

How big is the energy barrier to rotating a single bond, and what does the curve look like?

Why it matters. The energy cost of twisting around a single bond — the torsional barrier — sets which conformers a molecule adopts, contributes to how proteins fold, and even feeds into reaction rates. A relaxed dihedral scan is exactly the method computational chemists use to map these barriers, and the same three molecules give three strikingly different rotational profiles.

dihedral scanchoose bondCSV download

You’ll learn to: map a single-bond rotational profile with a relaxed dihedral scan, and interpret its barrier heights and minima.

Background 1 — the torsional barrier and the relaxed scan

Rotating about a single bond traces a periodic energy curve, the torsional (rotational) potential, whose maxima are barriers between conformers. MoleBench maps it with a relaxed dihedral scan: the chosen dihedral ω is held fixed at each step while every other coordinate is re-optimized. This is essential — a rigid scan that froze the rest of the molecule would inflate barriers by forcing bad bond lengths and angles at the eclipsed points. The constrained minimisation lets the molecule relieve what strain it can, so the curve reflects the true rotation cost. The numbers come from GFN2-xTB, fast and qualitatively reliable for these small barriers.

Background 2 — ethane: hyperconjugation, not just sterics

Ethane gives the textbook case: a clean threefold (period 120°) curve with three equivalent staggered minima and three eclipsed maxima, a barrier of about 3 kcal·mol⁻¹. The minima aren't mainly about the tiny H···H steric clash; the dominant stabilisation is hyperconjugation — in the staggered geometry each C–H bonding orbital aligns to donate into the antiperiplanar C–H σ^* antibond, which it cannot do when eclipsed. The barrier is the cost of switching off that favourable σ→σ^* overlap.

Background 3 — butane and H₂O₂: richer curves

Butane's central C–C scan is no longer threefold-symmetric: the bulky methyls make a deep global anti minimum (180°), two shallower gauche minima (±60°) a bit higher, and an extra-tall syn-eclipsed barrier (~4–6 kcal·mol⁻¹) where the two methyls eclipse. Hydrogen peroxide breaks the pattern entirely: its minimum is non-planar (a skew O–O–H dihedral near 110–120°), a compromise between lone-pair–lone-pair repulsion (avoided at 0°) and favourable orbital interactions (lost at 180°). One tool, three strikingly different profiles — export the CSV to plot them.

Molecules

ethane butane hydrogen peroxide

Procedure

Build and Optimize each molecule, then in panel 5 choose the bond to rotate (it highlights gold).
Scan it 0 → 360°; read the barrier height off the energy curve and count how many minima appear.
Click points on the curve to inspect the eclipsed and staggered geometries, and download the CSV.

	ethane (H–C–C–H)	butane (C–C–C–C)	H₂O₂ (H–O–O–H)
barrier (kcal/mol)
# minima

Expected finding

Ethane: a small ~3 kcal/mol threefold barrier (staggered vs eclipsed). Butane: a richer curve with anti + gauche minima and a ~4–5 kcal/mol eclipsed barrier. H₂O₂ has a non-planar minimum. Same tool, three very different rotational profiles.

Questions

Sketch the expected shape of ethane's scan: how many minima and maxima appear over a full 360°, what is its periodicity, and which geometries (staggered/eclipsed) correspond to each?
The standard ethane barrier is about 3 kcal·mol⁻¹. Explain why hyperconjugation — not steric H···H repulsion alone — is the leading reason the staggered form is lower.
Compare butane's central-bond curve to ethane's: identify the anti and gauche minima and the syn-eclipsed barrier, and explain why butane's curve loses ethane's threefold symmetry.
Hydrogen peroxide's minimum is neither planar-cis (0°) nor planar-trans (180°) but a skewed angle in between. Explain the two competing effects that push it off both planar geometries.
Why does MoleBench run a relaxed rather than a rigid scan, and how would the reported barriers differ if the rest of the molecule were frozen at each dihedral step?
Predict & apply. Using the staggered/anti minimum and ethane's ~3 kcal·mol⁻¹ barrier, estimate roughly how readily ethane interconverts between staggered forms at room temperature (thermal energy RT ≈ 0.6 kcal·mol⁻¹). Is the rotation effectively free?

Worked answers & discussion

Q1. Over 360° ethane shows three minima and three maxima with a 120° periodicity (threefold symmetry). The three equal minima are the staggered conformers; the three equal maxima are the eclipsed ones, ~3 kcal·mol⁻¹ higher.
Q2. The H···H atoms are small and far apart, so direct steric repulsion is minor. The dominant effect is hyperconjugation: only in the staggered geometry can each filled C–H σ orbital align antiperiplanar to a C–H σ^* on the other carbon and donate into it, a stabilising σ→σ^* interaction. Eclipsing kills that overlap, so the ~3 kcal·mol⁻¹ barrier is mostly the lost hyperconjugation.
Q3. Butane has a deep anti minimum at 180°, two shallower gauche minima at ±60° (raised by methyl–methyl strain), and a tall syn-eclipsed barrier (~4–6 kcal·mol⁻¹) at 0° where the methyls eclipse. The bulky, inequivalent substituents break ethane's threefold symmetry — the three minima and three maxima are no longer equal.
Q4. At the planar-cis (0°) geometry the two oxygen lone pairs eclipse and repel, destabilising it; at planar-trans (180°) a favourable lone-pair / σ^* orbital interaction is lost. The molecule compromises at a skewed dihedral (~110–120°) that minimises lone-pair repulsion while preserving useful overlap — hence a non-planar minimum.
Q5. A relaxed scan re-optimizes all other coordinates at each fixed dihedral, so bonds and angles relieve strain and the curve reflects the genuine rotation cost. A rigid scan would freeze poor geometries at the eclipsed points and overestimate the barriers.
Q6. With a barrier of ~3 kcal·mol⁻¹ and RT ≈ 0.6 kcal·mol⁻¹, the barrier is only ~5× RT — low enough that thermal collisions clear it billions of times per second. Rotation is nearly free at room temperature: the molecule samples all three staggered wells rapidly, spending little time eclipsed.
Take‑home. A relaxed dihedral scan turns single-bond rotation into an energy curve — ethane's simple threefold ~3 kcal·mol⁻¹ barrier (hyperconjugation), butane's anti/gauche/syn structure, and H₂O₂'s skewed non-planar minimum — three molecules, three distinct rotational profiles.

Lab 15Substituent effects on the carbonyl▶

Pull four properties together: how do substituents change the C=O bond, its π* orbital, its IR stretch and the dipole?

Why it matters. This is a capstone: it ties four independent observables — geometry, orbital energy, vibration and dipole — to a single structural change. Real computational chemistry is rarely one number; it's about cross-checking several properties until a consistent story emerges. Practising that here trains the most transferable skill in the whole subject.

DFTLUMO mapIRdipole

You’ll learn to: cross-check four independent observables — geometry, π^* energy, IR stretch and dipole — on one functional group to build a consistent story.

Background 1 — one bond, four observables

The carbonyl is the same C=O group in formaldehyde, acetone and formic acid, yet four independent properties — the C=O bond length, the energy of the π^* (LUMO), the IR stretch and the molecular dipole — all shift when you change the substituents. They are not four separate facts: each reads out the same underlying change in how electron density sits on the carbon and oxygen. A capstone like this trains you to cross-check several numbers until one consistent electronic story explains them all, rather than trusting any single value in isolation.

Background 2 — how substituents push and pull

Acetone’s two methyls are weak electron donors (hyperconjugation + induction): they feed density toward the carbonyl carbon, easing the partial positive charge, lengthening C=O slightly and lowering the stretch. Formic acid’s –OH does two opposing things — an inductive σ-withdrawal of density and a resonance donation of an oxygen lone pair into the π^*. The lone-pair conjugation dominates the geometry, giving carboxylic acids a recognisably different carbonyl from a plain ketone or aldehyde, and a distinctly different dipole direction.

Background 3 — the π^*, the dipole and method caveats

The carbonyl LUMO is the π^* orbital, lobed largest on carbon — that is why nucleophiles attack carbon. A LUMO map paints this empty orbital onto the surface; reading it is qualitative, not a measured energy. The dipole points from the δ+ carbon toward the δ− oxygen and grows with C=O polarity. Remember these come from DFT (geometry, orbitals, dipole) and a harmonic IR treatment: absolute frequencies run ~5–10% high without scaling, and orbital energies are model-dependent, so trust the trends across the three molecules, not the raw digits.

Molecules

formaldehyde acetone formic acid

Procedure

For each carbonyl, run DFT (records the dipole and HOMO–LUMO gap).
Compute orbitals → LUMO map to see the electrophilic carbon, and Compute IR to read the C=O stretch.
Enter all four numbers in the table and look for correlations — does a bigger dipole track with a particular orbital or IR shift?

	formaldehyde	acetone	formic acid
dipole (D)
HOMO–LUMO gap (eV)
C=O stretch (cm⁻¹)

Expected finding

All have a carbonyl, but the alkyl groups of acetone and the –OH of formic acid shift the dipole, the LUMO (π*) energy and the C=O stretch. This is the capstone lab — one functional group seen through four independent MoleBench tools at once.

Questions

Cross-check the story: for each of the four observables (C=O length, π^*/LUMO energy, IR stretch, dipole), state whether you expect acetone to sit higher or lower than formaldehyde, and explain why a single electronic cause links all four.
The carbonyl LUMO is the π^* orbital. Why is its largest lobe on carbon rather than oxygen, and how does that explain why nucleophiles add to the carbonyl carbon?
Formic acid’s –OH both withdraws density inductively and donates a lone pair by resonance into the π^*. Which effect wins for the C=O geometry, and how would you tell from the bond length and stretch alone?
Acetone’s two methyls are electron-donating relative to formaldehyde’s two H’s. Predict the direction of the C=O stretch shift and the dipole change, and tie both to the same density change at the carbon.
Your computed IR stretch comes out near 1900 cm⁻¹ for formaldehyde, but the textbook aldehyde band is ~1730 cm⁻¹. What is the methodological reason for the gap, and why is the trend still trustworthy?
Predict/apply: acetyl chloride (CH₃COCl) has a strongly electron-withdrawing Cl with little π-donation. Relative to acetone, predict its C=O stretch, π^* energy and dipole, and say which lab molecule it should most resemble.

Worked answers & discussion

Q1. Acetone’s electron-donating methyls relieve the carbonyl carbon’s δ+, weakening C=O. So relative to formaldehyde: C=O length is slightly longer, the force constant and IR stretch are lower, and the C=O is less polarised. The dipole comparison is subtler — both methyls add their own small bond moments — but the carbonyl contribution falls. The single cause is the amount of positive character on carbon: ease it, and length goes up while stiffness, stretch and C=O polarity go down together.
Q2. The π^* is the antibonding combination of the C and O p-orbitals; because oxygen is more electronegative, its p-orbital sits lower in energy and contributes more to the filled π, leaving the empty π^* weighted onto the higher-energy carbon. A nucleophile’s electrons flow into the biggest LUMO lobe, so attack happens at carbon — the LUMO map makes this electrophilic site visible.
Q3. For geometry, resonance donation wins: the –OH lone pair delocalises into the π^*, partially populating an antibonding orbital, which lengthens C=O and lowers its force constant more than pure induction would. You read it from the pair of numbers — if C=O is longer and the stretch is lower than a comparable aldehyde/ketone, an antibonding-occupying donation (resonance), not just σ-withdrawal, is responsible.
Q4. Donation toward carbon lowers the C=O force constant, so the stretch shifts down (acetone’s carbonyl band lies below formaldehyde’s). The carbonyl’s own contribution to the dipole shrinks because C=O is less polar. Both come from the identical change: more density at the carbon means a softer, less-polar C=O.
Q5. Two reasons stacked: the harmonic approximation ignores anharmonicity (real bonds are softer near dissociation), and the DFT force constant is systematically too stiff — together they inflate frequencies by ~5–10%, which is why a uniform scale factor is normally applied. Because the error is roughly constant across similar bonds, the relative ordering of the three carbonyls survives even though the absolute cm⁻¹ values are high.
Q6. Cl is strongly σ-withdrawing with weak π-donation, so it pulls density off the carbonyl carbon, increasing its δ+: C=O stiffens (higher stretch, ~1800 cm⁻¹ region), the π^*/LUMO drops in energy (more electrophilic carbon), and the C=O polarity/dipole rises. It is the opposite extreme from formic acid’s resonance-softened carbonyl and sits well above acetone.
Take‑home. Geometry, orbital energy, vibration and dipole are four windows on one quantity — how much positive character sits on the carbonyl carbon — and a substituent that changes that quantity moves all four in a single, predictable, mutually-consistent direction.

Lab 16Why is one acid stronger? (conjugate-base maps)▶

Acid strength is set by how stable the conjugate base is. Watch electron-withdrawing groups spread the negative charge.

Why it matters. The single most useful idea for predicting acidity is that it's not about the acid at all — it's about how stable the anion left behind is. Groups that pull electron density away and spread the negative charge stabilise that conjugate base and make the acid stronger. This one principle explains pKₐ trends across all of organic chemistry, and here you can see the charge delocalise.

charge −1ESP mappartial chargesconjugate base

You’ll learn to: explain acid strength through conjugate-base stability, and use ESP/charge maps to see an electron-withdrawing group delocalise the negative charge.

Background 1 — acidity lives in the anion

The most powerful shortcut for predicting acid strength is to stop looking at the acid and look instead at the conjugate base it leaves behind. A more stable anion means the proton leaves more readily, so the parent acid is stronger. All three molecules here are carboxylates, R–COO⁻; resonance already spreads the −1 charge equally over the two carboxylate oxygens. The question this lab visualises is what the R group does on top of that — whether it helps the molecule shoulder the remaining negative charge, and you watch that directly in the electrostatic-potential (ESP) picture.

Background 2 — the inductive effect, made visible

Fluorine and chlorine are highly electronegative, so each C–X bond is a dipole that pulls electron density through the σ framework toward itself — the inductive effect. In trifluoroacetate three C–F bonds tug density off the carboxylate and out onto the CF₃ end, draining negative charge away from the oxygens. Chloroacetate has one weaker puller; acetate’s methyl pulls nothing (it is a mild donor). The effect is purely electrostatic and falls off sharply with distance — this is through-bond polarisation, not π-conjugation onto the C–X bonds.

Background 3 — reading the ESP and partial-charge maps

An ESP map colours the molecular surface by the potential a point positive test charge would feel: deep red = most negative (electron-rich), fading to blue. A more delocalised anion shows its red spread thinner and over more of the surface; a poorly stabilised one shows an intense red puddle concentrated on the oxygens. Partial charges put a number on the same idea. Treat both as qualitative and method-dependent — atomic charges are not observables and depend on the partitioning scheme (Mulliken, Hirshfeld, …); compare the trend across the three, not absolute values.

Conjugate-base anions

acetate chloroacetate trifluoroacetate

Procedure

Build each conjugate-base anion — the charge box auto-fills −1 from the SMILES. Press Optimize & compute.
View the ESP map (orbitals → ESP) and judge how widely the negative (red) region is spread; record the most-negative partial charge.
Rank the anions by how delocalised the charge is, and predict the order of acid strength.

	acetate	chloroacetate	trifluoroacetate
most-negative charge
charge spread out?

Expected finding

Electron-withdrawing halogens pull negative charge off the carboxylate and spread it over the molecule (inductive effect) — the more F/Cl, the more delocalised and stable the anion, so the stronger the parent acid. Trifluoroacetic acid (pKₐ ≈ 0.2) ≫ chloroacetic (≈ 2.9) ≫ acetic (≈ 4.8). Acidity, explained by the conjugate base's ESP.

Questions

Read the picture: on the three ESP maps, where is the deepest red, and how does the spread of that red change from acetate to chloroacetate to trifluoroacetate? Connect what you see to which parent acid is strongest.
All three are carboxylates whose −1 charge is already shared over two oxygens by resonance. So what, specifically, do the halogens add that resonance alone does not — and why is that an inductive rather than a resonance effect?
From the partial charges, does the carboxylate oxygen carry more or less negative charge in trifluoroacetate than in acetate? Explain how draining charge off the oxygens stabilises the anion.
The inductive effect falls off with distance. Predict how the ESP and acidity of 3-fluoropropanoate (F two carbons further from the COO⁻) would compare with fluoroacetate, and why.
Quantitative: trifluoroacetic acid has pK_a ≈ 0.23 and acetic acid pK_a ≈ 4.76. Using ΔΔG = 1.36 · ΔpK_a (kcal/mol), find how much more the trifluoroacetate anion is stabilised, and convert the pK_a gap into a ratio of K_a.
Predict/apply: rank acetate, chloroacetate and trifluoroacetate by anion stability and by parent-acid strength, and state where you would place bromoacetate (Br less electronegative than Cl) in the ladder.

Worked answers & discussion

Q1. The deepest red sits on the carboxylate oxygens in every case, but its intensity and spread differ. Acetate shows an intense, concentrated red puddle on the two oxygens; chloroacetate’s red is paler and bleeds slightly toward the chlorine; trifluoroacetate shows the negative potential thinnest and smeared furthest over the CF₃ end. The more the red is drained and delocalised, the more stable the anion — so the strongest parent acid is trifluoroacetic, then chloroacetic, then acetic.
Q2. Resonance equalises the charge over the two oxygens but cannot move it off the carboxylate group as a whole. The halogens add through-σ withdrawal: each electronegative C–X dipole pulls density out of the framework, removing charge from the oxygens entirely. It is inductive, not resonance, because there is no π overlap delocalising the lone pairs onto the C–X bonds — the C–F/C–Cl σ bonds simply polarise.
Q3. The carboxylate oxygens carry less negative charge in trifluoroacetate — the three C–F bonds have siphoned density toward fluorine. Spreading a concentrated charge over more atoms lowers electrostatic self-repulsion and overall energy, so the anion is more stable and the proton leaves its parent acid more easily.
Q4. Because induction decays steeply with the number of intervening σ bonds, a fluorine two carbons farther away polarises the carboxylate far less. Its ESP would look much closer to plain acetate (deeper, more concentrated red on the oxygens), and 3-fluoropropanoic acid is correspondingly a much weaker acid than fluoroacetic acid.
Q5. ΔpK_a = 4.76 − 0.23 = 4.53, so ΔΔG = 1.36 × 4.53 ≈ 6.2 kcal/mol of extra stabilisation for trifluoroacetate. A 4.53-unit pK_a gap is 10^4.53, a factor of roughly 3.4 × 10⁴ (~30,000×) in K_a.
Q6. Anion stability and parent-acid strength run the same way: trifluoroacetate > chloroacetate > acetate (pK_a 0.23 < 2.87 < 4.76). Bromine is less electronegative than chlorine, so it withdraws slightly less; bromoacetate is a touch less stabilised than chloroacetate, placing bromoacetic acid just below chloroacetic in strength.
Take‑home. Acidity is the stability of the leftover anion made visible: the farther and thinner an ESP map spreads the negative charge off the carboxylate oxygens, the stronger the acid.

Lab 17Carbocation stability▶

Why does a tertiary carbocation form so much more easily than a methyl one? See the empty p-orbital and where the + charge sits.

Why it matters. Carbocation stability decides the outcome of SN1 and E1 reactions, Markovnikov addition, and a host of rearrangements. The famous ordering 3° > 2° > 1° > methyl comes from hyperconjugation and induction — neighbouring bonds donating electron density into the empty orbital — and in this lab you can literally watch that empty orbital spread out and the positive charge get shared.

charge +1LUMO mapESP maphyperconjugation

You’ll learn to: rank carbocation stability, and explain the 3°>2°>1°>methyl ordering with hyperconjugation and the empty p-orbital.

Background 1 — the empty p-orbital that needs help

A carbocation is an sp², trigonal-planar carbon with only six valence electrons and an empty p-orbital perpendicular to the plane. That electron-deficient orbital is the cation’s liability — and its lifeline. Anything that can feed electron density into it lowers the energy and stabilises the ion. Across this series, methyl (CH₃⁺) has nothing but C–H bonds pointing the wrong way; each step toward tert-butyl swaps an H on the cationic carbon for a methyl group that brings new neighbouring bonds aligned to donate. More donors, more stable cation.

Background 2 — hyperconjugation and induction

Hyperconjugation is the donation of a filled adjacent σ_C–H or σ_C–C bond into the empty p-orbital: the bonding electrons partly delocalise onto the cationic carbon, sharing the positive charge over the whole alkyl framework. A 3° cation has nine β C–H bonds plus three C–C bonds positioned to overlap; methyl has none. A smaller, purely electrostatic inductive donation from the more-polarisable alkyl groups adds to this. Together they give the famous ordering 3° > 2° > 1° > methyl, with each substitution worth a sizeable stabilisation.

Background 3 — seeing it in the LUMO and ESP maps

For a cation the LUMO is essentially that empty p-orbital, and the LUMO map shows where an incoming nucleophile would attack. In methyl it is a clean lobe on one carbon; in tert-butyl it visibly spreads onto the methyl groups, the fingerprint of hyperconjugative delocalisation. The ESP map tells the complementary story: the most-positive (blue) region is intense and pinned on methyl’s carbon, but smeared and softened over the periphery in tert-butyl. Both maps are qualitative, orbital-shape and partial-charge pictures — method-dependent, so read the trend down the series.

Carbocations

methyl ethyl (1°) isopropyl (2°) tert-butyl (3°)

Procedure

Build each cation (the charge auto-fills +1), press Optimize, then Compute orbitals.
View the LUMO map (the empty p-orbital) and the ESP map; note how much positive charge sits on the central carbon and whether the LUMO spreads onto neighbouring groups.
Rank the cations by stability and connect it to the number of adjacent C–H / C–C bonds.

	methyl	ethyl 1°	isopropyl 2°	tert-butyl 3°
+ charge on central C
LUMO spread over methyls?

Expected finding

Each neighbouring C–H/C–C bond donates electron density into the empty p-orbital (hyperconjugation), so going methyl → 1° → 2° → 3° the positive charge is shared out and the cation is more stable. The LUMO visibly spreads onto the methyl groups in tert-butyl. This is the 3° > 2° > 1° > methyl ordering behind SN1/E1 and Markovnikov.

Questions

Watch the orbital spread: compare the LUMO maps of methyl and tert-butyl cations. Where is the empty orbital localised in each, and how does that visual difference encode the 3° > 2° > 1° > methyl stability order?
Define hyperconjugation precisely. Why must the donating σ bond be on a carbon adjacent to the cationic centre and aligned with the empty p-orbital, rather than on the cationic carbon itself?
Count the number of β C–H (and C–C) bonds able to hyperconjugate in the ethyl, isopropyl and tert-butyl cations. How does the count track the stability ranking?
On the ESP maps, describe how the most-positive region changes from methyl to tert-butyl. Why does spreading the positive charge over more atoms lower the ion’s energy?
Apply to mechanism: SN1, E1 and Markovnikov addition all route through the most stable carbocation. Use the ordering to predict the major product when HBr adds to 2-methyl-2-butene, and explain via the intermediate.
Predict/apply: the benzyl cation (PhCH₂⁺) is a primary carbon yet remarkably stable. What stabilisation is available to it that none of these four aliphatic cations can use, and how would its LUMO map differ?

Worked answers & discussion

Q1. In the methyl cation the LUMO is a single clean p-lobe on the lone carbon — the charge has nowhere to go. In tert-butyl the LUMO visibly bleeds onto the three methyl groups, because their C–H bonds mix into the empty orbital. A more spread-out empty orbital means the positive charge is shared and the ion is lower in energy — exactly the visual signature of 3° > 2° > 1° > methyl.
Q2. Hyperconjugation is delocalisation of a filled adjacent σ_C–H/σ_C–C bond into the empty p-orbital. The donor must be on an adjacent (β) carbon and roughly parallel to the p-orbital so the two can overlap; a bond on the cationic carbon itself lies in the nodal plane of the empty p-orbital (or simply isn’t there, since that carbon is electron-poor), so it cannot donate.
Q3. Ethyl (1°) has 3 β C–H bonds; isopropyl (2°) has 6; tert-butyl (3°) has 9 (plus, in the substituted cases, C–C bonds that also donate). The count rises 3 → 6 → 9 in lockstep with the stability ranking — more aligned donor bonds, more delocalisation, more stable cation.
Q4. Methyl’s ESP shows an intense blue maximum pinned on its single carbon; toward tert-butyl that positive region softens and smears over the periphery. Spreading charge over more atoms cuts the concentrated electrostatic penalty of a localised + charge, lowering the total energy — the same delocalisation principle that stabilises anions, run in reverse.
Q5. HBr protonates the alkene to give the more stable carbocation. For 2-methyl-2-butene, protonation yields the 3° cation (at C2) rather than the 2° one, so Br⁻ adds there: the major product is 2-bromo-2-methylbutane — Markovnikov, dictated by the 3° > 2° preference.
Q6. The benzyl cation is stabilised by resonance: its empty p-orbital conjugates with the aromatic π system, delocalising the positive charge into the ring (onto the ortho/para carbons) — far stronger than hyperconjugation alone. Its LUMO map would show the empty orbital spread over the whole ring, not just the benzylic carbon, which no purely aliphatic cation here can do.
Take‑home. Carbocation stability is the empty p-orbital getting help: every adjacent aligned bond that donates into it shares out the positive charge, which is why 3° > 2° > 1° > methyl and why SN1/E1/Markovnikov outcomes follow that ladder.

Lab 18Resonance & delocalization▶

When you draw several resonance structures, are the "different" bonds actually identical? Measure them.

Why it matters. Resonance is one of the most misunderstood ideas in chemistry: the molecule does not flip back and forth between the structures you draw — it is a single, averaged structure all the time. Measuring bonds that your Lewis structures call "single" and "double" and finding them identical makes delocalization concrete, and explains the special stability of aromatic rings and oxoanions.

charge📏 measureπ orbitalsESP map

You’ll learn to: measure equalised bond lengths to demonstrate delocalisation, and dispel the misconception that resonance means the molecule “flips”.

Background 1 — resonance is one structure, not many

The single most misread idea in chemistry is that a molecule with several resonance structures flips between them. It does not. The Lewis structures you draw are incomplete bookkeeping; the real molecule is one fixed, averaged structure that exists all the time. Benzene never has alternating long and short bonds that swap; carbonate is not a fast equilibrium of one double and two single C–O bonds. The honest test is to measure the bonds your drawings label “single” and “double” — if delocalisation is real, they come out identical, an average of the limiting forms.

Background 2 — a delocalised π system and equalised bonds

Behind the equal bond lengths is a single delocalised π system: p-orbitals overlap continuously around the whole framework so the π electrons are shared evenly, not pinned in one location. Benzene’s six C–C bonds land at ~1.39 Å — squarely between a single (~1.54 Å) and a double (~1.33 Å) bond. Carbonate’s three C–O bonds are all equal, as are nitrate’s three N–O bonds. Each is a bond order of about 1⅓ (carbonate/nitrate) or 1½ (benzene), the exact average the resonance picture predicts — concrete proof that delocalisation, not flipping, is at work.

Background 3 — why delocalisation pays: stability and ESP

Spreading electrons over more atoms lowers their energy — this resonance (delocalisation) stabilisation is why benzene resists addition and why oxoanions like carbonate and nitrate are so stable and unreactive. The ESP map shows the complementary charge story: in carbonate the −2 and in nitrate the −1 charge is smeared evenly over all the oxygens, with no single oxygen redder than the rest — symmetry made visible. Bear in mind the geometry comes from DFT/GFN2-xTB and ESP from method-dependent partial charges, so weigh the equality of the bonds and the symmetry of the map, not the last decimal.

Molecules

benzene carbonate (2−) nitrate (1−)

Procedure

Build benzene, carbonate (charge auto-fills −2) and nitrate (−1), and Optimize each.
Use 📏 measure to compare the bonds that resonance structures draw as single vs double — are they actually equal?
Compute orbitals and view the delocalised π system and the ESP map to see the charge smeared evenly over the molecule.

	benzene C–C	carbonate C–O	nitrate N–O
bond 1 length (Å)
bond 2 length (Å)

Expected finding

All the "single" and "double" bonds come out equal — benzene's six C–C ≈ 1.39 Å (between single 1.54 and double 1.33), carbonate's three C–O identical, nitrate's three N–O identical. The real molecule is the average of the resonance structures: one delocalized π system spread evenly, with the charge smeared over all the oxygens (ESP). Resonance isn't "flipping" — it's delocalization.

Questions

Measure to settle it: after optimising, what do you find for benzene’s six C–C lengths, carbonate’s three C–O lengths and nitrate’s three N–O lengths? Explain how equal bonds disprove the idea that the molecule flips between resonance structures.
Benzene’s C–C bond is ~1.39 Å, between a single (~1.54 Å) and a double (~1.33 Å). What effective bond order does that imply, and how does it match the two main Kekulé structures?
What does the π-orbital view add that the bond-length measurement alone cannot? Describe how a delocalised π system over benzene (or over carbonate’s O–C–O) gives rise to the equal bonds you measured.
Read the ESP: on the carbonate and nitrate maps, is any one oxygen more negative than the others? Connect what you see to the equal bond lengths and to resonance stabilisation.
Carbonate carries −2 and nitrate −1, both over three oxygens. Estimate the average formal-charge-per-oxygen in each, and explain why spreading charge this way stabilises the anion (relate to the acid/conjugate-base reasoning of the carboxylate lab).
Predict/apply: the carboxylate ion (RCOO⁻) has two oxygens related by resonance but one alkyl-bearing carbon. Predict the relationship between its two C–O bond lengths, and contrast with a neutral ester where one C–O is clearly double and the other single.

Worked answers & discussion

Q1. Benzene’s six C–C bonds all come out equal at ~1.39 Å; carbonate’s three C–O bonds are identical; nitrate’s three N–O bonds are identical. If the molecule truly flipped between Lewis structures you would measure a mix of short (double) and long (single) bonds, or a time-average that still betrayed the underlying alternation. Finding every bond the same length means there is a single delocalised structure, not a jumping equilibrium.
Q2. A length midway between single and double implies a bond order of about 1½. That matches the two Kekulé structures averaging to one-and-a-half bonds per C–C edge: each bond is “double” in one structure and “single” in the other, so the real, delocalised bond is their average.
Q3. The π-orbital view shows the cause: a continuous ring (or O–C–O fan) of overlapping p-orbitals holding electrons shared evenly over every atom, rather than localised double bonds. Bond-length measurement shows the consequence (equal bonds); the delocalised π picture shows why — the same electron density sits on every equivalent bond, so they must be equal.
Q4. No oxygen is redder than the others — the ESP is symmetric, with the negative charge smeared evenly over all three oxygens in both carbonate and nitrate. That even charge distribution is the same fact as the equal bond lengths, and it is what makes these oxoanions strongly resonance-stabilised and unreactive.
Q5. Carbonate spreads −2 over three oxygens ≈ −⅔ (about −0.67) per oxygen; nitrate spreads −1 over three ≈ −⅓ (about −0.33) per oxygen. Diluting a large charge over several atoms cuts electrostatic self-repulsion and lowers energy — exactly the conjugate-base stabilisation logic: a more delocalised anion is a more stable anion.
Q6. In carboxylate the two C–O bonds are equal (an averaged bond order ~1½) because the −1 charge resonates equally over both oxygens — just like carbonate’s and nitrate’s equalised bonds. A neutral ester has no such symmetry: its C=O stays a genuine short double bond and the C–O–R a longer single bond, so the two differ markedly.
Take‑home. Equal bond lengths and a symmetric ESP prove resonance is delocalisation — one averaged structure with electrons and charge spread evenly — not a molecule flickering between the structures you draw.

Lab 19Catch a reaction in the act (transition states)▶

What does a reaction actually look like at the top of its energy hill — and how do different reaction types compare?

Why it matters. The transition state is the fleeting, highest-energy arrangement a reaction must pass through, and its height sets the reaction rate — the difference between a reaction that runs in seconds and one that never goes. You can't trap a transition state in a flask, but you can compute it, and confirm it the way professionals do: a true saddle point has exactly one imaginary vibrational frequency, and that vibration is the reaction itself.

reaction pathtransition stateactivation barriersaddle point + freqscrub the geometry

Five reactions: SN2 · E2 · Menshutkin · Diels-Alder · electrocyclic (Studio → panel 6, Reactions & transition states)

You’ll learn to: locate a transition state, verify it from its single imaginary frequency, and read off the activation barrier.

Background 1 — the saddle point and transition-state theory

On a reaction's potential-energy surface the reactants and products sit in valleys and are joined by a mountain pass. The transition state is the highest point along the lowest pass — a first-order saddle point: a maximum along the reaction coordinate but a minimum in every direction perpendicular to it. Transition-state theory says the rate depends exponentially on the height of that pass, the activation barrier ΔG^‡. A barrier a few kcal/mol higher slows a reaction by orders of magnitude, which is why locating and characterising the saddle point is the heart of this lab.

Background 2 — one imaginary frequency is the signature

How do you prove a stationary point really is a transition state and not just a minimum? Diagonalise the Hessian (the matrix of second derivatives of energy) to get the vibrational frequencies. A minimum has all real, positive frequencies; a genuine TS has exactly one imaginary frequency — a negative curvature whose normal mode is the bond-breaking/forming motion itself. Scrubbing along that mode literally animates the reaction: the S_N2 umbrella flip (≈328i cm⁻¹), the Diels–Alder bond formation (≈818i), the electrocyclic unzip (≈899i). Two or more imaginary frequencies means you found a higher-order saddle, not a real TS.

Background 3 — concerted mechanisms and method caveats

All three reactions here are concerted: bonds break and form in one step over a single barrier, with no intermediate. S_N2 inverts the carbon (Walden inversion, trigonal-bipyramidal-like TS); E2 demands an anti-periplanar H and leaving group so the developing π bond overlaps cleanly; the Diels–Alder forms two σ bonds at once (TS C···C ≈ 2.3 Å) and is strongly exothermic. Honest caveat: the fast GFN2-xTB scan gives an approximate TS quickly; the Psi4 refinement (HF/3‑21G or HF/6‑31G*) makes it a rigorous saddle point with the right shape and one imaginary mode — but these are learning-grade geometries, not benchmark barrier heights.

Procedure

Open panel 6, pick a reaction and press Run reaction (~30–90 s). Record the activation barrier and the reaction energy ΔE.
Press Jump to transition state ‡, then drag the slider back and forth to watch the bonds break and form — note exactly what's happening at the very top of the hill.
Press ✓ Refine TS (Psi4) — optionally choose HF/3-21G or HF/6-31G* — to rigorously optimize the saddle point and run a frequency check; confirm it has exactly one imaginary frequency, then Animate the reaction mode to see the atoms move along the reaction coordinate.
Now make your own: pick ✎ Custom in the reaction list, build a molecule, and click two atoms to mark a bond to form and/or break — try a ring-opening (break a C–C bond) or an intramolecular cyclization, then find and refine its TS just like the presets.

	SN2	Menshutkin	Diels-Alder	electrocyclic
barrier (kcal/mol)
ΔE (exo/endothermic?)
imaginary freq (after Refine)

Expected finding

Each reaction is one concerted step over a single barrier. SN2: the carbon is trigonal at the TS with the nucleophile and leaving group both half-bonded — scrub through and the three H's flip through the plane (Walden inversion). E2: at the TS the β-H, the new C=C and the departing leaving group are all forming/breaking together, and it only works anti-periplanar. Diels-Alder: both new C–C bonds form simultaneously and symmetrically (TS ≈ 2.3 Å), and it's hugely exothermic (two π → two σ). Refine TS confirms each is a genuine first-order saddle point — one imaginary frequency whose motion is exactly the bond-breaking/forming you scrubbed through (SN2 ≈ 328i, Diels-Alder ≈ 818i, electrocyclic ≈ 899i cm⁻¹). Also try the Menshutkin reaction (neutral molecules becoming ions — endothermic in the gas phase, the curve climbs all the way; it's solvent that makes it go) and the electrocyclic ring-opening (a clean single peak as cyclobutene unzips to butadiene). You can refine at HF/3‑21G (fast) or HF/6‑31G* (better). Note: the scan gives approximate transition states fast (GFN2‑xTB); the Psi4 refinement makes them rigorous saddle points — the right shapes and story for learning, not benchmark barrier heights.

Questions

What geometric and mathematical features distinguish a transition state from a stable minimum on the potential-energy surface?
The diagnostic test: why must a genuine transition state have exactly one imaginary vibrational frequency — and what does the motion of that single imaginary mode physically represent?
At the S_N2 saddle point the central carbon is trigonal (planar three H's) with the nucleophile and leaving group both half-bonded. Explain how scrubbing through the imaginary mode demonstrates Walden inversion.
Why does the E2 transition state require an anti-periplanar arrangement of the β-H and the leaving group, and what would happen to the barrier if they were forced syn?
The Menshutkin reaction (two neutral molecules forming ions) climbs uphill the whole way in the gas phase, yet runs readily in solution. From transition-state and solvation reasoning, why does solvent rescue it?
Quantitative: using the Eyring equation k = (k_BT/h)·e^−ΔG‡/RT, estimate roughly how much the rate changes at 298 K if a refined barrier is 2 kcal/mol higher than a crude estimate. (RT ≈ 0.593 kcal/mol at 298 K.)

Worked answers & discussion

Q1. A minimum is a valley — energy rises in every direction, and every Hessian eigenvalue (force constant) is positive. A transition state is a first-order saddle point: it is a maximum along the reaction coordinate but a minimum in all other directions. Mathematically the Hessian has exactly one negative eigenvalue; geometrically it sits at the lowest pass between reactant and product valleys.
Q2. A negative Hessian eigenvalue gives an imaginary frequency (frequency ∝ √(force constant); a negative force constant yields an imaginary root). One negative curvature = one imaginary frequency, and its normal mode points along the reaction coordinate — it is the bond-breaking/forming motion. Zero imaginary modes means you found a minimum; two or more means a higher-order saddle, not a real TS. Here S_N2 ≈ 328i, Diels–Alder ≈ 818i, electrocyclic ≈ 899i cm⁻¹.
Q3. At the S_N2 TS the carbon is sp²-like and planar, nucleophile and leaving group on opposite faces, both partially bonded. The single imaginary mode pushes the nucleophile in and the leaving group out; scrubbing it forward drives the three H's to bend through the trigonal plane to the far side — an umbrella flip. That is Walden inversion: the stereocentre turns inside-out, like an umbrella in the wind.
Q4. E2 forms the new π bond from the C–H σ and C–leaving-group σ orbitals; their p-lobes must be parallel for good overlap, which happens only when H and leaving group are anti-periplanar (dihedral ≈ 180°). A syn arrangement gives poor overlap and adds eclipsing strain, so the barrier rises sharply and the rate falls — anti-periplanar is strongly preferred.
Q5. The Menshutkin TS develops substantial charge separation (neutral reactants → ionic products). In the gas phase nothing stabilises that charge, so the curve climbs all the way and is endothermic. A polar solvent's reaction field stabilises the charged, product-like TS and products far more than the neutral reactants, pulling the barrier and the product energy down — solvent makes the reaction go.
Q6. A 2 kcal/mol higher barrier multiplies the rate by e^−2/0.593 = e^−3.37 ≈ 0.034 — roughly a 30-fold slowdown at 298 K. This is why "approximate vs refined" barriers matter: small energy errors translate into large rate errors, so we trust these calculations for mechanism and shape, not for benchmark rates.
Take‑home. A transition state is a single-imaginary-frequency saddle point, and that one imaginary mode is the reaction — animate it and you watch S_N2 inversion, anti-periplanar E2, or simultaneous Diels–Alder bond formation happen.

Lab 20Does the solvent matter?▶

Molecules almost never react in a vacuum — how does putting them in a solvent change their properties and their reactions?

Why it matters. Almost all real chemistry happens in solution, and a polar solvent is not a passive bystander: its molecules reorganise around a solute (the "reaction field"), stabilising charge and polarising the molecule. That single effect explains why dipoles grow in solution, why acids are stronger in water, and — famously — why a charged-reactant SN2 reaction is much slower in a polar solvent than in the gas phase. MoleBench models this with an implicit solvent (xtb ALPB / Psi4 PCM) — a continuous dielectric instead of explicit solvent molecules.

implicit solvationdipoleDFT (PCM)reaction barrier

You’ll learn to: predict how an implicit solvent changes a molecule’s dipole and a reaction’s barrier through the solvent reaction field.

Background 1 — the reaction field and continuum solvation

Place a polar solute in a polar liquid and the solvent molecules reorient so their dipoles oppose the solute's field. This polarised solvent generates a reaction field that points back at the solute, stabilising its charges and, in turn, pulling them further apart. Rather than track thousands of explicit solvent molecules, an implicit (continuum) solvation model replaces the solvent with a structureless dielectric of permittivity ε (water ≈ 78, acetonitrile ≈ 37). MoleBench uses xtb ALPB and Psi4 PCM; the higher ε, the stronger the reaction field and the larger the effect.

Background 2 — why dipoles grow in solution

The reaction field acts on a polar molecule like an external field, inducing extra polarisation: the molecule's electrons shift to deepen its charge separation, so the computed dipole moment is larger in solvent than in vacuum. For acetone the gas-phase dipole (≈2.8 D) swells toward ≈3.3 D in water. This is self-consistent — the solute polarises the dielectric, which polarises the solute back — and it is the same physics, scaled up, behind Born solvation of an ion: charge is stabilised in proportion to (1 − 1/ε).

Background 3 — differential solvation raises the S_N2 barrier

For an anionic S_N2 (e.g. Cl⁻ attacking), solvation does not stabilise reactant and transition state equally. The reactant has a compact, charge-concentrated chloride that a polar solvent solvates very strongly; at the TS that charge is smeared over a larger, half-bonded framework and is solvated more weakly. Stabilising the bottom of the hill more than the top raises the barrier — from ≈6 kcal/mol (gas) to ≈17 kcal/mol in water. Caveat: a continuum dielectric captures this bulk electrostatic trend but omits explicit hydrogen bonds, which matter for protic solvents.

Polar molecules (part 1)

acetone acetonitrile formaldehyde

Reaction (part 2): SN2 — Cl⁻ + CH₃Br (Studio → panel 6)

Procedure — part 1: solvent & dipole

Build acetone. In panel 2 leave solvent = gas phase and press DFT (B3LYP); record the dipole.
Change solvent = water and press DFT again. The result is labelled "water (implicit)" — record the new dipole. Try DMSO too.
Repeat for the other polar molecules. Does a more polar solvent enhance the dipole more?

dipole (D)	acetone	acetonitrile	formaldehyde
gas phase
in water

Procedure — part 2: solvent & an SN2 barrier

Open panel 6, choose the SN2 reaction, set Solvent = gas phase and Run reaction; record the activation barrier.
Set Solvent = water and run it again. Compare the two barriers.

	gas phase	water
SN2 barrier (kcal/mol)

Expected finding

Part 1: the dipole grows in solution — acetone ≈ 2.8 D (gas) → ~3.3 D (water) — because the solvent's reaction field pulls the molecule's charges further apart (more polar solvent → bigger effect). Part 2: the SN2 barrier jumps from ≈ 6 kcal/mol (gas) to ≈ 17 kcal/mol in water. The reason is a cornerstone of physical-organic chemistry: water stabilises the compact, charge-concentrated reactant (Cl⁻) much more than the spread-out transition state, so the hill to climb is far higher — which is exactly why anionic SN2 reactions run faster in the gas phase or in aprotic solvents. Solvent isn't a detail; it can change a rate by orders of magnitude.

Questions

In an implicit solvation model, what physically replaces the individual solvent molecules, and which single solvent property controls how strong the effect is?
Why dipoles grow: explain in terms of the reaction field why acetone's computed dipole rises from ≈2.8 D in the gas phase to ≈3.3 D in water, and predict whether the increase would be larger or smaller in acetonitrile (ε ≈ 37).
The anionic S_N2 barrier jumps from ≈6 kcal/mol (gas) to ≈17 kcal/mol in water. Using differential solvation of reactant vs transition state, explain why the barrier goes up, not down.
Polar aprotic solvents (DMSO, acetonitrile) speed anionic S_N2 reactions relative to polar protic ones (water, alcohols). What does the continuum model capture about this, and what does it miss?
Quantitative/predict: an 11 kcal/mol increase in ΔG^‡ on going to water — using a factor of e^{−ΔΔG‡/RT} with RT ≈ 0.593 kcal/mol — corresponds to roughly what change in rate at 298 K? Why is "solvent isn't a detail" a fair summary?
Why does a continuum dielectric handle a neutral dipolar solute (acetone, formaldehyde) more faithfully than it handles a small, hydrogen-bonded anion in water?

Worked answers & discussion

Q1. The discrete solvent molecules are replaced by a continuous, structureless dielectric medium characterised by its permittivity ε. The solute sits in a cavity carved out of this dielectric, which polarises and pushes a reaction field back on the solute. The strength of the effect is governed by ε (and the (1 − 1/ε) Born factor): the larger the dielectric constant, the stronger the stabilisation.
Q2. Acetone's permanent dipole polarises the surrounding dielectric, which produces a reaction field pointing back along the molecular dipole. That field pulls the molecule's own charges further apart — an induced extra polarisation — so the self-consistent dipole grows from ≈2.8 D to ≈3.3 D in water. In acetonitrile (ε ≈ 37 vs water's 78) the reaction field is weaker, so the increase would be smaller than in water but still present.
Q3. Solvation stabilises reactant and TS by different amounts. The reactant's chloride is a small ion with concentrated charge, which water solvates extremely strongly; at the TS the charge is delocalised across the half-bonded Cl···C···Cl framework and is solvated more weakly. Lowering the reactant valley more than the TS pass increases the gap between them — so ΔG^‡ rises from ≈6 to ≈17 kcal/mol. This is exactly why anionic S_N2 runs faster in the gas phase.
Q4. The continuum model captures the bulk dielectric part: both protic and aprotic solvents stabilise the concentrated anion and raise the barrier. What it misses is the specific, directional hydrogen bonding in protic solvents, which cages and further deactivates the nucleophile far more than the bulk ε alone implies. That extra, explicit H-bond stabilisation of the anion — absent in aprotic DMSO/MeCN — is why aprotic solvents leave the nucleophile "naked" and faster. A continuum dielectric, lacking explicit waters, cannot reproduce it.
Q5. The rate is multiplied by e^−11/0.593 = e^−18.5 ≈ 9 × 10⁻⁹ — roughly a hundred-million-fold slowdown at 298 K. A change of solvent has slowed the reaction by about eight orders of magnitude, so "solvent isn't a detail; it can change a rate by orders of magnitude" is entirely fair.
Q6. A neutral dipolar solute interacts with the solvent mainly through smooth, long-range electrostatics — precisely what a continuum dielectric models well. A small anion in water is stabilised substantially by short-range, directional hydrogen bonds to specific water molecules, a discrete effect the structureless continuum cannot represent, so it is the harder case.
Take‑home. A polar solvent's reaction field stabilises concentrated charge, swelling dipoles and — by stabilising a compact anionic reactant more than its spread-out transition state — raising the anionic S_N2 barrier enough to change the rate by orders of magnitude.

Lab 21Predict & assign an NMR spectrum▶

Can a computer predict the ¹H and ¹³C NMR of a molecule well enough to assign every peak?

Why it matters. NMR is the most powerful tool chemists have for working out structure, and DFT shift prediction (GIAO) is now routine for assigning real spectra and even for choosing between candidate structures of a new natural product. The core skill is connecting a peak — its position (electronic environment) and its integration (how many equivalent nuclei) — back to a specific atom. This lab lets you predict a spectrum and assign it yourself.

GIAO NMR¹H / ¹³Cintegrationassignment

You’ll learn to: predict ¹H and ¹³C NMR spectra and assign every peak using chemical shift and integration.

Background 1 — nuclear shielding and the chemical shift

In a magnetic field B₀ the molecule's electrons circulate and generate a tiny opposing field, so each nucleus feels a slightly reduced effective field: it is shielded. The shielding constant σ measures that reduction, and it depends on the electron density and bonding around the nucleus. Electron-rich environments shield (low δ, upfield); electron-poor or deshielded environments (next to electronegative atoms, in π ring currents) push downfield (high δ). The chemical shift reports σ relative to a reference: δ = (σ_ref − σ_sample), which is why a CH₃ appears near 1–2 ppm but an aromatic H near 7 ppm.

Background 2 — GIAO and empirical scaling

Computing σ in a magnetic field has a notorious problem: the answer should not depend on where you place the coordinate origin, but a finite basis set makes it do so. GIAO (gauge-including atomic orbitals) attaches a field-dependent phase to each basis function, restoring origin independence and giving reliable shieldings. MoleBench runs GIAO with real B3LYP DFT. Raw shieldings are then converted to shifts by an empirical linear scaling against an experimental calibration set, which absorbs systematic method error — making predictions quantitative to about ¹H ≈ 0.2 ppm and ¹³C ≈ 8 ppm across the whole range.

Background 3 — assignment: position and integration

Assigning a spectrum joins two pieces of information. Position (δ) reports the electronic environment: ethanol's CH₂ at ≈3.7 ppm sits next to electronegative oxygen, its CH₃ at ≈1.2 ppm does not; toluene's aromatic H's near 7.1–7.3 are deshielded by the ring current. Integration counts equivalent nuclei — ethanol's 3H : 2H ratio fingerprints CH₃ vs CH₂. On ¹³C the same logic places benzene at ≈128.7 ppm (lit 128.4) and a ketone C=O near 206 ppm (lit 207). Honest caveat: very polar O–H/N–H protons scatter, because the gas-phase model omits the hydrogen bonding that shifts them in real solutions.

Molecules

ethanol acetic acid toluene

Procedure

Build each molecule and press NMR (¹H / ¹³C) in panel 3. Record the predicted ¹H and ¹³C peaks and their integration (the number in parentheses / the stick height).
Assign every peak to an atom or group — e.g. for ethanol the 3H peak ≈ 1.2 ppm is the CH₃, the 2H ≈ 3.7 ppm is the CH₂.
For toluene, count how many distinct carbon environments the prediction finds and match them to the ring (ipso/ortho/meta/para) plus the methyl.

	¹H peaks (δ, #H)	¹³C peaks (δ)
ethanol
acetic acid
toluene

Expected finding

Ethanol: ¹H ≈ 3.7 (2H, CH₂) and 1.2 (3H, CH₃) — spot on; ¹³C ≈ 59 (CH₂) and ~20 (CH₃). Acetic acid: ¹H ≈ 2.0 (3H, CH₃) plus the acid O–H; ¹³C a CH₃ near 19 and the carboxyl carbon. Toluene: a 3H methyl at ≈ 2.3 ppm and aromatic H's near 7.1–7.3, with the methyl ¹³C near 21. The integration tells you how many H's — that's how you'd pick the CH₃ out of an unknown. The shifts are empirically scaled against an experimental calibration, so they're quantitative across the whole range: ¹H good to ~0.2 ppm and ¹³C to ~8 ppm, including the hard cases — benzene ≈ 128.7 (lit 128.4) and a ketone carbonyl ≈ 206 (lit 207). (Very polar O–H/N–H protons still scatter — they're sensitive to hydrogen bonding that the gas-phase model ignores.)

Questions

What is nuclear shielding, and why does a higher shielding constant σ correspond to a peak farther upfield (lower δ)?
The two clues: in assigning ethanol's ¹H spectrum, how do position (≈3.7 vs ≈1.2 ppm) and integration (2H vs 3H) together let you assign the CH₂ and CH₃ peaks unambiguously?
Toluene's aromatic protons appear near 7.1–7.3 ppm while its methyl sits at ≈2.3 ppm. What deshields the ring protons so strongly, and why is the methyl comparatively shielded?
Why is a coordinate-origin problem (the gauge problem) inherent to computing shieldings, and how does the GIAO approach fix it?
The raw computed shieldings are run through an empirical linear scaling before being reported as δ. What systematic errors does that absorb, and why does it let even hard cases like a ketone C=O (≈206 ppm) come out accurate?
Quantitative: on a 400 MHz spectrometer (¹H), how many Hz apart are ethanol's CH₂ (3.7 ppm) and CH₃ (1.2 ppm) signals? (Hint: 1 ppm = spectrometer frequency in MHz, in Hz.)

Worked answers & discussion

Q1. Shielding is the small opposing magnetic field that the molecule's circulating electrons create at a nucleus, reducing the field the nucleus actually feels. A larger σ means more shielding, so a stronger external B₀ is needed to reach resonance — equivalently a lower resonance frequency — which by the convention δ = (σ_ref − σ_sample) places the peak farther upfield (lower δ). Electron-rich = shielded = upfield; deshielded = downfield.
Q2. Position fixes the chemical environment: the CH₂ is bonded to electronegative oxygen, which withdraws electron density and deshields it to ≈3.7 ppm, while the CH₃ is remote from oxygen and stays at ≈1.2 ppm. Integration then confirms identity by counting equivalent H's: the upfield peak integrates for 3H (CH₃) and the downfield one for 2H (CH₂). Together the 3.7 ppm / 2H and 1.2 ppm / 3H pattern is unambiguous — and that 3H integral is how you'd pick a methyl out of an unknown.
Q3. The aromatic ring's π electrons circulate in the field to set up a ring current whose field adds to B₀ in the plane of the ring where the H's sit, strongly deshielding them to ≈7.1–7.3 ppm. The methyl protons are ordinary sp³ C–H, only mildly deshielded by the adjacent ring, so they appear at ≈2.3 ppm.
Q4. Shielding involves the magnetic vector potential, which depends on an arbitrary choice of gauge origin. With a complete basis the physical result is origin-independent, but any finite basis introduces a spurious dependence on where you put the origin. GIAO attaches an explicit field-dependent phase factor to each atomic orbital, effectively giving every basis function its own local origin, which cancels the gauge dependence and yields well-defined, origin-independent shieldings even in modest basis sets.
Q5. The linear scaling (δ = a·σ + b, fit to experimental data) absorbs systematic errors — basis-set incompleteness, the chosen functional, and the reference-compound shielding — that bias all nuclei in a similar, predictable way. Because those errors are largely linear in σ, fitting a slope and intercept corrects them across the whole range, so even a far-downfield ketone C=O lands near 206 ppm (lit 207) and benzene at 128.7 (lit 128.4). What scaling cannot fix are non-systematic effects like hydrogen bonding on O–H/N–H protons.
Q6. 1 ppm on a 400 MHz instrument = 400 Hz. The separation is (3.7 − 1.2) ppm × 400 Hz/ppm = 2.5 × 400 = 1000 Hz. (Note this gap scales with field strength — at 600 MHz it would be 1500 Hz — whereas δ in ppm is field-independent, which is why we report ppm.)
Take‑home. GIAO shieldings, empirically scaled, reproduce real spectra because position encodes the electronic environment and integration counts equivalent nuclei — the same two clues you use to assign any unknown by hand.

Lab 22Map a conformational landscape (2D PES)▶

A 1D scan rotates one bond — but real flexible molecules have several. What does the full energy surface look like?

Why it matters. When two (or more) bonds rotate together, the molecule's energy becomes a surface over those coordinates, with basins (stable conformers), ridges (barriers) and saddle points connecting them. This is exactly the picture behind the Ramachandran plot of protein backbones, and it reveals effects a single 1D scan misses — like syn-pentane interference, where two individually-fine gauche turns clash when combined.

2D scanpotential-energy surfaceMMFF94click-to-view

You’ll learn to: read a two-dimensional energy surface — basins, ridges and saddle points — and recognise effects (like syn-pentane) that a 1D scan misses.

Background 1 — from a 1D scan to a 2D energy surface

Rotating one bond traces a 1D energy curve; let two dihedrals vary together and the energy becomes a surface over a plane of two torsion angles. That surface has basins (stable conformers at the bottom of valleys), ridges (rotational barriers), and saddle points that mark the lowest passes connecting one basin to another. Reading it as a contour or heat map shows not just where the minima are but how high the walls between them sit — the whole conformational landscape in one picture. MoleBench builds this with the MMFF94 force field, which is fast and qualitatively reliable for conformational energies.

Background 2 — the Ramachandran connection and 1D blind spots

This is exactly the logic of the Ramachandran plot, where a protein backbone's allowed shapes are mapped over its two torsions φ and ψ. The key lesson is that a 2D map reveals coupling a 1D scan cannot: two torsions that are each individually fine can clash when set simultaneously. In pentane both central C–C bonds matter; the deep minimum is anti–anti (both ≈180°), gauche basins sit a few kcal/mol up, and the corners are eclipsed, high-energy geometries.

Background 3 — syn-pentane interference and an H-bonded basin

The signature coupling effect is syn-pentane interference: a gauche turn on each torsion is individually low-energy, but two gauche turns of opposite sign force the terminal methyls into direct steric contact, producing a high-energy spot a 1D scan would never flag. Ethylene glycol shows the opposite surprise: a gauche O–C–C–O basin is stabilised — not by sterics but by an intramolecular O–H···O hydrogen bond, the same effect seen in the 1D conformer lab, now appearing as a distinct basin on the surface. Caveat: MMFF94 is a classical force field, fast and qualitatively right, not a quantum benchmark.

Molecules

pentane ethylene glycol

Procedure

Build pentane. In panel 5 under 2D conformational map, the two central C–C–C–C torsions are pre-chosen as bond X and bond Y. Press Run 2D map.
Read the heatmap (blue = low, red = high). Find the anti–anti minimum (both ≈ 180°) and the high-energy regions. Click cells to load and inspect those conformers.
Now try ethylene glycol (drive the H–O–C–C and O–C–C–O torsions): look for a low-energy basin that corresponds to the internal O–H⋯O hydrogen bond.

	lowest (X°, Y°)	energy range (kcal/mol)	what's at the high corner?
pentane
ethylene glycol

Expected finding

Pentane: the deep minimum sits at anti–anti (both torsions ≈ 180°); broad gauche basins are a few kcal/mol up; the highest corners are eclipsed/clashed geometries — including the famous syn-pentane spot where two gauche turns of opposite sign force the end methyls together. Ethylene glycol: unlike a pure-steric map, a gauche O–C–C–O basin is stabilised by the intramolecular O–H⋯O hydrogen bond — the same surprise as the 1D conformer lab, now visible as a basin on the 2D surface. A 2D map shows the whole landscape — minima, barriers and the paths between them — in one picture. (MMFF94 force field; fast and qualitatively right for conformational energies.)

Questions

On a 2D potential-energy surface, what conformational feature does each of a basin, a ridge, and a saddle point correspond to physically?
Why does pentane's deepest minimum sit at anti–anti (both central torsions ≈180°), and what kinds of geometries occupy the high-energy corners of the map?
Syn-pentane interference: each central bond can adopt a gauche turn at only a small energy cost, so why does combining two specific gauche turns produce a high-energy spot — and why is this effect invisible to a single 1D dihedral scan?
Ethylene glycol's surface has a stabilised gauche O–C–C–O basin instead of the simple anti minimum a pure-steric molecule would show. What stabilises it, and how does that echo the 1D conformer lab?
How is this 2D map the same idea as a protein's Ramachandran plot, and what general lesson about multi-bond molecules does that comparison teach?
Predict/apply: a basin sits ≈3 kcal/mol above the global minimum. Using a Boltzmann factor e^−ΔE/RT at 298 K (RT ≈ 0.593 kcal/mol), estimate its relative population, and comment on whether MMFF94 energies are reliable enough to trust that number quantitatively.

Worked answers & discussion

Q1. A basin is a local minimum — a stable conformer the molecule spends time in. A ridge is a high-energy wall, a barrier separating regions of the map. A saddle point is the lowest pass over a ridge — a maximum along the path connecting two basins but a minimum across it — so it sets the barrier the molecule must climb to interconvert between those two conformers.
Q2. At anti–anti both central torsions are ≈180°, so every large group is staggered and the chain is fully extended with no 1,4 or 1,5 steric clashes — the lowest-energy arrangement. Gauche basins (one or both torsions near ±60°) sit a few kcal/mol higher. The map's corners are eclipsed geometries where torsions pass through 0° and groups overlap, the highest-energy regions.
Q3. A single gauche turn is cheap (≈0.9 kcal/mol). But two gauche turns of opposite sign (e.g. g⁺g⁻) bend the chain back on itself so the two terminal methyls are driven into direct van der Waals contact — a 1,5 steric clash that costs far more than the sum of the two individual gauche penalties. A 1D scan rotates only one bond with the other fixed, so it never samples the combination that creates the clash; only the 2D map, varying both at once, exposes this coupling.
Q4. The gauche O–C–C–O geometry brings the two hydroxyl groups close enough to form an intramolecular O–H···O hydrogen bond, whose stabilisation outweighs the small gauche steric penalty — so the basin sits lower than sterics alone would predict. This is exactly the surprise from the 1D conformer lab (glycol favouring gauche over anti), now visible as a distinct, stabilised basin on the 2D surface.
Q5. A Ramachandran plot maps a protein backbone's energy over its two torsions φ and ψ, with allowed (low-energy) basins for α-helix and β-sheet and forbidden high-energy regions — precisely a 2D conformational PES. The general lesson: for any molecule with two or more rotatable bonds, the torsions are coupled, so the real accessible shapes (and the clashes that forbid others) emerge only from the full surface, never from independent 1D scans.
Q6. The Boltzmann factor is e^−3/0.593 = e^−5.06 ≈ 0.0064 — about 0.6% of the global-minimum population (before any degeneracy/multiplicity correction). Quantitatively, treat this as an estimate only: MMFF94 is a classical force field, fast and qualitatively correct for conformational ordering and surface shape, but its energies are not benchmark-accurate, and a ~1 kcal/mol error shifts a Boltzmann population by a factor of several. Trust the landscape and the trend, not the third significant figure.
Take‑home. Letting two torsions vary together turns a curve into a surface whose basins, ridges and saddles — including the syn-pentane clash and glycol's H-bonded gauche basin — reveal the coupling between bonds that any single 1D scan is blind to.

Lab 23Acidity by design — pK_a, electronic effects, descriptors & electron correlation▶

Acetic acid and trifluoroacetic acid differ by just three fluorine atoms, yet trifluoroacetic acid is roughly 30,000× stronger. Where does that factor come from — and can you see it in a predicted pK_a, in the molecule's descriptors, and in a correlated‑level energy?

Why it matters. pK_a is arguably the single most useful number in chemistry: it decides whether a group is charged at a given pH, how a drug crosses a membrane and binds its target, how an enzyme moves protons, how a buffer holds, and which way an equilibrium or mechanism runs. Learn to reason about acidity from structure and you can predict reactivity across organic, biological and medicinal chemistry. This lab builds that intuition from three angles at once — a directly estimated pK_a, the QSAR descriptors medicinal chemists live by, and the Hartree–Fock→MP2 jump that defines electron correlation — and asks you to connect them.

pK_a (GFN2/ALPB)inductive vs resonanceQSAR descriptorsMP2 correlationHammett

You’ll learn to: estimate a pK_a, connect it to molecular descriptors and inductive/resonance effects, and interpret the MP2 correlation energy.

Background 1 — pK_a is really a free energy

An acid sits in equilibrium with its conjugate base, HA ⇌ H⁺ + A⁻, with K_a = [H⁺][A⁻]/[HA] and pK_a = −log₁₀K_a. Because the position of any equilibrium is fixed by its free energy, pK_a is a disguised thermodynamic quantity: ΔG° = 2.303 RT · pK_a. At 298 K that prefactor is 1.36 kcal/mol per pK_a unit — so an acid whose pK_a is 5 units lower has a conjugate base about 7 kcal/mol more stable. The golden rule follows: almost everything that makes an acid stronger does so by stabilising A⁻.

Background 2 — two ways a substituent stabilises the anion

Electron‑withdrawing groups stabilise A⁻ by two distinct routes. The inductive effect pulls density through the σ‑bond framework; it is strongest for the most electronegative atoms (F > Cl > Br) and falls off steeply with distance — which is exactly why α‑halogens on a carboxylic acid lower its pK_a so much, and three fluorines make trifluoroacetic acid one of the strongest simple organic acids. The resonance (mesomeric) effect delocalises charge through the π system and needs conjugation plus the right geometry: a para‑nitro group lets the phenolate lone pair spill onto the –NO₂ oxygens, something a meta nitro cannot do. Physical‑organic chemistry compresses all of this into Hammett σ constants (for –NO₂, σ_para = +0.78; more positive = more withdrawing).

Background 3 — how the pK_a button works (and its limits)

MoleBench estimates pK_a from a thermodynamic deprotonation cycle. For every ionisable X–H (X = O, N, S) it optimises the neutral acid and its conjugate base in implicit water (ALPB continuum solvent) at GFN2‑xTB and takes ΔE = E(A⁻) − E(HA), then maps it to pK_a = m·ΔE + b through a line fitted to 15 acids of known pK_a across 0–16 (R² = 0.91, mean error ≈ 1.3 units). It reports the most acidic site and lists every candidate. Treat the value as an estimate (±~1–2 units) — one global line cannot perfectly fit carboxylic acids, phenols, alcohols and thiols together, and it omits explicit waters and full entropy — but the trends are reliable, which is what chemical reasoning usually needs.

Background 4 — what the descriptors measure

cLogP is the calculated oil/water partition coefficient (lipophilicity), summed from Crippen atom contributions. Molar refractivity (MR) scales with molecular volume × polarisability — how big and how “squishy” the electron cloud is. TPSA (topological polar surface area) sums the surface of N/O/polar‑H atoms and is a strong proxy for H‑bonding and membrane permeability. QED rolls several properties into one 0–1 drug‑likeness score. Keep one thing in mind that this lab is designed to expose: polarisability (MR) and electronegativity are different atomic properties, and they will openly disagree below.

Background 5 — Hartree–Fock, correlation, and MP2

Hartree–Fock (HF) is a mean‑field method: each electron feels only the average field of the others, so it misses the way electrons instantaneously avoid one another. The energy it leaves out is the correlation energy, E_corr = E_exact − E_HF, which is always negative. MP2 (second‑order Møller–Plesset perturbation theory) adds most of it back on top of the HF wavefunction. Correlation is decisive for reaction energies, dispersion / π‑stacking and weak interactions. The catch: total energies are only comparable within the same method and basis — an MP2 number for one molecule and an HF number for another say nothing on their own.

An acidity ladder (click any to load it)

acetic acid chloroacetic acid trifluoroacetic acid phenol 4-nitrophenol

Procedure — part 1: build the pK_a ladder

Build acetic acid; in panel 2 press Estimate pK_a. Record the predicted pK_a and which site it chooses (it lists every O/N/S–H).
Repeat for chloroacetic then trifluoroacetic acid. Each extra electronegative atom pulls harder on the carboxylate — watch the pK_a fall, and note how much each halogen is worth.
Now compare phenol with p‑nitrophenol. Predict the direction first, then test it.

predicted pK_a	acetic	chloroacetic	trifluoroacetic	phenol	p‑nitrophenol
site chosen
predicted pK_a
experimental pK_a	4.76	2.87	0.23	9.99	7.15

Procedure — part 2: read the descriptors

For each of the three carboxylic acids, click QSAR descriptors in panel 1. Record cLogP, molar refractivity, TPSA and QED.
Going acetic → chloro → trifluoro, mark which descriptors grow, which barely move, and — importantly — whether the descriptor that grows the most is the same property that drives the acidity.

	acetic	chloroacetic	trifluoroacetic
cLogP
molar refractivity
TPSA (Å²)
QED

Procedure — part 3: Hartree–Fock vs MP2

Build acetic acid. Open ⚙ Advanced in panel 2, choose Hartree–Fock, basis 6‑31G*, task Single‑point energy, Run; record the energy.
Switch only the method to MP2 (correlated), keep 6‑31G*, Run again. The difference is the correlation energy recovered by MP2.

	energy (Eh)
HF / 6‑31G*
MP2 / 6‑31G*
correlation (MP2 − HF)

Questions

Write the deprotonation equilibrium for acetic acid and draw its conjugate base. State the golden rule that links A⁻ stability to pK_a.
Using ΔG° = 1.36 · pK_a (kcal/mol, 298 K) and the experimental values, how much more stable is trifluoroacetate than acetate? Show the arithmetic.
Trifluoroacetic acid and p‑nitrophenol are both strengthened by an electron‑withdrawing group. Classify each effect as inductive or resonance and justify it from where the group sits relative to the negative charge. Why would meta‑nitrophenol be a weaker acid than the para isomer?
The key insight. Molar refractivity rises from acetic to chloroacetic but is almost unchanged for trifluoroacetic — even though fluorination makes the acid far stronger. Explain why acidity and MR track different atomic properties.
All three carboxylic acids share TPSA = 37.3 Å², yet phenol is 20.2 and p‑nitrophenol 63.4. Account for all three numbers.
From your part‑3 values, give E_corr for acetic acid in Eh and kcal/mol, and state why it must be negative. Why can you not compare acetic acid's MP2 energy with phenol's HF energy to decide which molecule is more stable?
Apply it. A drug candidate's –COOH is so acidic it is fully ionised in the gut, hurting permeability. Suggest one structural change that would raise its pK_a, and predict the sign of the change in cLogP and TPSA.

Worked answers, findings & discussion

Part 1 — the ladder. Predicted pK_a plummets with electron‑withdrawing power: acetic ≈ 6.5 → chloroacetic ≈ 3.9 → trifluoroacetic ≈ −0.8 (experiment 4.76 → 2.87 → 0.23 — the model runs a little high, but the order and spacing are right). Among phenols, p‑nitrophenol (≈7.0) is far stronger than phenol (≈11.2).

Q1. CH₃COOH ⇌ CH₃COO⁻ + H⁺; the conjugate base is acetate, whose charge is shared equally over two oxygens by resonance. Golden rule: the more stable A⁻, the lower the pK_a.
Q2. ΔΔG = 1.36 × (4.76 − 0.23) = 1.36 × 4.53 ≈ 6.2 kcal/mol — trifluoroacetate is that much more stabilised, and 4.5 pK_a units is a factor of ~30,000 in K_a.
Q3. Trifluoroacetic acid = inductive: the C–F dipoles pull density through the σ bonds toward the carboxylate (no π conjugation to the C–F bonds). p‑Nitrophenol = resonance: the phenolate lone pair delocalises through the ring onto the –NO₂ oxygens. A meta nitro is not conjugated to the O⁻ position, so it can only act inductively — weaker — making meta‑nitrophenol (pK_a 8.4) a weaker acid than the para isomer (7.15).
Q4 (the insight). Measured values: MR = 13.3 → 18.4 → 13.7 (acetic, chloroacetic, trifluoroacetic). MR scales with size×polarisability: one big, soft chlorine adds ~5 units, but three small, “hard” fluorines barely move it. Acidity, by contrast, tracks electronegativity / inductive withdrawal, where fluorine dominates. So the strongest acid of the three (TFA) has essentially the same MR as the weakest (acetic): electron‑withdrawing power is not the same axis as polarisability — a distinction Hammett σ captures but MR does not.
Q5. TPSA counts only N/O/polar‑H surface. The –COOH group is identical in all three acids → 37.3 Å² regardless of the halogens (Cl and F contribute zero). Phenol has just one –OH → 20.2; p‑nitrophenol adds a nitro group (~43 Å²) → 63.4.
Q6. HF ≈ −227.807 Eh, MP2 ≈ −228.433 Eh, so E_corr ≈ −0.63 Eh ≈ −393 kcal/mol. It is negative because correlation only ever lowers the energy (electrons avoiding each other reduce repulsion below the mean‑field estimate). You cannot compare MP2 of one molecule with HF of another because each method/basis has a different “zero” — only differences computed at the same level are physical.
Q7. Raise the pK_a by making the conjugate base less stable: e.g. replace α‑electron‑withdrawing groups with electron‑donating alkyls, or move polar/EWG substituents farther from the –COOH (inductive effects fade with distance). Adding a non‑polar alkyl typically raises cLogP (more lipophilic) and leaves TPSA essentially unchanged (no new polar atoms) — often exactly the permeability trade you want.

Take‑home. One structural lever — conjugate‑base stability — explains a 30,000‑fold acidity range, and you saw it three independent ways: a pK_a trend, a descriptor set that confirms what changed (and warns that polarisability ≠ electronegativity), and a correlation energy that quantifies what HF leaves out.

Going further: walk fluoroacetic → difluoro → trifluoroacetic acid and test whether each fluorine lowers pK_a by a similar step (inductive additivity); compare ortho/meta/para‑nitrophenol; or build the conjugate base directly (set charge −1, drop the acidic H) and run descriptors on the anion.

🧬 Structural biology labs

Hands-on in the Protein Workbench — protein architecture, folding, enzyme active sites, structure-based drug design, AlphaFold and DNA. Each lab opens real structures from the Protein Data Bank.

Lab 24Anatomy of a protein — the four levels of structure▶

A single hemoglobin molecule is four protein chains wrapped around four iron-bearing hemes — so how do biologists organize the journey from a flat string of amino acids to a working oxygen carrier?

Why it matters. Every protein in your body is described at four hierarchical levels, and disease often strikes at a specific one — a single primary-sequence swap (Glu→Val) causes sickle-cell anemia, while loss of quaternary assembly can disable an enzyme entirely. Learning to read these levels in a real structure is the first skill of structural biology and the foundation of modern drug design.

protein structuresecondary structurequaternary structurehemoglobinbeginner

You’ll learn to: read a one-letter sequence in the Sequence panel, identify α-helices and β-sheets by coloring a structure, isolate individual chains, recognize a multi-subunit (quaternary) assembly versus a single-domain protein, and record a side-by-side comparison of two real structures.

Background 1 — the four levels of structure

Protein architecture is described in a strict hierarchy. Primary structure is the linear sequence of amino acids joined by peptide bonds — the order encoded by the gene. Secondary structure is local backbone folding stabilized by backbone hydrogen bonds, giving repeating α-helices and β-sheets. Tertiary structure is the full three-dimensional fold of one chain, packing those elements together. Quaternary structure is the assembly of two or more folded chains (subunits) into one functional unit. Each higher level is built from the one below.

Background 2 — why hemoglobin is a tetramer

Human hemoglobin is a tetramer: two identical α-subunits and two identical β-subunits (an α₂β₂ arrangement), each cradling one heme group whose iron binds O₂. The subunits are not just decorative company — they communicate. Oxygen binding at one heme nudges the others into a higher-affinity shape, a quaternary phenomenon called cooperativity that gives hemoglobin its efficient sigmoidal O₂ curve. A lone subunit (like the related protein myoglobin) cannot cooperate. By contrast, crambin (1CRN) is a tiny 46-residue plant protein: a single chain, a single domain, no quaternary structure at all.

Background 3 — how a viewer maps levels onto colors and panels

A structure viewer lets you see each level by switching representation and color. Color by structure assigns one color to α-helix, another to β-strand, and a third to loops, so secondary structure becomes visible at a glance. Color by chain gives each subunit its own color, exposing the quaternary arrangement directly. The Sequence panel is the primary structure made clickable, and isolating one chain strips the view down to a single tertiary fold. A key habit is matching the question to the right tool: ask “which secondary structure?” with color by structure, but ask “how many subunits?” with color by chain. Resolution and atom counts in the Structure metadata tell you how much detail the crystallography actually resolved.

Structures — open in the workbench ↗

4HHB hemoglobin 1CRN crambin

Procedure

Open 4HHB in the workbench. Read the Structure (i) panel and note the method, resolution (Å), organism, and the counts of chains / residues / atoms / ligands. How many chains? How many ligands (these include the hemes)? Record these in the worksheet.
In Representation (1), confirm Show as → cartoon. This is the standard way to read tertiary structure — the ribbon traces the folded backbone path.
Still in Representation, set Color by → structure. Helices, sheets, and loops now get distinct colors. Spin the view (toolbar ◌ spin, then click ◌ again to stop and drag manually) and survey the fold. Is hemoglobin mostly helical, mostly sheet, or mixed? Record it.
Open the Sequence (5) panel. This one-letter string is the primary structure. Click any residue to highlight and zoom to it in the 3D view — watch where it lands on the cartoon.
In Chains (2), click isolate on the first chain (A). You now see one folded subunit on its own — this is the tertiary structure of a single α-subunit. Note its compact globin fold.
Click Show all in Chains, then in Representation set Color by → chain. The four subunits now get four colors — this is the quaternary structure. Identify the two α and two β chains by how they pair across the assembly.
In Ligands & binding sites (3), click focus on one heme to zoom to it, then contacts to see how it is held in its pocket. Note that there are four hemes — one per subunit.
Optionally, in Display (4) use 📏 measure: click an iron atom in one heme, then an iron in a neighboring subunit, to read the inter-heme distance (Å) and appreciate how far apart the cooperating sites are.
Now open 1CRN in the workbench. Read Structure (i): how many chains and residues? Set Color by → structure and compare crambin’s size and secondary-structure content to a single hemoglobin chain. Fill in the 1CRN column of the worksheet.
Use Saved views (6) to Save the structure-colored hemoglobin tetramer so you can restore it for comparison.

Record what you observe in both structures. Reference values are pre-filled; fill the shaded cells from the Structure (i) panel and your colored views.

Feature	4HHB (hemoglobin)	1CRN (crambin)
Number of chains
Approx. residues (total)	~574 (4 × ~143)	46
Resolution (Å)
Secondary structure seen (α-helix / β-sheet / both)
Highest level of structure illustrated	quaternary (α₂β₂ tetramer)	tertiary (single small domain)
Ligands (heme count)		none

Questions

After Color by → structure, which secondary-structure type dominates hemoglobin, and which is essentially absent? What is the name of the all-α fold each subunit adopts?
The Sequence panel shows a string of letters. Which level of structure is this, and what kind of bond links each letter to the next?
When you isolated chain A, which structural level were you looking at? When you colored all four chains differently, which level?
From your worksheet: how many chains, hemes, and total residues did 4HHB report versus 1CRN, and what secondary-structure type did each show? Which structural level does that residue/chain count let each protein reach?
Hemoglobin carries four hemes and four chains, while myoglobin has one of each. Why does the tetrameric arrangement matter functionally — what can hemoglobin do that a single subunit cannot?
Predict/apply: Sickle-cell anemia comes from a single Glu→Val change in the β-chain. Which structural level is altered directly, and explain how a change at that lowest level can still wreck the protein’s higher-level behavior.

Worked answers & discussion

Q1. Hemoglobin is overwhelmingly α-helical with essentially no β-sheet — under Color by structure you see helix colors everywhere and only short connecting loops. Each subunit adopts the classic all-α globin fold (roughly eight helices, conventionally labeled A–H).
Q2. The one-letter sequence is the primary structure. Consecutive residues are joined by peptide (amide) bonds formed between one residue’s carboxyl and the next residue’s amino group.
Q3. Isolating a single chain shows that chain’s tertiary structure (one folded subunit). Coloring all four chains separately reveals the quaternary structure — how the separate folded subunits assemble.
Q4. 4HHB reports 4 chains, 4 heme ligands, and ~574 residues total (about 143 per chain), and colors almost entirely as α-helix — enough chains to reach the quaternary level. 1CRN reports 1 chain, 46 residues, and no ligands; it shows both a short α-helix and a small β-sheet but only reaches the tertiary level because there is just one chain. (Crambin is mixed α/β despite being tiny — a nice contrast to the all-α globin.)
Q5. Four subunits allow cooperativity: O₂ binding at one heme triggers a quaternary shape change (T→R) that raises affinity at the other hemes. This gives a sigmoidal binding curve, so hemoglobin loads O₂ efficiently in the lungs and unloads it in tissues. A single subunit (myoglobin) has no partners to signal, so it binds hyperbolically and just stores O₂.
Q6. The mutation alters the primary structure directly (one residue in the β-chain sequence). But replacing a charged surface glutamate with a hydrophobic valine creates a sticky patch; under low oxygen, deoxy-hemoglobin molecules use that patch to polymerize into long fibers — an aberrant quaternary/intermolecular association that deforms red cells. So a single primary-level letter cascades upward to break the assembly-level behavior.
Take‑home. The four levels are a ladder: sequence (primary) dictates local folds (secondary), which pack into a 3D shape (tertiary), which may assemble with partners (quaternary). Hemoglobin showcases all four and uses its quaternary level for cooperative O₂ transport; crambin stops at tertiary. Reading a structure means knowing which level you are looking at — and which level a mutation or drug is acting on.

Lab 25The hydrophobic core — how a protein folds▶

A protein chain could fold a near-infinite number of ways — so what physical force reliably pushes the same sequence into the same compact shape every single time?

Why it matters. Misfolded proteins drive Alzheimer’s, Parkinson’s, and many other diseases, and the entire field of structure prediction (including AlphaFold) rests on understanding what makes folding spontaneous. The dominant answer — the hydrophobic effect — is one of the most important and counter-intuitive ideas in biochemistry, and you can see it directly in a structure no bigger than crambin.

protein foldinghydrophobic effectthermodynamicsAnfinsencrambin

You’ll learn to: color a structure by hydrophobicity, distinguish buried core residues from surface residues, predict residue location from side-chain chemistry and confirm it in 3D, and connect the hydrophobic effect to the entropy of water as the driving force of folding.

Background 1 — the hydrophobic effect drives folding

Folding is not pulled together by the nonpolar side chains “liking” each other — it is pushed by water. Around an exposed nonpolar group, water molecules cannot hydrogen-bond to it, so they form an ordered clathrate-like cage, losing entropy. When the protein buries its nonpolar (hydrophobic) side chains together in a core, that caging water is released to the bulk, and its entropy rises sharply. This favorable increase in water entropy (−TΔS) is the main negative-ΔG term of folding — the hydrophobic effect. Polar and charged residues, which water can solvate happily, stay on the surface.

Background 2 — Anfinsen: sequence encodes structure

Christian Anfinsen showed that a denatured protein can spontaneously refold to its correct, active shape with no outside help — so all the folding information lives in the primary sequence. The folded state is the thermodynamic minimum: the most stable arrangement reachable for that sequence in water. The pattern of hydrophobic versus polar residues along the chain therefore acts like a code — it tells the chain which residues belong inside and which belong outside. Crambin (1CRN), a 46-residue plant seed protein with a well-resolved structure, is a classic miniature example of a clean hydrophobic core, with charged residues neatly displayed on its surface.

Background 3 — reading hydrophobicity from side-chain chemistry

You can predict where a residue lives before ever opening the structure. Aliphatic side chains (Leu, Ile, Val, Ala) and aromatic ones (Phe, Trp) are pure hydrocarbon — nonpolar, so they bury in the core. Charged residues (Lys, Arg positive; Asp, Glu negative) and polar ones (Ser, Thr, Asn, Gln) carry hydrogen-bonding or ionizable groups, so they favor the water-facing surface. Some residues are amphipathic and sit at the boundary. Hydrophobicity scales (Kyte–Doolittle, for example) rank these quantitatively, and a viewer’s color-by-hydrophobic mode applies essentially the same scale — so your chemical prediction and the colored 3D view should agree. Disulfide bonds (crambin has three) also help lock the small fold.

Structures — open in the workbench ↗

1CRN crambin

Procedure

Open 1CRN in the workbench. Read Structure (i): note that crambin is a single chain of about 46 residues — small enough to inspect every side chain.
In Representation (1), set Show as → cartoon and Color by → hydrophobic. Orange marks hydrophobic residues; blue marks polar/charged residues.
Make sure the toolbar ◌ spin is off, then drag to rotate by hand. Slowly turn the molecule and watch where the orange clusters — toward the middle or the outside?
In Representation, switch Show as → surface. You now see the solvent-facing outside of the protein. Note whether the visible surface is mostly orange or mostly blue.
Switch Show as → sticks (or ribbon+lig) while keeping Color by → hydrophobic. Rotate and look into the molecule: the orange side chains you now see packed in the interior are the buried core.
Open the Sequence (5) panel. Before clicking, fill the worksheet’s prediction columns from side-chain chemistry. Then click each listed residue (a Leu, Ile, Phe, Val; then a Lys, Arg, Asp, Glu, Ser) to zoom to it, and confirm in 3D whether it sits buried or on the surface.
In Display (4), toggle 💧 water off if any crystal waters are shown, so you can see the protein surface cleanly. Optionally toggle residue labels to confirm residue identities, and use 📏 measure to gauge how deep a core residue sits.
Use Saved views (6) to Save the surface view and the sticks/core view, so you can flip between “outside” and “inside” for the discussion.

For each residue, write your prediction first (from Background 3), then confirm it using Color by → hydrophobic with the surface and sticks views. Fill the shaded cells.

Residue (example)	Hydrophobic or polar/charged?	Predicted: buried (core) or surface?	Confirmed in viewer?
Leu (L)
Ile (I)
Phe (F)
Val (V)
Lys (K)
Arg (R)
Asp (D)
Glu (E)
Ser (S)

Questions

After Color by → hydrophobic, where do the orange (hydrophobic) residues concentrate — the interior or the surface? Where do the blue (polar/charged) residues sit?
In the surface representation, is the solvent-exposed outside mostly orange or mostly blue? Why does that make thermodynamic sense?
The hydrophobic effect is often called “entropy-driven.” Whose entropy increases when the core forms — the protein’s or the water’s — and why?
From your worksheet: did every prediction match what you saw in the viewer? List which of Leu/Ile/Phe/Val you confirmed as buried and which of Lys/Arg/Asp/Glu/Ser you confirmed as surface, and note any residue that sat at the boundary instead.
How does this experiment illustrate Anfinsen’s principle that the sequence encodes the structure?
Predict/apply: A mutation buries a charged Asp deep in the hydrophobic core, or exposes a big hydrophobic Phe on the surface. For each case, predict whether the protein becomes more or less stable, and why.

Worked answers & discussion

Q1. Orange hydrophobic residues cluster in the interior core, shielded from solvent; blue polar/charged residues sit on the surface, in contact with water.
Q2. The exposed surface is predominantly blue (polar/charged). This makes sense because polar and charged groups can hydrogen-bond to and be solvated by water, so leaving them outside is energetically cheap, while hydrophobic groups are tucked away to minimize the entropic penalty of caging water.
Q3. It is mainly the water’s entropy that increases. Exposed nonpolar surfaces force surrounding water into an ordered, low-entropy cage; burying those surfaces in the core releases that water to the disordered bulk, raising its entropy. The favorable −TΔS of released water is the dominant driving force (the chain itself actually loses conformational entropy on folding).
Q4. The predictions should match: Leu, Ile, Phe, Val are nonpolar (aliphatic/aromatic) and confirm as buried in the core; Lys, Arg, Asp, Glu, Ser are charged or polar and confirm as surface, facing water. The side-chain chemistry predicts the location every time. Watch for the occasional boundary/amphipathic case — e.g. a partly exposed leucine on a loop or a serine whose –OH dips toward the core to hydrogen-bond — which is the expected exception, not a failure of the rule.
Q5. Crambin folds to one specific, reproducible shape with a clean core and a polar surface — and that pattern is dictated entirely by which residues in its sequence are hydrophobic versus polar. The sequence is the instruction set for “in vs. out,” exactly as Anfinsen’s experiments implied: the native fold is the thermodynamic minimum determined by the primary structure.
Q6. Burying a charged Asp in the core is destabilizing: you pay a large desolvation cost to strip its hydration shell and leave an unsatisfied charge in a greasy environment. Exposing a Phe on the surface is also destabilizing: it re-orders water around a nonpolar group, the very entropic penalty folding tries to avoid (and it removes that residue’s favorable packing in the core). Both mutations push ΔG the wrong way and tend to lower stability or promote misfolding/aggregation.
Take‑home. Proteins fold mainly to get their oily residues out of water. The hydrophobic effect — really a story about water’s entropy — sorts hydrophobic residues inward and polar ones outward, and the sequence is what decides that sorting. See it once in crambin and you will recognize the same inside-out logic in almost every globular protein.

Lab 26Inside an enzyme — the active site▶

How does an enzyme “know” exactly where to cut a protein — and why does trypsin always slice right after an arginine or lysine?

Why it matters. Trypsin is the workhorse enzyme of nearly every proteomics lab: it digests proteins into predictable peptides so mass spectrometers can identify them. Its razor-sharp specificity comes entirely from the 3D shape and charge of its active site. Reading that pocket directly teaches you the molecular logic behind enzyme recognition.

enzymesactive siteserine protease

You’ll learn to: identify the residues lining an enzyme’s binding pocket, recognise a salt bridge that drives specificity, measure a key contact distance in 3D, and tabulate the network of contacts that holds an inhibitor in place.

Background 1 — The catalytic triad

Trypsin is a serine protease: it hydrolyses peptide bonds using a catalytic triad of three cooperating residues — Ser195, His57 and Asp102. His57 acts as a base, pulling a proton off the Ser195 hydroxyl so its oxygen becomes a powerful nucleophile that attacks the substrate’s carbonyl carbon. Asp102 stabilises the protonated histidine, sharpening its catalytic power. This charge-relay system is the chemistry of the cut — but on its own it says nothing about where trypsin cuts. That job belongs to a separate pocket.

Background 2 — The S1 specificity pocket

Next to the triad sits the S1 specificity pocket, a deep slot that grips the side chain just before the bond to be cleaved. At its floor lies Asp189, a negatively-charged aspartate. Only a long, positively-charged side chain — arginine or lysine — can reach the bottom and form a stabilising salt bridge with Asp189. That single buried charge dictates trypsin’s “cut after Arg/Lys” rule. Here we use the inhibitor benzamidine (ligand BEN): its flat, positively-charged amidinium group mimics an Arg/Lys side chain, so it lodges in S1 and freezes the enzyme.

Background 3 — The oxyanion hole and tetrahedral intermediate

Recognition and the triad are only part of the story. As Ser195 attacks the carbonyl carbon, the substrate passes through a high-energy tetrahedral intermediate whose carbonyl oxygen carries a developing negative charge — an oxyanion. Trypsin stabilises it in an oxyanion hole, a cradle of backbone amide N−H groups from Gly193 and Ser195 that donate hydrogen bonds to the oxyanion. This transition-state stabilisation is the true source of catalytic rate enhancement: the enzyme binds the fleeting intermediate far more tightly than the substrate, lowering the activation barrier by orders of magnitude. Substrate-mimicking subsites (S1′, S2) flank the scissile bond and help register the polypeptide for an exact cut.

Structures — open in the workbench ↗

3PTB trypsin + benzamidine

Procedure

Open 3PTB in the workbench. In Structure (i), note the method, resolution, organism, and that there is one protein chain plus the ligand BEN.
In Representation (1) set Show as to cartoon and Color by to structure to see the two β-barrel domains typical of this enzyme family. Switch Color by to element later when you inspect contacts.
In Ligands & binding sites (3), click focus on BEN to zoom in on benzamidine, then click pocket. The residues within 5 Å appear as labeled sticks — this is the lining of the S1 specificity pocket. Find the label for Asp189 at the floor of the slot, and note Ser190 and Gly219 nearby.
Click contacts on BEN. Polar contacts are drawn as dashed lines with distances. Identify the salt bridge running from the benzamidine amidinium nitrogen to the carboxylate oxygen of Asp189, plus any backbone H-bonds to Gly219 or Ser190.
In Display & selection (4) turn on residue labels, then enable 📏 measure. Click one amidinium N atom of benzamidine, then the nearest carboxylate O of Asp189; record the distance (expect ~2.8–3.0 Å). Repeat for an amidinium N → Gly219 backbone carbonyl O and for an amidinium N → Ser190 contact.
Use the Sequence (5) panel to click residues 57, 102 and 195; watch them highlight and zoom. Note how close His57 and Ser195 sit to the bound inhibitor — the catalytic machinery is poised right beside S1. Toggle 💧 water in Display (4) to see ordered waters in the pocket, then save a Saved view (6).

Contact	Residue	Distance (Å)
Amidinium N → carboxylate O (salt bridge)	Asp189
Amidinium N → carboxylate O′ (2nd O)	Asp189
Amidinium N → backbone carbonyl O	Gly219
Amidinium N → side-chain/backbone O	Ser190
Catalytic-triad residues seen near pocket	Ser195 / His57 / Asp102	n/a (note positions)
Typical salt-bridge reference range	—	2.7–3.2

Questions

The S1 pocket floor carries Asp189. Explain in terms of charge why this makes trypsin cleave specifically after Arg and Lys, and not after a negatively-charged residue like Glu.
From your worksheet, what distance did you measure between the benzamidine amidinium N and the Asp189 oxygen? Is this consistent with a hydrogen-bonded salt bridge rather than a covalent bond, and how does it compare to your Gly219/Ser190 contacts?
Benzamidine is described as a competitive inhibitor. Using what the pocket view showed, explain how a small molecule that is not a peptide can still block trypsin.
The catalytic triad, the S1 pocket and the oxyanion hole are physically separate features. What distinct job does each one perform, and why does an enzyme need all three?
The amidinium group makes more than one polar contact to the pocket (you recorded several). Why does a network of contacts — rather than a single bond — give such high binding specificity and affinity?
If you mutated Asp189 to lysine, predict how the enzyme’s substrate preference would change.

Worked answers & discussion

Q1. Asp189 is negatively charged, so it electrostatically attracts and salt-bridges a positively-charged side chain. Arg (guanidinium) and Lys (ammonium) are long and cationic, reaching the floor to neutralise that charge favourably. A Glu side chain is anionic — it would be repelled by Asp189, so it never binds productively, and trypsin will not cleave there.
Q2. Typically ~2.8–3.0 Å for the amidinium N → Asp189 O salt bridge. That separation is the hallmark of a polar salt bridge / hydrogen bond (heavy-atom donor–acceptor distances of roughly 2.7–3.2 Å, the reference row in your table), far longer than a ~1.5 Å covalent bond. The Gly219 backbone and Ser190 contacts should fall in the same 2.8–3.2 Å band — all non-covalent, which is why the inhibitor is reversible.
Q3. The pocket view shows benzamidine occupying the same S1 slot a substrate’s Arg/Lys would use. Its amidinium group is a flat, rigid mimic of guanidinium, so it satisfies the same Asp189 salt bridge and shape complementarity. By plugging S1, it physically excludes substrate — classic competitive inhibition — even though it carries no scissile peptide bond.
Q4. The S1 pocket handles recognition: it selects which side chain (Arg/Lys) docks and thereby fixes the cleavage position. The catalytic triad (Ser195/His57/Asp102) handles chemistry: it performs the bond hydrolysis. The oxyanion hole (Gly193/Ser195 backbone N−H) handles transition-state stabilisation: it cradles the developing oxyanion of the tetrahedral intermediate, providing most of the rate enhancement. Separating recognition, catalysis and intermediate-stabilisation lets the same chemistry act on many substrates while specificity is tuned independently by reshaping the pocket.
Q5. Each contact is individually weak and reversible, but together they impose strict geometric and chemical demands: the ligand must present the right charge at the right distance and angle to all partners simultaneously. A wrong side chain might satisfy one contact but clash or miss on the others, so the combined network multiplies discrimination and sums to high affinity — the basis of molecular recognition and of rational inhibitor design.
Q6. Swapping the floor charge from negative (Asp) to positive (Lys) would repel Arg/Lys substrates and instead favour negatively-charged side chains (Asp/Glu). This is essentially the engineered “trypsin→Glu-specific” redesign idea — specificity follows the pocket charge.
Take‑home. An enzyme’s specificity and its chemistry live in separate, neighbouring sites: a shape-and-charge pocket that decides where to act, a catalytic triad that does the cutting, and an oxyanion hole that stabilises the fleeting intermediate. Benzamidine wins by faking the right side chain and satisfying the pocket’s whole contact network — the same trick rational drug design exploits.

Lab 27Structure-based drug design▶

How do you design a drug that has never existed — by staring at the very enzyme you need to stop?

Why it matters. When the HIV-1 protease structure was solved, chemists could finally see the target and sculpt molecules to fit it. The protease inhibitors that followed turned AIDS from a death sentence into a manageable condition, saving millions of lives. Indinavir, the drug you’ll examine, is a landmark of structure-based drug design — medicine engineered atom by atom from a crystal structure.

drug designHIV proteaseinhibitor

You’ll learn to: recognise a symmetric enzyme dimer, see how an inhibitor grips a catalytic cleft, measure the bridge to the catalytic dyad, and assess a drug’s drug-likeness against Lipinski’s rule with a quantitative worksheet.

Background 1 — A symmetric aspartic protease

HIV-1 protease is the molecular scissors the virus uses to chop its long polyprotein precursors into functional proteins — an essential maturation step. It is an aspartic protease: catalysis is performed by two Asp25 residues that activate a water molecule to hydrolyse the peptide bond. Crucially, the enzyme is a homodimer — two identical chains related by C2 symmetry, each contributing one Asp25 to a single shared active site at the dimer interface. Block that one cleft and the virus cannot mature. That single, druggable site made the protease a prime target.

Background 2 — Structure-based drug design

Structure-based drug design (SBDD) means using a target’s 3D structure to guide the shape, charge and hydrogen-bonding of a candidate molecule. Indinavir (ligand MK1) is a peptidomimetic: it imitates the transition state of the natural substrate but resists being cut, so it lodges permanently in the cleft. A central hydroxyl reaches between the two catalytic aspartates while bulky arms fill the symmetric subsites, gripping the enzyme far more tightly than substrate. Because indinavir was tailored to a real structure, it became a clinical antiretroviral — though, as you’ll see, it sits at the edge of conventional drug-likeness.

Background 3 — The flaps, the bridging water, and resistance

Two glycine-rich β-hairpin flaps fold down over the cleft and clamp the inhibitor in place, sealing it from solvent. In substrate complexes a single structural water bridges the inhibitor’s carbonyls to the flap Ile50 backbone amides — a tetrahedrally-coordinated water unique to retroviral proteases that later inhibitors were designed to displace. Because the virus mutates rapidly, residues lining the pocket (e.g. near positions 82, 84, 90) drift to weaken inhibitor contacts, producing drug resistance. SBDD therefore aims for contacts to conserved main-chain atoms and the catalytic Asp25 dyad, which the virus cannot easily mutate without crippling its own enzyme — a durable grip on an unchangeable feature.

Structures — open in the workbench ↗

1HSG HIV protease + indinavir

Procedure

Open 1HSG in the workbench. Read Structure (i): method, resolution, and the residue/chain counts. Confirm there are two protein chains plus the ligand MK1 (indinavir) and waters.
In Representation (1) choose Show as → cartoon and Color by → chain. The two chains are drawn in different colours — this is the homodimer. In Chains (2), click isolate on each chain in turn to see they are identical halves, then click Show all.
In Ligands & binding sites (3), click focus on MK1 to zoom to indinavir buried at the centre, then click pocket to reveal the lining residues as labeled sticks. Note how residues from both chains contribute — the drug straddles the dimer interface. Look for the Ile50 flap residues capping the cleft.
Click contacts on MK1. Observe the dashed polar contacts and distances showing how indinavir hydrogen-bonds to the cleft, including near the catalytic Asp25 pair. Use the Sequence (5) panel to click residue 25 in each chain and watch both catalytic aspartates highlight.
Turn on 📏 measure in Display & selection (4) and measure from indinavir’s central hydroxyl O to a nearby Asp25 carboxylate O to confirm it bridges the catalytic dyad (record this; expect ~2.7–3.2 Å). Toggle 💧 water to look for an ordered flap-bridging water near Ile50.
Click ⚛ Studio on MK1 to send indinavir into the molecule Studio (its SMILES loads). Read off MW, cLogP, H-bond donors, H-bond acceptors and TPSA, plus the Lipinski drug-likeness summary. Enter each into the worksheet below and mark Pass/Fail. Back in the workbench, Save view (6).

Property	Indinavir (from ⋮ Studio)	Rule limit	Pass?
Molecular weight (g/mol)		< 500
cLogP		< 5
H-bond donors		≤ 5
H-bond acceptors		≤ 10
TPSA		≤ ~140 Å²
Hydroxyl O → Asp25 O distance (Å)		2.7–3.2 (H-bond)

Questions

HIV protease is a homodimer with one active site at the interface. Why is dimerisation essential for catalysis, and why does that make dimer-disruption an alternative drug strategy?
From the contacts view and your measured hydroxyl–Asp25 distance, describe how indinavir physically blocks catalysis. Why does the enzyme fail to cut a molecule that fills its own active site?
Using your completed Lipinski worksheet, evaluate indinavir against the rule of five (MW < 500, cLogP < 5, donors ≤ 5, acceptors ≤ 10, TPSA ≤ ~140 Å²). Which row(s) fail, and by roughly how much does MW exceed the limit?
Indinavir is a large peptidomimetic that bends Lipinski’s guidelines yet still works as an oral drug. What does that tell you about treating drug-likeness rules as guides rather than laws?
Background 3 noted the rapidly-mutating virus. Why is targeting contacts to the conserved Asp25 dyad and main-chain atoms a smarter long-term strategy than gripping variable side chains?
How did having the 1HSG structure in hand speed the design of an inhibitor compared with blind screening?

Worked answers & discussion

Q1. Each chain donates only one Asp25; a complete active site needs both, so a single chain is catalytically dead. The functional enzyme exists only as the dimer. Because the interface itself is required, drugs that prevent the two halves from coming together (dimerisation inhibitors) can shut the enzyme down without touching the catalytic site — a genuine second strategy.
Q2. The contacts view shows indinavir hydrogen-bonded across the cleft, with its central hydroxyl reaching the Asp25 dyad (you should measure ~2.7–3.2 Å — a true H-bond) and bulky groups packing the symmetric subsites under the closed flaps. It occupies the substrate’s space but, as a peptidomimetic transition-state mimic, has a non-hydrolysable hydroxyethylene unit in place of the scissile peptide bond — so it jams the site as a tight competitive inhibitor instead of being processed.
Q3. Indinavir has MW ~614, which exceeds the 500 limit by roughly 110–115 g/mol — the clear, decisive violation. Its hydrogen-bond donor/acceptor counts sit near or at the upper bounds and TPSA is high, so it is a textbook borderline / one-clear-violation case; cLogP is moderate and typically passes. Your Pass? column should read fail on MW and pass (or borderline) on the rest.
Q4. Lipinski’s rule predicts oral absorption tendencies; it is statistical, not absolute. Indinavir’s tight, structure-guided fit and formulation overcome its size, proving that a molecule optimised against a real target can succeed despite breaking the “rules.” The rules flag risk, they don’t forbid a compound — many peptidomimetic and natural-product drugs live outside rule-of-five space.
Q5. Side chains lining the pocket can mutate (e.g. near 82/84/90) to weaken inhibitor contacts while the virus still functions, breeding resistance. The catalytic Asp25 dyad and backbone amides are conserved because the enzyme cannot mutate them without destroying its own catalysis. Anchoring the drug to those invariant features makes escape mutations costly to the virus, so the inhibitor stays effective longer.
Q6. With the structure, chemists could see the exact shape, the C2-symmetric subsites, the flaps and the catalytic aspartates, then design a molecule complementary to that cleft — placing groups precisely where favourable contacts exist and even displacing the bridging water. That rational, hypothesis-driven design dramatically narrows the chemical search versus screening millions of random compounds blindly.
Take‑home. Seeing the target changes everything. The 1HSG structure let chemists sculpt indinavir to grip a symmetric catalytic cleft at its conserved Asp25 dyad, and the molecule Studio worksheet shows the trade-offs — a borderline-Lipinski drug (MW ~614) that still saved millions of lives. That is the power, and the pragmatism, of structure-based drug design.

Lab 28AlphaFold & the protein-folding problem▶

A protein’s amino-acid sequence is just a string of letters — so how does a computer fold it into a working 3D machine, and how much should you trust the answer?

Why it matters. For 50 years, predicting a protein’s shape from its sequence was biology’s great unsolved problem. AlphaFold cracked it in 2020, and now millions of predicted structures guide drug design, enzyme engineering and disease research — but only if you can read where the model is confident and where it is guessing.

structural predictiondeep learningconfidence

You’ll learn to: interpret a pLDDT confidence map from the B-factor column, locate high- and low-confidence regions, compare a predicted fold to an experimental structure, and judge what an AlphaFold model can and cannot tell you.

Background 1 — Anfinsen and the folding problem

In the 1960s Christian Anfinsen showed that a denatured enzyme refolds spontaneously into its active shape — proving that the amino-acid sequence alone encodes the native 3D structure. This is Anfinsen’s dogma. Yet knowing that the answer is hidden in the sequence did not mean we could compute it. A chain can adopt astronomically many conformations (Levinthal’s paradox), and physics-based simulation was far too slow. For five decades, the protein-folding problem — predicting structure from sequence — remained one of biology’s hardest open challenges, tested every two years at the blind CASP competition.

Background 2 — AlphaFold, CASP14 and pLDDT

At CASP14 in 2020, DeepMind’s AlphaFold achieved near-experimental accuracy, effectively solving the problem. Its deep-learning network reads patterns of co-evolution across related sequences to infer which residues sit close in space. Crucially, each model carries a per-residue confidence score, pLDDT (0–100), stored in the B-factor column. In MoleBench, Color by → B-factor therefore paints confidence: blue ≈ high (>90) in well-folded cores, red/orange ≈ low in flexible loops, chain termini and intrinsically disordered regions. Low pLDDT often flags genuine floppiness, not just model error.

Background 3 — Reading the pLDDT scale and its limits

The pLDDT scale has four agreed bands: >90 very high (backbone and most side chains reliable), 70–90 confident (backbone reliable), 50–70 low (treat with caution) and <50 very low (often a sign of intrinsic disorder, not just error). Because pLDDT lives in the B-factor slot, a blue→red ramp reads as high→low confidence on an AlphaFold model — the opposite meaning of B-factor on an X-ray structure, where red marks mobile atoms. Remember the score is the network’s self-assessment of local accuracy; it does not certify the relative orientation of distant domains, which needs the separate PAE map.

Structures — open in the workbench ↗

P69905 AlphaFold · Hb α 4HHB experimental Hb

Procedure

Open the AlphaFold model P69905 (human haemoglobin α-subunit, a prediction). In the Structure (i) panel, note the method reads as a computed/predicted model, with one chain.
Set Representation (1) → Show as → cartoon so you can see the fold of α-helices.
In Representation (1) → Color by, choose B-factor. Because this is an AlphaFold model, the colours now read pLDDT confidence: blue = confident, red/orange = uncertain.
Use ⟲ fit to frame the whole chain. Look at the helix cores versus the N- and C-termini and tight loops — spin with ◌ to inspect them.
Open the Sequence (5) panel. Click a residue near a terminus, then one in the middle of a long helix, and watch the viewer zoom to each — relate its colour to its sequence position.
For each region in the worksheet, record the dominant colour (blue/green/orange/red) you see under Color by → B-factor, then translate it into a confidence band (high/med/low) using Background 3.
Now open the experimental 4HHB (X-ray hemoglobin). In Chains (2), isolate a single α-chain (A or C) so you compare like with like.
Set 4HHB to cartoon and Color by → structure, then visually compare its helix layout to the P69905 prediction and complete the final worksheet row.

Region	pLDDT colour (blue/green/orange/red)	Confidence (high/med/low)
Helix core (mid-chain)
Surface loops
N-terminus
C-terminus
Predicted fold vs experimental 4HHB α-chain — match? (Y/N)

Questions

What does a pLDDT value of 95 versus 45 tell you about a residue? Which colour is which under Color by → B-factor, and which confidence band does each fall in?
From your worksheet, which region scored the lowest confidence? Give two physical reasons why those regions score poorly.
Why does a red region mean the opposite thing on this AlphaFold model than it would on the experimental 4HHB structure coloured by B-factor?
How does the single predicted α-chain compare to one isolated α-subunit of experimental 4HHB — same overall fold, or different? Cite your final worksheet row.
Why is fast, accurate structure prediction so valuable for biology and medicine? Give a concrete example.
Name two things an AlphaFold model does not directly give you that the experimental 4HHB structure does.

Worked answers & discussion

Q1. pLDDT is a per-residue confidence on a 0–100 scale held in the B-factor column. A value of ~95 means the model is highly confident the local backbone (and most side chains) is correct — coloured blue, the very high (>90) band; ~45 means very low confidence — coloured red/orange, the very low (<50) band. It is the model’s self-assessment, not an experimental error bar.
Q2. The N- and C-termini and surface loops score lowest (orange/red on the worksheet). These regions are genuinely flexible/disordered (few stabilising contacts, so no single “right” conformation), and they carry fewer co-evolutionary constraints from related sequences for the network to exploit — so the prediction is both harder and physically less defined.
Q3. The colours share a column but mean different things. On an AlphaFold model the B-factor slot holds pLDDT confidence, so red = low confidence. On an X-ray structure like 4HHB the same column holds the true crystallographic B-factor (atomic mobility), so red = high thermal motion. Same ramp, inverted meaning — always check whether you are looking at a predicted or experimental model.
Q4. The folds match closely (worksheet final row = Y): both show the classic globin fold — a bundle of α-helices (no β-sheet) cradling the heme site. This is the point: AlphaFold reproduces the experimentally determined subunit fold to near-atomic accuracy from sequence alone.
Q5. Structures reveal mechanism and druggable pockets. With predictions for nearly every human protein, researchers can model the effect of disease mutations, target previously “structure-less” proteins, and design or repurpose drugs — e.g. understanding how a sickle-cell or thalassemia mutation perturbs the globin fold, or screening inhibitors against an enzyme that was never crystallised.
Q6. AlphaFold gives a single static predicted conformation. It does not directly give you the quaternary assembly (4HHB is the α₂β₂ tetramer), the bound heme/ligands and ions, the dynamics (the T↔R allosteric motion of hemoglobin), or experimental resolution/electron density. Acceptable answers: ligands, oligomeric state, dynamics, solvent/waters.
Take‑home. AlphaFold turned a 50-year grand challenge into a one-click prediction — but a structure is only useful if you read its pLDDT confidence (blue high → red low) and remember it models the static fold of a chain, not the living, ligand-bound, multi-subunit machine.

Lab 29The DNA double helix▶

Two strands, four letters, one of the most famous shapes in science — what holds the DNA double helix together, and why does its sequence change how easily it comes apart?

Why it matters. Every time a cell divides, reads a gene, or a PCR machine copies DNA, the double helix must zip and unzip. Base pairing and groove geometry decide how stable the helix is and how proteins find their target sequences — the molecular basis of heredity, gene regulation and modern biotech.

nucleic acidsbase pairingB-form DNA

You’ll learn to: identify the two antiparallel strands and the sugar-phosphate backbone, measure Watson–Crick hydrogen-bond distances, distinguish the major and minor grooves, and reason about how GC content sets DNA stability.

Background 1 — Watson–Crick base pairing

DNA stores information as a sequence of four bases: adenine (A), thymine (T), guanine (G) and cytosine (C). They pair with strict complementarity by hydrogen bonds: A–T forms 2 H-bonds and G–C forms 3 H-bonds. Each pair links a purine to a pyrimidine, giving every rung the same width so the helix stays uniform. Because G–C has an extra hydrogen bond (and stronger base-stacking), GC-rich DNA is more thermally stable than AT-rich DNA. This complementary pairing is what lets each strand serve as a template to rebuild the other during replication.

Background 2 — The antiparallel B-form helix

The two strands are antiparallel: one runs 5′→3′ while its partner runs 3′→5′ alongside it. The outside is a sugar-phosphate backbone (deoxyribose linked by phosphates), and the bases point inward. The Dickerson dodecamer (1BNA) is the textbook B-form: a right-handed helix with about 10 base pairs per turn, a rise of ~3.4 Å per pair and a twist near 34°. Because the two backbones are not diametrically opposite, the helix surface has a wide major groove and a narrow minor groove — the channels DNA-binding proteins reach into to read the sequence.

Background 3 — Geometry, grooves and melting

The regular geometry is what makes DNA both stable and readable. With ~10 bp per turn and a 34° twist between adjacent rungs, the bases stack like coins, and overlapping π-systems supply the base-stacking energy that — together with H-bonds — holds the duplex shut. The 3.4 Å rise sets the helix pitch at ~34 Å per turn. Heating breaks H-bonds and unstacks the bases; the midpoint of this denaturation is the melting temperature (T_m). Because G–C rungs carry an extra bond and stack better, GC content raises T_m — the rule behind PCR primer design and probe selection.

Structures — open in the workbench ↗

1BNA Dickerson B-DNA

Procedure

Open 1BNA (the Dickerson dodecamer). In the Structure (i) panel, note the resolution and that it contains 2 chains — in Chains (2) you should see both strands listed.
Set Representation (1) → Show as → sticks so every atom of the backbone and bases is visible.
Choose Color by → chain to give each strand its own colour — trace one backbone and confirm the two strands run in opposite (antiparallel) directions.
Switch Color by → element to pick out the N (blue), O (red) and backbone P (phosphorus) atoms; the H-bonds of a base pair run between N and O atoms.
Open Sequence (5), click a G on one strand to zoom to it, and locate its paired C on the opposite strand; then find an A–T pair the same way.
Use 📏 measure (click 2 atoms → distance): on the A–T pair, click an N or O donor on one base, then its partner N/O across the rung — a Watson–Crick H-bond reads ~2.8–3.0 Å. Record it in the worksheet, then repeat on the G–C pair.
With 📏 measure, click an atom on one base pair and the equivalent atom on the next pair up the helix to estimate the rise (~3.4 Å); record it.
Count the rungs through one full turn of the helix to confirm ~10 base pairs per turn, and enter it.
Use ◌ spin (and ⟲ fit) to rotate the helix and identify the wide major groove and the narrow minor groove. Save the best angle in Saved views (6).

Base pair	# H-bonds	Measured donor–acceptor distance (Å)
A–T pair	2
G–C pair	3
Helical rise per base pair	—
Base pairs per turn	—

Questions

Why is a G–C pair more stable than an A–T pair? Refer to the H-bond counts pre-filled in your worksheet.
What does antiparallel mean, and how did Color by → chain help you confirm it?
What donor–acceptor distance did you record for the A–T and G–C N–N / N–O contacts, and what does that distance represent physically (versus a covalent bond)?
The rise and base-pairs-per-turn you measured (~3.4 Å and ~10) identify which DNA form, and what helical pitch (Å per turn) do they imply?
How can you tell the major groove from the minor groove, and why do many DNA-binding proteins prefer the major groove?
Predict: two DNA samples of equal length, one ~70% GC and one ~30% GC. Which has the higher melting temperature (T_m), and why?

Worked answers & discussion

Q1. G–C is held by 3 hydrogen bonds versus only 2 for A–T (the pre-filled worksheet counts), and G–C pairs also stack more favourably. More bonds plus stronger stacking means more energy is needed to separate the strands, so G–C-rich regions are more stable.
Q2. Antiparallel means the two strands run in opposite 5′→3′ directions — one chain’s 5′ end sits beside the other’s 3′ end. Colouring by chain gives each strand a distinct colour, so tracing one backbone from end to end shows it heading the opposite way to its partner.
Q3. Both should read roughly 2.8–3.0 Å. That is the donor–acceptor distance of a Watson–Crick hydrogen bond between an N–H and an N or O on the partner base — close enough to bond but not a covalent contact (a covalent N–O bond would be ~1.4 Å). Each rung is the same width because purine+pyrimidine always spans the same distance.
Q4. A ~3.4 Å rise with ~10 bp per turn is the signature of right-handed B-form DNA. Multiplying rise by bp/turn gives a helical pitch of ~34 Å per turn (with a ~34° twist between adjacent pairs).
Q5. The major groove is wide and shallow; the minor groove is narrow and deep. Because the backbones are offset rather than diametrically opposite, the two grooves differ in width. Proteins favour the major groove because it exposes more of each base’s edge, giving a richer, sequence-specific pattern of H-bond donors/acceptors to read — how transcription factors recognise their target sites.
Q6. The 70% GC sample has the higher T_m. More G–C pairs means more 3-bond rungs and stronger stacking, so more heat is required to denature the duplex. This is exactly why GC content is used to design PCR primers and predict where DNA melts — AT-rich stretches open first.
Take‑home. The double helix is a uniform, antiparallel ladder whose stability is tuned base by base: 2 bonds for A–T, 3 for G–C. Reading those rungs at ~3 Å, the ~3.4 Å rise and the offset grooves explains both how DNA is copied faithfully and how proteins find the right gene.

Open the Studio Back to tutorials

Labs

⬡ General labs

★ Advanced labs

🧬 Structural biology labs