Hogg's Research

number of exposures per visit?

2026-02-15T09:13:00.002-05:00

I wrote code this weekend to look at the question of how we should visit a star in the upcoming Terra Hunting Experiment. The current (straw-person) plan is that we will observe each visible star once per night for ten years, with one exposure of a sensibly-chosen exposure time at each visit. Is this a good idea? I was interested in this problem for two reasons. The first is that binning is sinning, with the corollary that bigger bins are worse than finer bins, and a single, long exposure is a very big bin. The second reason is that when there are non-trivial noise sources (like the quasi-periodic variations from p-mode oscillations of the surfaces of Sun-like stars), a few negatively- (or interestingly-) correlated noise draws can be combined in ways that are substantially more informative than by taking the average.

Of course, if you split an exposure (with a standard CCD, say) into sub-exposures, you take on real costs: There is a read time, which is time you aren't integrating, and there is a read noise, that affects each new exposure. So the best strategies are a complicated function of read time, read noise, and the signal-to-noise at which the stellar p-mode oscillations are visible in any realistic data. Related: There are amazingly different and interesting strategies with up-the-ramp detectors that are used in the infrared.

One final comment is that the objective, in my strongly held view, is to optimize the amount of information (about, say, the center-of-mass radial-velocity changes of the target star) per unit wall-clock time. We are paying for wall-clock time; let's get as much as we can out of it.

The LLMs, and why do we do astrophysics?

2026-02-12T19:06:00.004-05:00

Today my rant on LLMs and the practices of our field hit the arXiv. I was scared to post it, because it is such a weird contribution, and it is so revealing about myself and my own political positions and hangups. But I have to say: I got great and supportive feedback all day.

I got two comments on saying ACAB in the literature. The Astronomer Royal of Scotland quoted (on BlueSky) the last sentence, which I put there because Andy Casey (Monash, Flatiron) insisted. Many people sent me appreciation and thank-yous, and many people sent me comments and objections. Always constructive. The whole experience made me feel very happy about the state of our field and the way we all interact. I think maybe there will be critical mass to write some kind of collection of essays on the subject. That's a plan for 2026.

substellar objects (brown dwarfs)

2025-11-21T15:09:00.007-05:00

I spent the day at the NSBP / NSHP meeting in San José. My favorite session of the day was the morning astro session, which was entirely about brown dwarfs. I learned a lot in a very short time. Caprice Phillips (UCSC) introduced the session with an introduction to the scientific and technical questions in play. She put a lot of emphasis on using binaries and clusters to put detailed abundance ratios onto substellar objects. This was what I expected: I thought (walking in to this session) that all known abundance ratios for brown dwarfs were from such kinds of studies. I learned different (keep reading).

Gabriel Munoz Zarazua (SFSU) followed by showing spectra from M-dwarfs, brown dwarfs, and Jupiter. It definitely looks like a sequence. He does spectral fitting (what they call, in this business, retrievals). It looks like he is getting very good, somewhat precise, abundance ratios for the photospheres of substellar objects! I asked more about this in the question period, and apparently I am way behind the times (Emily Rauscher, Michigan, helpfully pointed this out to me): Now brown-dwarf photosphere models are so good, they can be used to measure abundances, and pretty well.

I also learned in this session (maybe from Jorge Sanchez, ASU, or maybe from Efrain Alvarado, SFSU) that there is a very strong mass–abundance relation in the Solar System. That is, we don't expect, if brown dwarfs form the way planets do, that the detailed abundances of the brown dwarfs will match exactly the detailed abundances of the primary stars. But now we are really in a position to test that. Sanchez showed that we can get, from even photometry, abundances for substellar objects in the Milky Way halo. Again, totally new to me! And he finds metallicities at or below −3. Alvarado showed data on an amazing system J1416, which is an L–T binary with no stellar companion. Apparently it is the only known completely substellar binary.

integrating out nuisances

2025-07-28T15:04:00.001-04:00

Further insipired by yesterday's post about binary fitting, I worked today on the treatment of nuisance parameters that have known distributions. These can be treated as noise sometimes. Let me explain:

If I had to cartoon inference (or measurement) in the face of nuisance parameters, I would say that frequentists profile (optimize) over the nuisances and Bayesians marginalize (integrate) over the nuisances. In general frequentists cannot integrate over anything, because there is no measure in any of the parameter spaces. But sometimes there is a measure. In particular, when there is a compact symmetry:

We know (or very strongly believe) that all possible orientations of a binary-star orbit are equally likely. In this model (or under this normal assumption) we have a distribution over two angles (theta and phi for that orbit pole, say); it is the distribution set by the compact group SO(2). Thus we can treat the orientation as a noise source with known distribution and integrate over it, just like we would any other noise source. So, in this case (and many cases like it) we can integrate (marginalize) even as frequentists. That is, there are frequentism-safe marginalizations possible in binary-star orbit fitting. This should drop the 12-parameter fits (for ESA Gaia data) down to 8-parameter, if I have done my math right.

binary stars with periods of exactly one year

2025-07-27T10:58:00.011-04:00

On Friday, Kareem El-Badry (Caltech) gave a seminar about looking for (and finding!) stars in binary orbits around dark or much darker companions, like black holes, neutron stars, and white dwarfs. He showed results that involve ESA Gaia astrometry, where he noted that the Gaia Mission has no sensitivity to periods right at (or within an inverse mission-length frequency difference of) one-year periods (inverse year frequencies). After the talk I objected that these are not exactly degenerate; El-Badry said that the inferences blow up there.

I spent some time on the weekend thinking about this point, and I now understand it: There is a particular one-year orbit that a star can have (around a darker companion) such that the photocenter of the system makes a motion that is identical to the apparent parallax motion. Thus there is an exact degeneracy between the parallax and a certain one-year orbit.

Does that mean that we can't measure orbits at one year (or, for that matter, parallaxes)? No, it does not. After all, the parallax ellipse has a particular celestial (angular) shape and phase. But it might require some kind of reparameterization of orbits near one-year periods. I think I know how to do that. Should we find the missing binaries? (Oh and by the way, this degeneracy means that, in a strict frequentist sense, Gaia can't measure parallaxes at all without additional information.)

how significant is your anomaly?

2025-07-24T15:50:00.022-04:00

So imagine that you have a unique data set Y, and in that data set Y you measure a bunch of parameters θ by a bunch of different methods. Then you find, in your favorite analysis, your estimate of one particular parameter is way out of line: All of physics must be wrong! How do you figure out the significance of your result?

If you only ever have data Y, you can't answer this question very satisfactorily: You searched Y for an anomaly, and now you want to test the significance. That's why so many a posteriori anomaly results end up going away: That search probably tested way more hypotheses than you think it did, so any significances should be reduced accordingly.

The best approach is to use only part of your data (somehow) to search, and then use a found anomaly to propose a hypothesis test, and then test that test in the held-out or new data. But that often isn't possible, or it is already too late. But if you can do this, then there is usually a likelihood ratio that is decisive about the significance of the anomaly!

I discussed all these issues today with Kate Storey-Fisher (Stanford) and Abby Williams (Chicago) today, as we are trying to finish a paper on the anomalous amplitude of the kinematic dipole in quasar samples.

finding emission lines (and other oddities) in hot stars

2025-07-23T09:00:00.001-04:00

I showed my robust spectral decomposition (dimensionality reduction) and residuals to the MPIA Binaries group today. There was much useful feedback (including that my H-gamma was actually H-delta; embarassing!). One comment was that the model isn't truly a causal separation between star and lines, so there will be some mean lines in the star model; lines aren't entirely outliers. That's true! The group suggested that I iterate to remove stars with lines from the training set.

After the meeting, I implemented some of that, but problems like this have a pathology: If you carefully remove stars with high residuals at some wavelength, then the training data will be deficient, or low, at that wavelength. And then the model will go lower, and then more stars will have excess at that wavelength and: Disaster. So when I implemented, I required a 2-sigma deviation, and I removed both high and low outliers. I don't know if this will work, but I am testing now.

wrote like the wind; frequentist vs Bayes on sparsity

2025-07-21T14:54:00.009-04:00

My goal this year in Heidelberg is to move forward all writing projects. I didn't really want to start new projects, but of course I can't help myself, hence the previous post. But today I crushed the writing: I wrote four pages in the book that Rix (MPIA) wants me to write, and I got more than halfway done with a Templeton Foundation pre-proposal that I'm thinking about, and I partially wrote up the method of the robust dimensionality reduction that I was working on over the weekend. So it was a good day.

That said, I don't think that the iteratively reweighted least squares implementation that I am using in my dimensionality reduction has a good probabilistic interpretation. That is, it can't be described in terms of a likelihood function. This is related to the fact that frequentist methods that enforce sparsity (like L1 regularization) don't look anything like Bayesian methods that encourage sparsity (like massed priors). I don't know how to present these issues in any paper I try to write.

young stars in SDSS-V

2025-07-19T16:53:00.019-04:00

Over the last two weeks, I built a new robust dimensionality-reduction method called Robust-HMF. This method is a hammer, looking for a nail. This week, Hans-Walter Rix (MPIA) suggested that I use the method on young stellar objects observed in SDSS-V BOSS spectra. I did that, and I found hundreds of young stars with narrow H-alpha emission lines. It turns out that the Robust-HMF method does really well on these data; it can fit all the absorption lines in the stars, plus all the wacky continuum shapes generated by a combination of instrumental effects and dust attenuation. That is, I can run the method on more-or-less raw data, provided that it has been shifted to rest frame (and the raw SDSS-V pipelines do that pretty well on stars like these).

Anyway, I don't know what to say about my results quantitatively yet, but I find hundreds of emission-line stars, and my equivalent-width sensitivity is excellent. Who wants this sample?

SPHEREx data

2025-07-18T09:00:00.001-04:00

Dustin Lang (Perimeter) and I spoke today about many things, but the conversation got de-railed when Lang showed me his visualizations of the brand-new NASA SPHEREx data. Oh. My. Goodness. First of all, the data are being released daily, before the team has done its analysis, so that anyone in the world can do anything with it. Talk about open! Talk about international! Talk about everything I believe about astrophysics! Also, the data are well documented, high in signal to noise, and released with good and useful metadata. This is killer. Of course Lang has already made a viewer for it and can do all comparisons to the DESI data. Get in touch with him if you want tools.

should I write a book?

2025-07-15T05:29:00.001-04:00

Is it research to have a set of conversations about whether to write a book on data analysis? Hans-Walter Rix (MPIA) thinks I should put together my arXiv-only submissions plus a lot more into a book about data analysis. His point of view is that the most important thing is how to convert an ill-posed question about the Universe into a well-posed operation on data.

robust matrix factorization

2025-07-13T14:18:00.001-04:00

There is a very nice algorithm and set of methods called “Robust PCA,” originating in a paper by Candès. This method makes use of ideas from convex optimization to simultaneously learn a low-rank representation of the data plus a sparse representation for the outliers. This kind of situation comes up in astronomy all the time.

Way back, Tsalmantza and I made a replacement for PCA called HMF that deals with heteroskedastic data (variable error bars or variable data weights) and also missing data; we used it to build low-rank models of quasar spectra. This weekend I built a robust version of HMF (Robust HMF maybe?) that uses ideas from iteratively reweighted least squares to mimic the algorithm behind Robust PCA. It works! And it works well. Unfortunately, right now the investigator has to tune the rank of the low-rank part and also the soft outlier cutoff used in the IRLS. I would love to figure out principled ways to choose both. If you want to follow along, development is happening here for now.

is it surprising that there are high-redshift supermassive black holes?

2025-07-11T09:43:00.001-04:00

A nice talk at MPIA by Hanna Ũbler (MPE) about very high-redshift (redshifts 8 to 14 even) galaxies and their black-hole contents started some nice discussions in the audience and afterwards about the formation of black holes. Because of the Eddington limit (which is a limit on luminosity), black-hole growth is probably limited. The limit is on luminosity, not mass accretion rate, and the relationship between these is the radiative efficiency. As the efficiency goes down, the mass accretion rate of an Eddington accretor goes up. So the question: When can you first form a super-massive (10 million solar masses, say) black hole is a question simultaneously about seed black holes and about radiative efficiency. Anyway, this is all unfortunate, because if the efficiency couldn't get very low, then the black holes we find with NASA JWST would already be putting very strong pressure on fundamental physics in the early Universe.

coherent oscillator injection and recovery

2025-07-10T12:36:00.002-04:00

Coherent oscillators—astronomical sources that pulse or oscillate in a phase-stable way over long timescales—have been useful astrophysical tools. For two examples: Stably pulsing pulsars were used to discover gravitational radiation (and are being used to find the stochastic background). Delta-scuti star asteroseismic modes were used to find orbital companions. This led undergrad Nana Miller (NYU) and me and others to look for all the coherent modes we can find among all the stars in the NASA Kepler Mission data. We have have technology to find modes, and we have technology to test for coherence. Now we have to do injection–recovery tests to estimate our detection limits. Today I delivered a simple plan for doing injections.

Our plan is to produce a catalog, not of stars but of modes, every one of which has a coherence time that is longer than the lifetime (4 years) of the Kepler Mission. Then: What do we use them for?

likelihood ratios not posteriors, please

2025-07-09T13:22:00.002-04:00

There is an informal meeting at MPIA every Wednesday regarding binary stars, with a bit of a focus on massive binaries. Today there was a very nice presentation by Jakob Stegmann (MPA) about some anomalies among the (only six) black-hole–neutron-star binaries discovered by NSF LIGO. He showed the example of GW 200105, which shows a large eccentricity (0.15-ish). This eccentricity is very hard to explain, given how the inspirals evolve as they radiate. But the analysis of the eccentricity (from perhaps this paper) is Bayesian, so it isn't clear how much the eccentricity result is forced by the data and how much is forced by the prior over nuisance parameters. That's one of the main points of my forthcoming paper on measurement. I think maybe I should just re-analyze this one with a profile likelihood. I hope the data and code are public!

robust dimensionality reductions

2025-07-08T10:12:00.001-04:00

Dimensionality reduction (the basic being PCA) is very sensitive to outliers: A single bad pixel can dominate most objectives and thus create a spurious dimension. One of the best and most classic solutions to this is the robust PCA method, which is presented in a (very long) paper with impressive math and beautiful results. Yesterday Hans-Walter Rix (MPIA) and I coded it up and applied it to ESA Gaia RVS spectra, with extensive (and impressive) help from Claude. It looks very promising, especially in capturing oddities in hot stars. Today I worked out that there should be something similar that takes into account data weights (inverses of squared uncertainties), and I wrote down the algorithm (on paper). We'll see.

stellar twins vs synthetic stellar twins

2025-07-07T10:03:00.001-04:00

In the Milky Way meeting at MPIA today, a bit of a discussion broke out about using stellar twins, inspired by work by Yuan-Sen Ting (OSU). The idea is: If you have two stars with very similar overall metallicity, and very similar temperature and surface gravity, then it should be possible to measure accurate element abundnace anomalies between the two stars, even in the absence of an extremely accurate spectral synthesis code.

My view, which does not contradict this point, is that an even better way to use this stellar-twin idea is to synthesize a twin for every star, using stars that are similar in (either) parameters or else spectral space. After all, an interpolation to your target star should more accurately represent it than even the most similar individual comparison star. That idea, fundamentally, is the main idea behind The Cannon.

how did the Solar System form?

2025-07-04T09:49:00.007-04:00

I saw a very nice talk today by Philippine Griveaud (MPIA) about how the Solar System formed. The idea is that the giant planets formed in an accretion disk. Their formation opened gaps and caused migration (first Type I and then Type II, if you must know :). That migration pulled them into a resonant chain. That is, if the giant planets formed the way we think they formed, they must have been in a resonant chain. But they aren't in such a chain now; what gives?

The idea is that when the gas is expended (or blown out by winds), the remaining planetestimals (think: asteroids, comets, Kuiper Belt objects) interact with the planets such that they get moved from orbit to orbit and eventually ejected. These dynamical interactions break the resonant chain, migrate the giant planets to their current locations, and scatter rocks and ice balls into the interstellar regions.

It was a great talk, but also led to a lot of interesting questions, such as: How does this all fit in with the formation of the rocky planets? And how does this square with our observations (growing rapidly, apparently) of interstellar asteroids? Oh and: How does all this connect to observations of debris disks, which I now (officially) love.

what is measured with stellar kinematics?

2025-07-02T05:38:00.002-04:00

In work on Galaxy dynamics, from stellar kinematics, we measure relative velocities and relative positions, of nearby stars relative to the Sun (or really the Solar System barycenter). These relative positions and velocities are coordinate free, in the sense that they don't imply a rest frame for anything (and indeed, the SS barycenter is not anywhere near the rest-frame position or rest-frame velocity of the Milky Way or Local Group or anything else).

In addition to this, any measurements we make are insensitive to any overall or external acceleration: If the Milky way is in free-fall, accelerating towards some external “great attractor” or anything else, none of these observables are affected in any way by that acceleration. So what is it that stellar kinematics can really be used to measure? I think somehow the answer has to be Galilean covariant (covariant to boosts and translations), but even better it should be generally covariant (in the Newtonian sense, which is well defined, apparently).

I did some research on this subject, and the literature is all about Newton–Cartan theory, but this theory is a Newtonian limit of general relativity. That isn't quite what we care about in stellar kinematics, since in stellar kinematics, we don't get to see any orbits as a function of time (we don't observe geodesics or geodesic deviation). What, exactly do we observe? I think what we observe is something about gradients of accelerations, but I don't know yet. Great project for this summer.

scattering in the Universe

2025-06-27T10:41:00.007-04:00

Distant objects in the Universe look the same as nearby ones, in the sense that a redshift 8 quasar looks like a point source just the same as a nearby M dwarf star looks like a point source. This must, somehow, strongly constrain the scattering properties of the Universe. The Universe is very transparent, but there definitely is scattering and absorption (by dust, by gas, by rocks); why don't we see this as some kind of “blurring” of distant sources? I discussed this today in the cafeteria at Flatiron with Charlie Epstein (Flatiron) and Marsha Berger (NYU).

My question was: What can we learn about scattering in the Universe from this fact that distant point sources look the same as nearby ones. They didn't have a simple answer. Epstein, however, said that, in the linear continuum limit, all that matters is epsilon and mu, so maybe we should just think about epsilon and mu fields that depend on position and frequency. Interesting thought. He also pointed out that pulsing sources should be dispersed (and they are!), and that dispersion should go up with distance (and it does!).

possible Trojan planet?

2024-12-09T08:58:00.002-05:00

In group meeting last week, Stefan Rankovic (NYU undergrad) presented results on a very low-amplitude possible transit in the lightcurve of a candidate long-period eclipsing binary system found in the NASA Kepler data. The weird thing is that (even though the period is very long) the transit of the possible planet looks just like the transit of the secondary star in the eclipsing binary. Like just like it, only lower in amplitude (smaller in radius).

If the transit looks identical, only lower in amplitude, it suggests that it is taking an extremely similar chord across the primary star, at the same speed, with no difference in inclination. How could that be? Well if they are moving at the same speed on the same path, maybe we have a 1:1 resonance, like a Trojan? If so, there are so many cool things about this system. It was an exciting group meeting, to be sure.

submitted!

2024-03-16T16:28:00.001-04:00

OMG I actually just submitted an actual paper, with me as first author. I submitted to the AAS Journals, with a preference for The Astronomical Journal. I don't write all that many first-author papers, so I am stoked about this. If you want to read it: It should come out on arXiv within days, or if you want to type pdflatex a few times, it is available at this GitHub repo. It is about how to combine many shifted images into one combined, mean image.

IAIFI Symposium, day two

2024-03-15T23:14:00.001-04:00

Today was day two of a meeting on generative AI in physics, hosted by MIT. My favorite talks today were by Song Han (MIT) and Thea Aarestad (ETH), both of whom are working on making ML systems run ultra-fast on extremely limited hardware. Themes were: Work at low precision. Even 4-bit number representations! Radical. And bandwidth is way more expensive than compute: Never move data, latents, or weights to new hardware; work as locally as you can. They both showed amazing performance on terrible, tiny hardware. In addition, Han makes really cute 3d-printed devices! A conversation at the end that didn't quite happen is about how Aarestad's work might benefit from equivariant methods: Her application area is triggers in the CMS device at the LHC; her symmetry group is the Lorentz group (and permutations and etc). The day started with me on a panel in which my co-panelists said absolutely unhhinged things about the future of physics and artificial intelligence. I learned that many people think we are only years away from having independently operating, fully functional aritificial physicists that are more capable than we are.

IAIFI Symposium, day one

2024-03-14T23:07:00.001-04:00

Today was the first day of a two-day symposium on the impact of Generative AI in physics. It is hosted by IAIFI and A3D3, two interdisciplinary and inter-institutional entities working on things related to machine learning. I really enjoyed the content today. One example was Anna Scaife (Manchester) telling us that all the different methods they have used for uncertainty quantification in astronomy-meets-ML contexts give different and inconsistent answers. It is very hard to know your uncertainty when you are doing ML. Another example was Simon Batzner (DeepMind) explaining that equivariant methods were absolutely required for the materials-design projects at DeepMind, and that introducing the equivariance absolutely did not bork optimization (as many believe it will). Those materials-design projects have been ridiculously successful. He said the amusing thing “Machine learning is IID, science is OOD”. I couldn't agree more. In a panel at the end of the day I learned that learned ML controllers now beat hand-built controllers in some robotics applications. That's interesting and surprising.

The Cannon and El Cañon

2024-03-12T17:45:00.002-04:00

At the end of the day I got a bit of quality time in with Danny Horta (Flatiron) and Adrian Price-Whelan (Flatiron), who have just (actually just before I met with them) created a new implementation of The Cannon (the data-driven model of stellar photospheres originally created by Melissa Ness and me back in 2014/2015). Why!? Not because the world needs another implementation. We are building a new implementation because we plan to extend out to El Cañon, which will extend the probabilistic model into the label domain: It will properly generate or treat noisy and missing labels. That will permit us to learn latent labels, and de-noise noisy labels.