2022-03-21

six-quark state?

Today was a great blackboard talk at CCPP by Glennys Farrar (NYU) about a possible six-quark state in QCD. She has been thinking about this for a decade or so, because it might have implications for dark matter and issues in QCD. Today she focused on the latter: There are terms in the g−2 calculation for the muon that can be estimated either with lattice QCD or by integrating some observed branching ratios from experiment. These two methods disagree, and the observational method disagrees (more strongly) with the g−2 measurement. But Farrar shows that if there is a long-lived 6-quark state, it can potentially affect the QCD calculation (implicitly) but would be evaded by the branching-ratio measurements (because it would evade all event triggers). Her model requires some good luck with QCD parameters and bound states, but if that luck holds, she can pull dark matter into the standard model and solve some precision-measurement issues! After her talk we discussed a bit about just how hard lattice QCD is. It's absurd!

2022-03-20

coordinate freedom

Bernhard Schölkopf (MPI-IS) and I spent time drinking coffee this weekend. Among the many subjects we discussed was language around invariance, equivariance, covariance, coordinate freedom, and symmetry. These words! I have strong opinions, as my loyal reader might know. But during the conversation I had an epiphany in which I understood why Einstein called the symmetry of general relativity “general covariance”: He was probably keying off the mathematicians, who used covariance back then the way we use equivariance now.

I don't like the word “equivariance” at all! Why do we want to write the laws of physics in a rotationally-symmetric (or rotationally equivariant or orientation-free) way? There are two completely different reasons! One is that the laws of physics are observed to be rotationally invariant (which leads to conservation of angular momentum and so on). The other is the theoretical idea that the laws of physics can't depend on investigator choices about coordinates. These are completely different, and the latter is extremely strong. We debated whether there was something to write about all this somewhere.

2022-03-18

how to build an astrophysics program?

In group meeting today John Forbes (Flatiron) asked an interesting question: How do you build a good astrophysics program at a small place? He's thinking about this because he is on the job market. My own answer is a bit weird: It is to work with people you trust, since when a place is small, and resources are shared, trust is paramount. But this conversation was interesting, and it was also an illustration of an amusing fact, which is that many of the most interesting discussion topics at the Astronomical Data Group meeting at Flatiron are often not about astronomical data analysis!

2022-03-17

causality and time ordering

I had a nice chat with David Blei (Columbia) at the end of the day about the question of whether causal inference (a subject in statistics) can be re-phrased in terms of making predictions about the time-ordering of events. He was not extremely positive about that project! But we talked about the causal-inference approaches. I don't like many of them! Because many of them somehow assume that it is possible to intervene on the situation, and how can you intervene on a unitary system (like, say, the Universe)? Does causality not exist in physics? Does the force cause the acceleration or does the acceleration cause the force? There isn't an answer to that in physics.

2022-03-15

writing about radial-velocity precision

Today I went through, with Megan Bedell (Flatiron), the paper we code-name EPRV, which is about the precision with which you can measure a (change in a) radial velocity using spectroscopy. One of the points we discussed is how the results depend on stellar temperature, spectrograph resolution, and wavelength coverage. There is no simple expression of course, because stars vary in such complicated ways with temperature, and the line lists are immense. So we end up having methods that are useful, but not simple back-of-envelope anything. Another point we discussed is that the assumption that the star doesn't vary is a very wrong assumption, and the whole point of the whole literature these days! We care about this point and want to address it in the next work we do here. But how to discuss future directions in present paper? I don't like promising things.

We also discussed our writing styles, which are hella different. I think that's good in a collaboration, of course!

2022-03-14

me reading?

As my collaborators and friends know, if there is one thing I hate to do, it is spend all day reading the literature. I love and respect the literature! But don't make me actually read it. But today I sucked it up and read some 20-ish papers about characterizing dark-matter halo shapes, to find out if the coordinate-free shape measurements that Kate Storey-Fisher (NYU) and I are measuring are new. I think they are! In almost every paper I read, the word “shape” translated to eigenvalues of the positional variance tensor, or maybe ratios of those. Am I wrong?

2022-03-12

what is a transmission function?

Mike Blanton (NYU) and I agree and disagree on almost everything about fundamental astronomy. As my loyal reader knows, I am writing something on how apparent magnitudes work, and also absolute, bolometric, reddening-corrected, and so on. The apparent magnitude of star depends (among other things) on a filter bandpass or a transmission function. There are two possible definitions of this. One is the fraction of light (as a function of wavelength or frequency) that makes it through the system, from the top of the atmosphere to stimulating the detector. The other is the mean contribution to the total counts read out by the detector of a photon (of a particular wavelength or frequency) impinging on the top of the atmosphere. These might sound similar—they are identical when your detector is a photon counter. But they aren't identical if your detector is, say, a bolometer. I struggled with how to simply communicate all this today.

2022-03-10

shapes of dark-matter halos

I had a very very long meeting today with Kate Storey-Fisher (NYU) in which we talked through every aspect of our current project, at every level of abstraction. It was great! And at the end of it, I had a way simpler description of our project than I think I could have articulated even yesterday: We are asking whether high-order, coordinate-free measurements of dark-matter halo shapes can predict galaxy contents.

2022-03-09

are data-driven approaches to RV measurement biased?

Matt Daunt (NYU) is re-building our wobble code for measuring stellar radial velocities without any stellar or tellurics model. He is finding that it is slightly biased towards smaller radial velocity amplitudes than what we inject into fake data. This also mirrors things that Bedell (Flatiron) and I have seen in various experiments. I think there is something going on with spectral edges: At the edge of the observed spectral domain, some observations have lines shifted into and out of the observations. The mean spectrum obtained from those measurements isn't necessarily capturing all of this fairly. Or at least it has to be handled carefully. Are we doing this right? Experiments we have suggest that we don't have this quite right yet.

2022-03-07

I'm wrong about HORIZONS

The JPL HORIZONS system is amazing! You can compute the position of anything in the Solar System, at any time. With Weichi Yao (NYU) and others, I have been looking at Halley's Comet, with the thought of making a machine-learning benchmark data set (this is an idea from Soledad Villar, JHU). When we look up Halley in HORIZONS, we find many Halleys, not just one. I hypothesized that this is because there are different solutions for Halley on different apparitions. But somehow I am sort-of wrong: That's true for most of the Halleys in the system. But then today in our meeting Yao showed that there's one that seems to do well at all epochs. Huh? Anyway, HORIZONS is better on content than documentation!

2022-03-04

Buckingham pi theorem is bad?

The Buckingham Pi theorem is about making physics problems dimensionless. It says that if you have a law of physics that you can manipulate into the form f(inputs) = 0, you can re-write that law with fewer, dimensionless inputs. It's interesting, and important, and it motivated the work that Soledad Villar (JHU) and I are doing on making machine-learning methods obey exact dimensional scalings and unit conversion symmetries.

However, I am not sure the Buckingham Pi theorem works (or is useful) when the function f() is a vector-valued or tensor-valued function with vector-valued and tensor-valued inputs, as it is, say in Maxwell's equations or the equations of general relativity. Villar and I discussed ways to save Buckingham Pi, but I think the main results might either not be correct at all, or not reduce dimensionality. I got upset about it! But it raises an interesting question: Can Buckingham Pi be saved?

My point is: Physics is full of vectors and tensors and the laws are coordinate-free. If going dimensionless is a good idea, then it should be a good idea for vector and tensor expressions that are coordinate-free!

2022-03-03

how can linear regression be hard?

Maybe I'm known in astronomy for being both a machine-learning developer and a machine-learning skeptic. I hope so! Anyway, I love linear regression, because it has a lot of the power of bigger ML models, but it's easy to implement and to understand. And yet!

Today Kate Storey-Fisher (NYU) and I looked at her code to predict galaxy properties given dark-matter-halo properties in a set of n-body simulations. We are doing very simple regressions but the condition numbers of the matrices are blowing up and some of our answers don't look great. And this is generic: Many linear-regression models are messed up by condition numbers and numerical linear algebra, and it is hard to diagnose, and it is hard to treat. And if linear regression is hard—and hard for us—why do I believe anything that inovolves 42 layers of fully-connected RELU network?

2022-03-02

generative models for quasars

I spent part of the day working with Christina Eilers on her Gaussian process latent-variable model for quasar spectra and physical properties. We re-wrote our title and abstract and went through the math in the paper. It's time to finish this up! We find that we can predict quasar masses with good accuracy (based on held-out data) based on single-epoch, limited-coverage optical spectra. It's sweet. And Eilers has beautiful demonstrations that she can predict unobserved spectral regions, because the model is trained on different quasars at different redshifts with different data. The big problem with this model is that it scales poorly; we can't imagine training on thousands of objects without substantial engineering efforts (and maybe not ever).

2022-03-01

MIT visit

I spent the day at MIT today. I learned a huge amount! One highlight was that Rob Simcoe (MIT) showed me the hardware in his lab, and we discussed trade-offs between software and hardware in instrument design. Another was talking to a great group of graduate students over lunch. My talk was about the ontology and epistemology of machine learning. My slides are here. At dinner, Deepto Chakrabarty (MIT) encouraged me to complete my pedagogical note on bolometric magnitudes and so on.