What’s the tea, sis?

A few years ago, a colleague and I were working on implementing the multivariate student-t distribution as a part of a new model. We were looking to implement the non-centered parameterization. Luckily enough, Andre Pfeuffer had shared the following solution on the Stan forums that rewrites the multivariate student-t in terms of inverse Chi-Square distribution:

data {
  int<lower=1> p;
  vector[p] mu;
  cholesky_factor_cov[p,p] L;
  real<lower=0> nu;
}
parameters {
  vector[p] z;
  real<lower=0> u;
}
transformed parameters {
  vector[p] x;
  x = mu + sqrt(nu / u) * (L * z); // distributed multi_student_t
}
model {
  target += normal_lpdf(z | 0, 1);
  target += chi_square_lpdf(u | nu);
}

This works well! But! This parameterization requires that you pass in the degrees of freedom, $ν$ , as data.¹ If you instead want to estimate $ν$ as a part of fitting the model, you’ll need an alternative parameterization.

¹ You could technically simply model $ν$ as a parameter with this specification, but then you end up with a centered prior over $u$ , which defeats the purpose of implementing a non-centered parameterization.

Luckily enough for you, dear reader, I’ve done this work for you. If you want to skip the derivation and get straight to estimating, you can plop this function into your Stan model. What follows below is the derivation that I shared with the Stan forums.

real multivariate_t_scale_lpdf(vector x,
                               real nu) {
    int N = size(x);
    vector[N] xx = nu/(x^2);
    real lp = 0.0;
    for (n in 1:N) {
        lp += log(2) + log(nu) + chi_square_lpdf(xx[n] | nu) - 3*log(x[n]);
    }
    return lp;
}

Deriving a multivariate student-t

The goal is to replace sqrt(nu / u) in Andre’s original solution with a new scale parameter, $x$ , and derive some density function for $x$ that takes $ν$ as an input. From some tinkering, I found that the quantile function for the outcome of interest is

$x = \sqrt{\frac{ν}{Q (1 - p, ν)}}$

where $Q$ is the quantile function of the Chi-Square distribution.

I didn’t exactly derive this quantile function robustly — it was a game of “guess and check.” If I simulate the outcome of interest, then compare the quantile distribution of the simulated values against my derived quantile distribution, the results line up well enough for me to say that this is the right function.

Code

library(tidyverse)
library(riekelib)

# simulate outcome w/df = 7
nu <- 7
raw <- sqrt(nu/rchisq(1e4, nu))

# quantiles for checking simulated results against empirical
sim <- 
  tibble(p = seq(from = 0, to = 1, length.out = 100),
         q = quantile(raw, probs = seq(from = 0, to = 1, length.out = 100)))

# check the quantile function
tibble(p = seq(from = 0, to = 1, length.out = 1000)) %>%
  mutate(q = sqrt(nu/qchisq(1 - p, nu))) %>%
  ggplot(aes(x = p,
             y = q)) + 
  geom_line() +
  geom_point(data = sim,
             color = "royalblue") +
  theme_rieke()

We want to get $p$ on its own so that we can eventually derive a density, $d$ . Some rearranging yields the following:

$Q (1 - p, ν) = \frac{ν}{x^{2}}$

Setting $a = \frac{ν}{x^{2}}$ and recalling that the quantile and cumulative functions are inverses lets us solve for $p$ explicitly. Here, $C$ is the cumulative distribution function for the Chi-Square distribution — this gives us a cumulative distribution function for $x$ .

$\begin{aligned} C (a, ν) & = 1 - p \\ p & = 1 - C (a, ν) \end{aligned}$

The density function is just the derivative of the CDF, so by virtue of the chain rule, we get the following if we take the derivative of our CDF with respect to $x$ :

$d = - D (a, ν) \times a^{'}$

where $D$ is the density function of the Chi-Square distribution and $a^{'}$ is the derivative of $a$ with respect to $x$ .

Sub in $\frac{ν}{x^{2}}$ for our temporary variable, $a$ , finish out the derivative, do some simplification, and voila! A density function for the multivariate student-t scale drops out!

$d = D (\frac{ν}{x^{2}}, ν) (\frac{ν}{x^{3}})$ Comparing the empirical density of the simulated values against the derived density function again yields an overlap that is, to me, good enough to call correct.

Code

# simulate outcome w/df = 7
nu <- 7
raw <- sqrt(nu/rchisq(1e4, nu))

tibble(x = seq(from = 0, to = 6, length.out = 1000)) %>%
  mutate(d = dchisq(nu/(x^2), nu) * (2*nu)/(x^3)) %>%
  ggplot(aes(x = x)) + 
  geom_line(aes(y = d)) +
  geom_density(data = tibble(x = raw),
               color = "royalblue") +
  theme_rieke()

Citation

BibTeX citation:

@online{rieke2026,
  author = {Rieke, Mark},
  title = {What’s the Tea, Sis?},
  date = {2026-02-07},
  url = {https://www.thedatadiary.net/posts/2026-02-07-multivariate-student-t/},
  langid = {en}
}

For attribution, please cite this work as:

Rieke, Mark. 2026. “What’s the Tea, Sis?” February 7, 2026. https://www.thedatadiary.net/posts/2026-02-07-multivariate-student-t/.

Deriving a multivariate student-t

Citation

Stay in touch

Support my work with a coffee