p-uniform* is a three-parameter selection model

meta-analysis
selective reporting
estimation
Author

James E. Pustejovsky

Published

March 8, 2026

Robbie van Aert and Marcel van Assen have a new article out on a method for estimating average effect size in meta-analyses subject to publication bias, called p-uniform*. The method has been around for a while (the first version of their paper is from 2018, ) and has been cited hundreds of times. The recent appearance of the peer-reviewed article prompted me to take a closer look.

The authors frame the p-uniform* method as a generalization of an earlier proposal of theirs, called p-uniform, which was limited in that it applies only to meta-analytic data where all effect size estimates are statistically significant at the conventional α=.05 level. The new method improves upon p-uniform by using all of the data (including effect size estimates that are not statistically significant) and by allowing for heterogeneity in the distribution of effect size parameters from different samples. The authors contrast p-uniform* with the well-known three-parameter selection model (3PSM), which is a special case of the step-function selection model introduced by Hedges (). They note that both methods can be understood as selection models, in that they are based upon explicit assumptions about the process that determines whether an effect size estimate from a study is reported and available for meta-analysis. Further, they argue that the new p-uniform* method is preferable to the 3PSM because it does not involve estimation of a selection weight parameter, and thus is more parsimonious:

P-uniform* has the advantage over 3PSM that the weight in the selection model does not need to be estimated. These weights are known to be imprecisely estimated in more complex selection models than the selection model of 3PSM where multiple weights need to be estimated (Hedges & Vevea, 1996, 2005; Vevea & Woods, 2005). P-uniform* only assumes that these weights are the same for the statistically significant and the same for the nonsignificant primary studies’ effect sizes, without requiring estimation of the weights. This makes p-uniform* a more parsimonious model than 3PSM with similar statistical properties as shown in our simulation studies ().

In this post, I offer a different interpretation of p-uniform*. Rather than being a more parsimonious model than the 3PSM, I think it is useful to understand p-uniform* as a different estimator of the average effect size and heterogeneity parameters in the 3PSM. I will demonstrate that the maximum likelihood estimator of p-uniform* is exactly equivalent to the maximum likelihood estimator of 3PSM under the (admittedly artificial) condition that all effect sizes in the meta-analytic data have the same sampling variance. Outside of this condition, the methods are not exactly equivalent but will tend to produce very similar point estimates of average effect size and between-study heterogeneity because both are based on unbiased estimating equations. However, confidence intervals produced by each method will differ except when based on the same method of construction and when all effect sizes have the same sampling variance.

Notation

My discussion is necessarily going to get a bit technical. Consider a meta-analysis that includes k independent effect size estimates, where ti and σi denote the observed effect size estimate and corresponding standard error from study i=1,...,k. Let’s say that statistical significance is defined based on the level 2×α (which will usually be .05, so α=.025). I will assume that the effect sizes are defined so that positive effects are what would be theoretically expected, and I will call an effect size estimate affirmative if it is statistically significant and in the expected direction. Let ai be an indicator for whether effect size estimate i is affirmative, so ai=I(tiσi>Φ1(1α)), where Φ1() is the standard normal quantile function. I will also use Φ() to denote the cumulative function and ϕ() to denote the density function of the standard normal distribution.

The three-parameter selection model

Before considering how p-uniform* relates to the 3PSM, let me explain the latter model. The three-parameter selection model makes two sets of assumptions. First, it is assumed that if we could observe every effect size estimate ever produced, then those effect size estimates would follow a conventional random effects model with mean μ and between-study standard deviation τ. Second, it is assumed that the probability of observing a given estimate depends only on its sign and statistical significance. Specifically, the probability of observing a non-affirmative estimate differs from the probability of observing an affirmative estimate by a factor of λ>0. If λ<1, then there is selection in favor of affirmative (statistically significant) results, which produces upward bias in average effect size estimates from conventional meta-analytic methods. These assumptions are essentially identical to those of p-uniform*.

Under these assumptions, the distribution of an observed effect size estimate is what I call a “piece-wise normal” distribution, with density (1)dT(T=ti,A=ai|σi)=λ1ai1(1λ)βi×1τ2+σi2ϕ(tμτ2+σi2), where βi=Φ(σiΦ1(1α)μτ2+σi2). Note that βi depends on μ and τ but not on λ. From the density, the log-likelihood of the 3PSM given the observed effect size estimates is (2)l(μ,τ,λ)=i=1klogdT(T=ti,A=ai|σi)=12i=1k((tiμ)2τ2+σi2+log(τ2+σi2))+kNlogλi=1klog(1(1λ)βi) where kN=i=1k(1ai) is the number of non-affirmative effect size estimates observed. The maximum likelihood estimators μ^, τ^, λ^ are the values that maximize l(μ,τ,λ).

One way to compute maximum likelihood estimators is by profiling in one or more of the variables. For instance, we can consider the profile log-likelihood of μ and τ, which is the log-likelihood function for given values of μ and τ, but maximized over λ. With profiling, the maximum likelihood estimator of λ is treated as a function of μ and λ, as λ^(μ,τ)=argmaxλ>0 l(μ,τ,λ). The profile log-likelihood is then (3)lP(μ,τ)=l(μ,τ,λ^(μ,τ)) and the maximum likelihood estimators μ^ and τ^ are the values that maximize lP(μ,τ), with λ^=λ^(μ^,τ^). The end result is the same as maximizing the whole log-likelihood all at once—it’s just that in some cases, it’s easier to maximize in pieces (especially if there is a simple way to compute λ^(μ,τ)).

For the three-parameter selection model, the maximum likelihood estimator of λ can also be defined as the solution to the score equation (the derivative of l(μ,τ,λ) with respect to λ) (4)kA=i=1k1βi1(1λ)βi. The quantity on the left is the observed number of affirmative effects, equal to kA=kkN=i=1kai. The quantity on the right is the expected number of affirmative results in the observed sample based on the three-parameter selection model. does not have a closed-form solution except in some simple special cases, but it can be solved readily with numerical algorithms.

p-uniform*

Now let me consider van Aert and van Assen’s proposed method, p-uniform*. The method is an estimator of μ and τ, but defined by a different objective function that involves truncated normal distributions (Equation 3 of ). For the affirmative effects, the objective function involves the density of each effect size estimate, given that it is affirmative; for the non-affirmative effects, it involves the density of each effect size estimate, given that it is not affirmative. Taking the log of their objective function (Equation 4 of ) gives (5)u(μ,τ)=12i=1k((tiμ)2τ2+σi2+log(τ2+σi2))i=1k[(1ai)logβi+ailog(1βi)]. The p-uniform estimators μ¨ and τ¨ are the quantities that maximize u(μ,τ).

The objective function u(μ,τ) can be derived directly from the three-parameter selection model. The truncated densities are the conditional densities of the effect size estimates, given their affirmative or non-affirmative status. Thus, u(μ,τ)=i=1klogdT(T=ti|A=ai,σi), whereas the 3PSM log-likelihood uses the unconditional density of the observed effect size estimates. The p-uniform* method is essentially the maximum likelihood estimator of μ and τ, conditional on whether each of the effects are affirmative (i.e., conditional on A1=a1,...,Ak=ak. Conditioning removes all of the information in the likelihood that relates to the selection parameter λ, so this parameter does not appear in u(μ,τ).

Using conditional distributions for inference is warranted when the conditioning information is an ancillary statistic that carries no information about the model parameters (). However, the statistical significance of the effect size estimates are not ancillary for the average effect size and heterogeneity parameters of the 3PSM. van Aert & van Assen () offer no justification for basing inferences about these parameters on the conditional distribution. It is not obvious to me that confidence intervals based on the conditional distribution would be valid—even asymptotically as k increases (except under conditions where the p-uniform* objective function coincides with the 3PSM log-likelihood).

Equal standard errors

The main simulations reported in van Aert & van Assen () are based on a data-generating process where all the effect size estimates have equal standard errors, so σ1=σ2==σ. This is a fairly artificial scenario that does not arise frequently in real meta-analysis, as far as I know. However, it is nonetheless a useful case to consider because it makes the math much easier and cleaner.

In this scenario with equal standard errors, the estimators of μ and τ from p-uniform* are exactly equal to the corresponding maximum likelihood estimators from the 3PSM. To see this, observe that when all σi are equal, then all the βi’s become constant, with βi=β=Φ(σΦ1(1α)μτ2+σ2) for i=1,...,k. The maximum likelihood estimator of λ () simplifies to λ^(μ,τ)=kN(1β)kAβ. The difference between the profile log-likelihood of the 3PSM () and the objective function of p-uniform* () can then be written as (6)lP(μ,τ)u(μ,τ)=kNlogλ^klog(1β+λ^β)+kNlog(β)+kAlog(1β)=kNlogkNkNlogkA+kNlog(1β)kNlog(β)klogkklog(1β)+klog(kA)+kNlog(β)+kAlog(1β)=kNlogkN+kAlogkAklogk. The difference between the objective functions of the 3PSM and p-uniform* is a constant that does not depend on the model parameters. Consequently, the maxima of the functions must be equal, and the profile likelihood confidence intervals for μ and τ must be identical.

Several implications follow from this. First, the equivalence of the p-uniform* and 3PSM estimators explains why the performance of the point estimators for μ is so similar (in terms of bias and root mean-squared error) in the main simulations of van Aert & van Assen () Second, the equivalence of the estimators runs counter to van Aert and van Assen’s argument that p-uniform* is more parsimonious and has fewer limitations. If the estimators are the same, then clearly one cannot have advantages over the other. Third, van Aert & van Assen () reported differences between p-uniform* and 3PSM in the performance of the confidence intervals for μ in the main simulations. However, given the equivalence of the objective functions, differences in the behavior of the confidence intervals must be attributable to differences in how they are constructed (profile likelihood confidence intervals for p-uniform* but Wald-type confidence intervals for the 3PSM), rather than differences between the models.

Varying standard errors

Except when the standard errors are all equal, the objective function of p-uniform* is not exactly identical to the profile log-likelihood of the 3PSM. However, objective function of p-uniform* produces unbiased estimating equations for μ and τ. To see this, observe that another way to express the p-uniform* estimator is as the solution to the system of equations that set the derivative of u(μ,τ) with respect to the parameters equal to zero. For p-uniform*, these derivatives are (7)uμ=i=1k[(Tiμ)τ2+σi2+ϕ(ci)τ2+σi2(1βiAiβi(1βi))]uτ2=12i=1k[((Tiμ)2(τ2+σi2)(τ2+σi2)2)+ciϕ(ci)(τ2+σi2)(1βiAiβi(1βi))] where ci=σiΦ1(1α)μτ2+σi2.

My previous post gave expressions for the mean and variance of an observed effect size estimate under the step-function selection model. For the case of a single-step model, these results imply that E(Tiμ)=τ2+σi2×(1λ)ϕ(ci)1(1λ)βiE[(Tiμ)2]=(τ2+σi2)×(1+(1λ)ciϕ(ci)1(1λ)βi). Furthermore, E(Ai)=1βi1(1λ)βi. Substituting these expressions into the expectations of leads to E(uμ)=i=1k[E(Tiμ)τ2+σi2+ϕ(ci)τ2+σi2(1βiE(Ai)βi(1βi))]=i=1kϕ(ci)τ2+σi2[1λ1(1λ)βi1λ(1(1λ)βi)]=0 and E(uτ2)=12i=1k[(E[(Tiμ)2](τ2+σi2)(τ2+σi2)2)+ciϕ(ci)(τ2+σi2)(1βiE(Ai)βi(1βi))]=12i=1k1(τ2+σi2)[1+(1λ)ciϕ(ci)1(1λ)βi1ciϕ(ci)(1λ)(1(1λ)βi)]=0. Thus, the p-uniform* estimator is based on unbiased estimating equations for μ and τ.

How different is the p-uniform* estimator from the 3PSM maximum likelihood estimator in this case of unequal variances? The degree of potential discrepancy depends on how different the objective functions are. Allowing for unequal variances (8)lP(μ,τ)u(μ,τ)=kNlogλ^i=1klog(1(1λ^)βi)+i=1k(1Ai)log(βi)+i=1kAilog(1βi)=i=1k(1Ai)log(λ^βi)+i=1kAilog(1βi)i=1klog(1(1λ^)βi)=i=1k(1Ai)log(1π^i)+i=1kAilog(π^i), where π^i=1βi1(1βi)λ^ is the probability of observing an affirmative result under the three-parameter selection model. The estimated selection parameter is defined by the constraint that kA=i=1kπ^i. Letting ω¯N=1kNi=1k(1Ai)log(1π^i) and ω¯A=1kAi=1kAilog(π^i), can be written as lP(μ,τ)lU(μ,τ)=kNω¯A+kAω¯A, with the constraint that kA(1π¯A)=kNπ¯N, where π¯N and π¯A are the average probabilities among the non-affirmative and affirmative effects.

van Aert and van Assen argue that p-uniform* is advantageous because it does not entail estimation of the selection weight λ. However, one could also take this as a deficiency, insofar as the p-uniform* estimator does not yield information about the strength of selection—a substantively interesting parameter that quantifies the severity of publication bias. This deficiency could be remedied by borrowing the estimator of λ from the step-function model, as λ¨=λ^(μ¨,τ¨). This estimator can be computed by solving for λ with βi evaluated at μ¨ and τ¨.

Like the 3PSM MLE, p-uniform* can be defined by a set of unbiased estimating equations. Although this property is not sufficient to ensure that p-uniform* estimator will work well, it does suggest that p-uniform* might have biases similar to those of the 3PSM MLE. Furthermore, sandwich variance estimators or non-parametric bootstrapping might be useful for assessing uncertainty in model parameter estimates generated with p-uniform* (as we examine for MLEs in ). However, it seems unlikely that p-uniform* would be more efficient than the MLE of the 3PSM under the model’s assumptions—maximum likelihood estimation is usually pretty hard to improve upon, at least for low-dimensional problems. Does p-uniform* have some robustness advantage that maximum likelihood lacks? In my view, there needs to be more thorough examination of the properties of p-uniform* (be it through theoretical analysis or simulation under varied, realistic data-generating processes) before it can be recommended for conducting meta-analyses of real data.

Back to top

References

Ghosh, M., Reid, N., & Fraser, D. A. S. (2010). Ancillary statistics: A review. Statistica Sinica, 20(4), 1309–1332. http://www.jstor.org/stable/24309506
Hedges, L. V. (1992). Modeling publication selection effects in meta-analysis. Statistical Science, 7(2), 246–255. https://doi.org/10.1214/ss/1177011364
Pustejovsky, J. E., Citkowicz, M., & Joshi, M. (2025). Estimation and Inference for Step-Function Selection Models in Meta-Analysis with Dependent Effects. https://doi.org/10.31222/osf.io/qg5x6_v1
van Aert, R. C. M., & van Assen, M. A. L. M. (2018, October 2). Correcting for publication bias in a meta-analysis with the p-uniform* method. https://doi.org/10.31222/osf.io/zqjr9
van Aert, R. C. M., & van Assen, M. A. L. M. (2026). Correcting for publication bias in a meta-analysis with the p-uniform* method. Psychonomic Bulletin & Review, 33(3), 102. https://doi.org/10.3758/s13423-025-02812-4

Footnotes

  1. The p-uniform* method does not explicitly define the λ, but it is implicit in the assumption that the “weights are the same for the statistically significant and the same for the nonsignificant primary studies’ effect sizes.” Although the weights are constant conditional on whether the effects are affirmative, they are allowed to differ between affirmative and non-affirmative effect sizes, and λ is the ratio of the weights for non-affirmative effects to the weights for affirmative ones.↩︎