<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article
  PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD with MathML3 v1.2 20190208//EN" "JATS-journalpublishing1-3-mathml3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ali="http://www.niso.org/schemas/ali/1.0/" article-type="research-article" dtd-version="1.2" xml:lang="en">
<front>
<journal-meta><journal-id journal-id-type="publisher-id">METH</journal-id><journal-id journal-id-type="nlm-ta">Methodology</journal-id>
<journal-title-group>
<journal-title>Methodology</journal-title><abbrev-journal-title abbrev-type="pubmed">Methodology</abbrev-journal-title>
</journal-title-group>
<issn pub-type="ppub">1614-1881</issn>
<issn pub-type="epub">1614-2241</issn>
<publisher><publisher-name>PsychOpen</publisher-name></publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">meth.15665</article-id>
<article-id pub-id-type="doi">10.5964/meth.15665</article-id>
<article-categories>
<subj-group subj-group-type="heading"><subject>Original Article</subject></subj-group>

<subj-group subj-group-type="badge">
<subject>Data</subject>
<subject>Code</subject>
<subject>Materials</subject>
<subject>Preregistration</subject>
</subj-group>

</article-categories>
	<title-group>
		<article-title>Multiverse Analysis for Dynamic Network Models: Investigating the Inﬂuence of Plausible Alternative Modeling Choices</article-title>
		<alt-title alt-title-type="right-running">Network Multiverse</alt-title>
		<alt-title specific-use="APA-reference-style" xml:lang="en">Multiverse analysis for dynamic network models: Investigating the inﬂuence of plausible alternative modeling choices</alt-title>
	</title-group>
	
	
	
	
	
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid" authenticated="false">https://orcid.org/0000-0002-9558-4648</contrib-id><name name-style="western"><surname>Siepe</surname><given-names>Björn S.</given-names></name><xref ref-type="corresp" rid="cor1">*</xref><xref ref-type="aff" rid="aff1">1</xref></contrib>
	<contrib id="author-2" contrib-type="author"><contrib-id contrib-id-type="orcid" authenticated="false">https://orcid.org/0000-0002-6302-9252</contrib-id><name name-style="western"><surname>Heck</surname><given-names>Daniel W.</given-names></name><xref ref-type="aff" rid="aff1">1</xref></contrib>
<contrib contrib-type="editor">
<name>
	<surname>Estrada</surname>
	<given-names>Eduardo</given-names>
</name>
<xref ref-type="aff" rid="aff2"/>
</contrib>
	<aff id="aff1"><label>1</label><institution content-type="dept">Psychological Methods Lab, Department of Psychology</institution>, <institution>University of Marburg</institution>, <addr-line><city>Marburg</city></addr-line>, <country country="DE">Germany</country></aff>
	<aff id="aff2">Autonomous University of Madrid, Madrid, <country>Spain</country></aff>
</contrib-group>
<author-notes>
	<corresp id="cor1"><label>*</label>Psychological Methods Lab, Department of Psychology, University of Marburg, Gutenbergstraße 18, 35032 Marburg, Germany. <email xlink:href="bjoern.siepe@uni-marburg.de">bjoern.siepe@uni-marburg.de</email></corresp>
</author-notes>	
<pub-date pub-type="epub"><day>30</day><month>06</month><year>2025</year></pub-date>
<pub-date pub-type="collection" publication-format="electronic"><year>2025</year></pub-date>
<volume>21</volume>
<issue>2</issue>
<fpage>123</fpage>
<lpage>143</lpage>
<history>
<date date-type="received">
<day>23</day>
<month>09</month>
<year>2024</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>03</month>
<year>2025</year>
</date>
</history>
<permissions><copyright-year>2025</copyright-year><copyright-holder>Siepe &amp; Heck</copyright-holder><license license-type="open-access" specific-use="CC BY 4.0" xlink:href="https://creativecommons.org/licenses/by/4.0/"><ali:license_ref>https://creativecommons.org/licenses/by/4.0/</ali:license_ref><license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution (CC BY) 4.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p></license></permissions>
<abstract>
<p>Specifying complex time series models typically allows for a wide range of plausible analysis strategies. However, researchers typically perform and report only a single, preferred analysis while ignoring alternatives that could yield different conclusions. As a remedy, we propose multiverse analysis to investigate the robustness of dynamic network analysis to arbitrary modeling choices. We focus on group iterative multiple model estimation (GIMME), a highly data-driven approach, and re-analyze two datasets (combined <italic>n</italic> = 199). We vary seven modeling parameters, resulting in 3,888 fitted models. Group-level and to a lesser extent subgroup-level results were mostly stable. Individual-level estimates were more heterogeneous, with some decisions strongly influencing results and conclusions. The robustness of GIMME to alternative modeling choices depends on the level of analysis. For some individuals, results may differ strongly even when changing the algorithm only slightly. Multiverse analysis is a valuable tool for checking the robustness of results from time series models.</p>
</abstract>
<kwd-group kwd-group-type="author"><kwd>time series</kwd><kwd>network analysis</kwd><kwd>multiverse</kwd><kwd>heterogeneity</kwd><kwd>GIMME</kwd></kwd-group>

</article-meta>
</front>
<body>
	<sec sec-type="intro" id="intro"><title/>
<p id="S1.PX1.P1">There has been a shift towards the use of intensive longitudinal data in the social sciences (<xref ref-type="bibr" rid="Xbib28">Wright &amp; Zimmermann, 2019</xref>). This includes data collection through repeated daily experience sampling questionnaires or passive sensors such as heart-rate monitors or smartphones (<xref ref-type="bibr" rid="Xbib28">Wright &amp; Zimmermann, 2019</xref>). Such data may contribute to a better understanding of the dynamics of intra-individual psychological processes and inter-individual differences therein. There exists a wide variety of different time series models for such types of data (for an overview, see <xref ref-type="bibr" rid="Xbib14">Jordan et al., 2020</xref>). Such time series models are often visualized and interpreted as dynamic networks of mutually inﬂuencing variables (<xref ref-type="bibr" rid="Xbib3">Bringmann et al., 2022</xref>). It is well known that model choice and specification substantially inﬂuence the results of such analyses. However, the statistical literature on the comparative performance of different methods is inconclusive, and hence, there is often no consensus on which modeling approach is preferable in a given application. As a result, there is considerable heterogeneity in analysis techniques used to investigate similar questions.</p>
<p id="S1.PX1.P2">Amongst the various alternatives, group iterative multiple model estimation (GIMME, <xref ref-type="bibr" rid="Xbib9">Gates &amp; Molenaar, 2012</xref>) has emerged as a popular approach. It combines the benefits of personalized models with the identification of shared relationships at the (sub-)group level (<xref ref-type="bibr" rid="Xbib27">Wright &amp; Woods, 2020</xref>). GIMME performs a data-driven search for directed temporal (across time points) and contemporaneous associations (at the same time point). The approach estimates separate person-specific models while searching for associations between variables shared by a majority of the full sample or subgroups thereof (<xref ref-type="bibr" rid="Xbib1">Beltz &amp; Gates, 2017</xref>). The algorithm has several strengths, including its good performance in simulation studies and its use of group information to improve individual model fit (e.g., <xref ref-type="bibr" rid="Xbib11">Hoekstra et al., 2022</xref>, <xref ref-type="bibr" rid="Xbib16">Lane et al., 2019</xref>, <xref ref-type="bibr" rid="Xbib19">Nestler &amp; Humberg, 2021</xref>). In the following, we interpret fitted GIMME models as dynamic networks, where <italic>nodes</italic> represent variables and <italic>edges</italic>/<italic>paths</italic> the statistical association between variables. An <italic>edge weight</italic> refers to the strength of this association. A more extensive introduction to dynamic network models is available in <xref ref-type="bibr" rid="Xbib2">Borsboom et al. (2021)</xref> and <xref ref-type="bibr" rid="Xbib3">Bringmann et al. (2022)</xref>.</p>
<p id="S1.PX1.P3">However, even when researchers have settled on a specific modeling approach such as GIMME, they still have ample ﬂexibility along the analysis path, ranging from variable choice, preprocessing, and modeling to the interpretation of results. This is especially true for highly data-driven, iterative algorithms such as GIMME. The conventional workﬂow involves making a certain number of decisions, based on which a single, supposedly most appropriate model is identified and reported. However, focusing on a single analysis disregards other reasonable modeling choices that could have led to meaningfully different conclusions (<xref ref-type="bibr" rid="Xbib24">Steegen et al., 2016</xref>).</p>
<p id="S1.PX1.P4">The idea of repeatedly performing a certain analysis iterating through a range of plausible alternative specifications is the premise of multiverse analysis (<xref ref-type="bibr" rid="Xbib24">Steegen et al., 2016</xref>). The goal of a multiverse analysis is to systematically analyze the uncertainty associated with defensible choices in the modeling process. This aspect would not be explored when performing and reporting only a single analysis (<xref ref-type="bibr" rid="Xbib6">Del Giudice &amp; Gangestad, 2021</xref>). In experience sampling research, multiverse analyses have been used to investigate outcomes such as the inﬂuence of the choice of central tendency measures (<xref ref-type="bibr" rid="Xbib25">Weermeijer et al., 2022</xref>) or measurement ﬂexibility (<xref ref-type="bibr" rid="Xbib5">Dejonckheere et al., 2018</xref>). These studies show that investigating the effects of alternative, often arbitrary analysis decisions is important for assessing the robustness of results.</p>
<p id="S1.PX1.P5">Previous multiverse analyses have usually focused on a single or a few parameters of interest. The stability of a few parameters across different specifications can easily be examined graphically and with inferential statistics (e.g., <xref ref-type="bibr" rid="Xbib10">Hall et al., 2022</xref>, <xref ref-type="bibr" rid="Xbib23">Simonsohn et al., 2020</xref>). Dynamic network analyses do not necessarily provide a single parameter of interest. Instead, a large number of estimates and network summaries have to be interpreted across (sub-)group and individual network structures (<xref ref-type="bibr" rid="Xbib26">Wright et al., 2019</xref>). Therefore, the application of multiverse analyses in this context requires new ways of analyzing results, which we will explain in this manuscript.</p>
<p id="S1.PX1.P6">Time series modeling using data-driven algorithms is a complex process involving multiple arbitrary decisions. By arbitrary, we mean that these choices are neither theoretically motivated, nor based on extensive statistical evidence. For GIMME, this includes thresholds for (sub-)group effects (defined in terms of a certain percentage of individuals) or cutoffs for good model fit, such as the Root Mean Square Error of Approximation (RMSEA), which are explained below. The cutoffs for good model fit are currently set to fixed, mostly arbitrary default values within the algorithm. While these defaults are motivated by cutoffs commonly used in structural equation modeling, the same cutoffs do not necessarily apply to GIMME models. By now, there is abundant literature criticizing the use of generic fit index cutoffs in the psychometrics literature (e.g., <xref ref-type="bibr" rid="Xbib17">McNeish &amp; Wolf, 2023</xref>). This criticism is even more relevant when transferring cutoffs to very different model classes, such as GIMME.</p>
<p id="S1.PX1.P7">A previous simulation study has already indicated that the default choices, which cannot be changed in the standard version of GIMME, may not always be optimal (<xref ref-type="bibr" rid="Xbib19">Nestler &amp; Humberg, 2021</xref>). <xref ref-type="bibr" rid="Xbib19">Nestler and Humberg (2021)</xref> also showed that characteristics of group-level results can have an unexpected effect on individual-level results. Thus, changing cutoffs could have downstream implications for individual-level models that are hard to anticipate due to the iterative nature of the algorithm. Relying on a single, estimated GIMME model as the only basis for drawing substantive conclusions ignores the inherent uncertainty when making such arbitrary modeling choices.</p>
<sec id="s1_1" sec-type="Aims"><title>Aims</title>
<p id="S1.PX1.P8">We propose to use multiverse analysis to study the robustness of psychological time series analysis. We explored a range of different fitting criteria and compared the resulting models to models estimated in previously published empirical studies. We aim to provide answers to questions such as: Had the models been specified slightly differently, would results and the corresponding conclusions substantially change? How can we best quantify and visualize the robustness of a complex model? Among several equally justifiable specifications, which aspects of a specific fitted GIMME model are most robust on a given dataset? In doing so, this manuscript serves as a blueprint and guidance for future multiverse analyses with GIMME.</p></sec></sec>
<sec id="s2" sec-type="Methods"><title>Method</title>
	<p id="S2.PX1.P1">We used R version 4.3.1. (<xref ref-type="bibr" rid="Xbib21">R Core Team, 2023</xref>) and preregistered all analyses. The preregistration (and deviations from it), code, information on the computational environment, all R packages used, and all supplementary materials are available at <xref ref-type="bibr" rid="Xbib22.25">Siepe and Heck (2025a)</xref>.</p>
<sec id="s2_1" sec-type="Data"><title>Data</title>
<sec id="s2_1_1" sec-type="Personality|Dataset"><title>Personality Dataset</title>
<p id="S2.PX1.P2">The first dataset (‘personality dataset’) contains daily diary measures of <inline-formula><mml:math id="math-1" display="inline"><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>94</mml:mn></mml:math></inline-formula> participants (<xref ref-type="bibr" rid="Xbib26">Wright et al., 2019</xref>). Participants had at least one personality disorder diagnosis. Daily diary measures were completed in the evening, including questions about a daily summary of affect, interpersonal behavior, stress, and functioning. Except for functioning (assessed with a single 5-point item), all constructs were measured by multiple items aggregated into sum scores. Participants were only included in the analysis when they provided at least 60 observations. The average time series length of included participants was <inline-formula><mml:math id="math-2" display="inline"><mml:mn>91.48</mml:mn></mml:math></inline-formula> (<italic>SD</italic> = <inline-formula><mml:math id="math-3" display="inline"><mml:mn>9.00</mml:mn></mml:math></inline-formula>). More information can be found in <xref ref-type="bibr" rid="Xbib26">Wright et al. (2019)</xref>.</p></sec>
<sec id="s2_1_2" sec-type="Emotion|Dataset"><title>Emotion Dataset</title>
<p id="S2.PX1.P3">The second dataset (‘emotion dataset’) contains experience sampling data of <inline-formula><mml:math id="math-4" display="inline"><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>105</mml:mn></mml:math></inline-formula> individuals (<xref ref-type="bibr" rid="Xbib15">Kullar et al., 2024</xref>). Participants either had a major depressive or a bipolar disorder, were in remission of a major depressive disorder, or had no history of any mood or anxiety disorder. Smartphone surveys were sent out five times daily for 14 days. In these, participants rated nine momentary emotions on 7-point Likert scales. Further, they stated how long their current emotional state lasted and whether they thought about something else than their current activity. The model by <xref ref-type="bibr" rid="Xbib15">Kullar et al. (2024)</xref> included <italic>time-of-day</italic> as an exogenous predictor to handle time trends. Individuals were excluded when they completed less than 50% of the surveys. The average time series length of included individuals was <inline-formula><mml:math id="math-5" display="inline"><mml:mn>62.31</mml:mn></mml:math></inline-formula> (<italic>SD</italic> = <inline-formula><mml:math id="math-6" display="inline"><mml:mn>8.11</mml:mn></mml:math></inline-formula>). More information can be found in <xref ref-type="bibr" rid="Xbib15">Kullar et al. (2024)</xref>.</p></sec></sec>
<sec id="s2_2" sec-type="Gimme"><title>GIMME</title>
	<p id="S2.PX1.P4">To avoid confusion across the different levels of analyses, we use the following terms: <italic>GIMME model</italic> as the overarching modeling approach, a <italic>specification</italic> as a specific iteration of the multiverse, a <italic>fitted model</italic> as the overall GIMME result within a specific iteration, and an <italic>individual fitted model</italic> as the result of GIMME for a specific individual within an iteration. GIMME is a form of unified structural equation modeling (uSEM) in which each variable is predicted by itself and all other variables at the previous time point (temporal) as well as all other variables at the same time point (contemporaneous). We provide a full description of the GIMME algorithm in our supplementary materials (<xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>).</p>
<p id="S2.PX1.P5">Brieﬂy, the algorithm starts by estimating a null model without any paths for each individual.<xref ref-type="fn" rid="fn-1"><sup>1</sup></xref><fn id="fn-1"><label>1</label>
<p id="S2.PX1.P6">Except for autoregressive paths in default GIMME.</p></fn> Then, group-level associations are searched in an iterative manner using modification indices for each individual. Paths are added at the group level if they significantly improve model fit for a certain proportion of individuals. Subgroups are then searched based on similarities in the individual fitted models. In the resulting subgroups, subgroup paths are added (i.e., freely estimated per individual) and pruned in the same way as for group effects. Finally, for each individual fitted model, paths are added that significantly improve model fit until an ‘excellent’ model fit is achieved at the individual level. Per default, this is defined as two out of four fit indices satisfying certain thresholds: Root Mean Squared Error of Approximation (RMSEA &lt; .05), Standardized Root Mean Square Residual (SRMR &lt; .05), Comparative Fit Index (CFI &gt; .95), Non Normed Fit Index (NNFI &gt; .95; see <xref ref-type="bibr" rid="Xbib12">Hu &amp; Bentler, 1999</xref>, for more information).</p>
	<p id="S2.PX1.P7">Here, we focus on ‘classic’ exploratory GIMME as described in <xref ref-type="bibr" rid="Xbib8">Gates et al. (2017)</xref> to mimic the analyses in the empirical papers from which we obtained our datasets. In the original GIMME package, users can neither modify the threshold values used for each fit index nor the number of fit indices that need to satisfy these thresholds. The modified package that enables these changes is available at <xref ref-type="bibr" rid="Xbib22.80">Siepe and Heck (2025d)</xref>.</p></sec>
<sec id="s2_3" sec-type="Multiverse|Analysis"><title>Multiverse Analysis</title>
	<p id="S2.PX1.P8">We vary seven parameters of the GIMME approach. This includes two parameters adjustable in the original GIMME package (group and subgroup threshold) and five parameters that have so far been defined as fixed (goodness-of-fit criteria). The first two parameters specify the required proportion of individuals at different levels for which a path must significantly improve model fit (default values designated with a superscripted ‘a’, i.e., <sup>a</sup>):

<list list-type="simple">
<list-item>
<p id="S2.PX1.P9">1. Group threshold ∈ {50%, 60%, 75%<sup>a</sup>, 80%}</p>
</list-item>
<list-item>
	<p id="S2.PX1.P10">2. Subgroup threshold ∈ {50%, 60%, 75%<sup>a</sup>, 80%}</p></list-item>
</list></p>
<p id="S2.PX1.P11">Five parameters refer to the fit indices used for model selection:
<list list-type="simple">
<list-item>
<p id="S2.PX1.P12">3. RMSEA cutoff ∈ { .03, .05<sup>a</sup>, .08}</p></list-item>
<list-item>
<p id="S2.PX1.P13">4. SRMR cutoff ∈ { .03, .05<sup>a</sup>, .08}</p></list-item>
<list-item>
	<p id="S2.PX1.P14">5. NNFI cutoff ∈ { .90, .95<sup>a</sup>, .97}</p></list-item>
<list-item>
	<p id="S2.PX1.P15">6. CFI cutoff ∈ { .90, .95<sup>a</sup>, .97}</p></list-item>
<list-item>
	<p id="S2.PX1.P16">7. Fit measures satisfying the cutoffs ∈ {1, 2<sup>a</sup>, 3}</p></list-item>
</list></p>
<p id="S2.PX1.P17">Creating a grid of all possible combinations results in <inline-formula><mml:math id="math-14" display="inline"><mml:msup><mml:mrow><mml:mn>4</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>×</mml:mo><mml:msup><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mn>888</mml:mn></mml:math></inline-formula> possible model specifications for each data set. One of these combinations provides the <italic>reference specification</italic> using the GIMME default settings, except for subgroup thresholds. For the emotion dataset, we used a subgroup threshold of <inline-formula><mml:math id="math-15" display="inline"><mml:mn>51</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> as indicated in <xref ref-type="bibr" rid="Xbib15">Kullar et al. (2024)</xref>. For the personality data set, we used the default GIMME settings (highlighted in bold) as reference specifications, except for the subgroup threshold of 50% used in the original study. In the following, we will refer to the cutoffs for the (sub-)group thresholds ranging from 50% to 80% as ‘liberal’, ‘medium-liberal’, ‘medium-strict’, and ‘strict’. The labels for the fit index cutoffs ‘strict’, ‘medium’, and ‘liberal’ refer to increasing maximum cutoffs of.03,.05, and.08, respectively, and decreasing minimum cutoffs of.97,.95, and.90, respectively. For the number of fit measures satisfying the cutoffs, larger values are stricter.</p>
<p id="S2.PX1.P18">Multiverse analyses have been criticized for combining analysis choices that are not equally justified (<xref ref-type="bibr" rid="Xbib6">Del Giudice &amp; Gangestad, 2021</xref>). The different modeling choices we assess can be considered arbitrary since they are neither theoretically motivated nor justified by a wealth of statistical literature. Rather, the two (sub-)group thresholds reﬂect assumptions of the researcher about how many individuals must have a non-zero path for it to be included at the group level. The different fit index cutoffs serve as stopping criteria and thus adjust the conservativeness of the algorithm. Overall, all seven parameters can be expected to affect the sensitivity-specificity tradeoff.</p>
	<p id="S2.PX1.P19">Changes to the stopping rules of GIMME have not yet been evaluated in simulation studies. Hence, we performed a small simulation study to assess the performance of GIMME when using the modified cutoffs. Results are provided in the supplement (see <xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>). The results showed that the different fit index cutoffs did not substantially affect the overall performance across simulation conditions. We conclude that there is insuﬃcient evidence to generally favor a specific fit-index specification over any other.</p>
<sec id="s2_3_1" sec-type="Statistics|Interest"><title>Statistics of Interest</title>
<p id="S2.PX1.P20">We compute various statistics at all levels of the GIMME model structure to compare the <inline-formula><mml:math id="math-16" display="inline"><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mn>888</mml:mn></mml:math></inline-formula> model specifications. We summarise all statistics in <xref ref-type="table" rid="x1-100021">Table 1</xref>. At the <italic>group</italic> level, we compare the overall number and the specific group-level edges against the reference model. We ignore autoregressive edges, as they are freely estimated by default. Counting these would artificially inﬂate homogeneity.<xref ref-type="fn" rid="fn-2"><sup>2</sup></xref><fn id="fn-2"><label>2</label>
<p id="S2.PX1.P21">We do not ignore autoregressive effects when computing edge weight differences.</p></fn> Homogeneity is investigated by dividing the number of group edges by the total number of nonzero edges across individuals, as proposed by <xref ref-type="bibr" rid="Xbib11">Hoekstra et al. (2022)</xref>. High values imply that most non-zero edges are shared across individuals.</p>
<table-wrap id="x1-100021" position="anchor" orientation="portrait">
<label>Table 1</label><caption><title>Summary Statistics for a Fitted GIMME Model</title></caption>
<table frame="hsides" rules="groups"><colgroup span="1">
<col width="" align="left"/>
	<col width="" align="left"/></colgroup>
<thead>
<tr>
<th>Level</th>
<th>Summary</th>
</tr>
</thead>
<tbody>
<tr>
	<td>Group</td>
	<td>• Number of group-level edges</td>
</tr>
<tr>
<td/>
	<td>• Specific group-level edges present/absent</td>
</tr>
<tr>
	<td/>
	<td>• Homogeneity: number of group edges divided by number of total edges</td>
</tr>
<tr style="grey-border-top">
<td>Subgroup</td>
	<td>• Number of subgroups</td>
</tr>
	<tr>
	<td/>
	<td>• Size of subgroups</td>
</tr>
<tr>
	<td/>
	<td>• Huber-Arabie Adjusted Rand Index (ARI)</td>
</tr>
<tr>
	<td/>
	<td>• Variation of Information (VI)</td>
</tr>
<tr style="grey-border-top">
<td>Individual</td>
<td>• Difference in adjacency matrix</td></tr>
	<tr>
		<td/>
		<td>• Difference in edge weights</td>
	</tr>
	<tr>
	<td/>
	<td>• Difference in density</td>
</tr>
<tr>
	<td/>
	<td>• Difference in fit measures</td>
</tr>
<tr>
	<td/>
	<td>• Difference in out-strength</td>
</tr>
<tr>
<td/>
	<td>• Most central node</td>
</tr>
<tr>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p id="S2.PX1.P22"><italic>Note.</italic> Individual results are first calculated per individual and then aggregated across all individuals in a specification.</p>
</table-wrap-foot>
</table-wrap>
<p id="S2.PX1.P23">At the <italic>subgroup</italic> level, we calculate the number and size of subgroups. Furthermore, we evaluate the adequacy of the estimated subgroup solutions by computing the Hubert-Arabie Adjusted Rand Index (ARI) and the Variation of Information (VI). The ARI is a measure for cluster recovery and quantifies the contingency between the true subgroup structure and the estimated clustering while correcting for chance similarity (<xref ref-type="bibr" rid="Xbib13">Hubert &amp; Arabie, 1985</xref>). An ARI of at least <inline-formula><mml:math id="math-17" display="inline"><mml:mn>0.80</mml:mn></mml:math></inline-formula> has been used as a threshold for similarity (<xref ref-type="bibr" rid="Xbib7">Gates et al., 2019</xref>). The VI is based on information theory and assesses the distance between two clusterings, defined as the information that is lost when moving from one clustering to another (<xref ref-type="bibr" rid="Xbib18">Meilă, 2007</xref>).</p>
<p id="S2.PX1.P24">At the <italic>individual</italic> level, we compute differences between all specifications and the reference specification for the edge weight estimates and the adjacency matrix of an individual. The adjacency matrix encodes the occurrence of certain paths in a binary fashion. As an overall measure of the network structures, we compute network density (the average of absolute edge weights) for temporal and contemporaneous relationships. We used all edge weights, including zero weights, as excluding zero weights could lead to an inﬂated perception of network density. We also compare the fit statistics for an individual model. Finally, we calculate out-strength centrality defined as the sum of all edge weights going out of a specific node. This measure has been used as a proxy for the importance of a variable (<xref ref-type="bibr" rid="Xbib4">Bringmann et al., 2019</xref>). We compare centrality estimates against the reference model and check whether the most central node is identical, as the most central node is sometimes considered a potential target for treatments (<xref ref-type="bibr" rid="Xbib3">Bringmann et al., 2022</xref>).</p></sec>
<sec id="s2_3_2" sec-type="Visualizations"><title>Visualizations</title>
	<p id="S2.PX1.P25">Visualizations are an important tool for multiverse analysis (<xref ref-type="bibr" rid="Xbib10">Hall et al., 2022</xref>). To make our results more accessible, we provide an interactive Shiny app to explore the multiverse. The app is hosted at <xref ref-type="bibr" rid="Xbib22.50">Siepe &amp; Heck, 2025b</xref> or can be downloaded to render locally from <xref ref-type="bibr" rid="Xbib22.75">Siepe &amp; Heck, 2025c</xref>. The app includes functionality to calculate grouped summary statistics per specification, visualize results in different ways, and compute networks for each specification.</p></sec></sec></sec>
<sec id="s3" sec-type="Results"><title>Results</title>
<sec id="s3_1" sec-type="Visualizations"><title>Personality Dataset</title>
<p id="S3.PX1.P1">We were unable to exactly replicate the results of <xref ref-type="bibr" rid="Xbib26">Wright et al. (2019)</xref>, likely due to changes to the GIMME algorithm over the years. We will use the results obtained with the current version of GIMME since they closely resemble the original results. Only one effect surpassed the group-level threshold. Most individual temporal networks were sparse, with the lagged effect from <italic>Negative Affect</italic> to <italic>Stress</italic> being the most commonly estimated edge (9 individuals) besides autoregressive effects and the (sub-)group effects. Individual contemporaneous networks were more densely connected. Three subgroups were identified (59, 24, and 11 members).<xref ref-type="fn" rid="fn-3"><sup>3</sup></xref><fn id="fn-3"><label>3</label>
<p id="S3.PX1.P2">In the original publication, one individual was in the largest subgroup instead of the second largest one.</p></fn> A subgroup path emerged only in the first subgroup. At the individual level, <italic>Negative Affect</italic> and <italic>Stress</italic> had the highest temporal and contemporaneous centrality for the largest number of individuals, respectively.</p>
<sec id="s3_1_1" sec-type="Multiverse|Group|Level"><title>Multiverse: Group Level</title>
<p id="S3.PX1.P3">All models converged. Around <inline-formula><mml:math id="math-18" display="inline"><mml:mn>92.4</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> of the specifications resulted in a single group-level path from <italic>Stress</italic> to <italic>Negative Affect</italic>, matching the reference model. The remaining <inline-formula><mml:math id="math-19" display="inline"><mml:mn>7.6</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> specifications led to two group-level paths. The homogeneity value varied across specifications and went up to twice the value of the reference fit. As a hypothetical example, if there was one group-level path (for all <inline-formula><mml:math id="math-20" display="inline"><mml:mn>94</mml:mn></mml:math></inline-formula> individuals) and <inline-formula><mml:math id="math-21" display="inline"><mml:mn>47</mml:mn></mml:math></inline-formula> additional, non-group-level paths, homogeneity would be <inline-formula><mml:math id="math-22" display="inline"><mml:mn>94</mml:mn><mml:mo>/</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mn>94</mml:mn><mml:mo>+</mml:mo><mml:mn>47</mml:mn><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>2</mml:mn><mml:mo>/</mml:mo><mml:mn>3</mml:mn></mml:math></inline-formula>. Across the multiverse, it ranged from <inline-formula><mml:math id="math-23" display="inline"><mml:mn>12.6</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> to <inline-formula><mml:math id="math-24" display="inline"><mml:mn>29.1</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula>, compared to the reference homogeneity of <inline-formula><mml:math id="math-25" display="inline"><mml:mn>14.8</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula>.</p></sec>
<sec id="s3_1_2" sec-type="Multiverse|Subgroup|Level"><title>Multiverse: Subgroup Level</title>
<p id="S3.PX1.P4">The same three subgroups were obtained in all specifications. Consequently, we did not compute any subgroup comparison metrics. Due to potential differences in the individual-level models, some characteristics of the subgroups, such as the number of paths between certain nodes, may still vary across specifications.</p></sec>
<sec id="s3_1_3" sec-type="Multiverse|Individual|Level"><title>Multiverse: Individual Level</title>
<p id="S3.PX1.P5">We calculated the absolute difference between the individual fitted and the reference adjacency matrices. Then, we summed these differences across all individuals. The average difference of the adjacency matrix for an individual was <inline-formula><mml:math id="math-26" display="inline"><mml:mn>1.745</mml:mn></mml:math></inline-formula> (SD = <inline-formula><mml:math id="math-27" display="inline"><mml:mn>0.718</mml:mn></mml:math></inline-formula>), which means that, on average, the presence (or absence) of slightly less than two paths differed from the reference fit. However, seven individuals had an average difference larger than four.</p>
<p id="S3.PX1.P6"><xref ref-type="fig" rid="x1-160011">Figure 1</xref> shows a specification curve analysis (<xref ref-type="bibr" rid="Xbib23">Simonsohn et al., 2020</xref>) which visualizes the value of a statistic of interest across all specifications. The first panel shows the statistic of interest at the y-axis (here, the mean of the adjacency matrix differences). For example, a value of <inline-formula><mml:math id="math-28" display="inline"><mml:mn>1</mml:mn></mml:math></inline-formula> indicates that on average, across all individual models in a certain specification, one path differed in its presence to the reference model. On the x-axis, all 3,888 model specifications are ordered using this statistic. The second panel shows the strictness of the different specifications.</p><fig id="x1-160011" position="float" orientation="portrait"><label>Figure 1</label><caption><title>Specification Curve of Adjacency Differences for the Personality Dataset.</title>
	<p id="S3.PX1.P7"><italic>Note.</italic> The upper panel is ordered by the size of the mean of the summed differences. The lower panel shows the color-coded levels of different specifications. The verbal labels correspond to different numerical values for each of the 7 factors in the lower panel (for details, see the <xref ref-type="sec" rid="s2">Method</xref> section).</p></caption><graphic mimetype="image" mime-subtype="png" xlink:href="meth.15665-f1.png" position="float" orientation="portrait"/></fig>
<p id="S3.PX1.P8"><xref ref-type="fig" rid="x1-160011">Figure 1</xref> shows a clear trend: When fewer fit indices have to reach a certain criterion for a model to converge (<italic>n.excellent</italic>), the difference to the reference model becomes larger. For the four fit index cutoffs, there appears to be a similar, but weaker, trend. When the cutoffs are more liberal, the difference to the reference model increases.</p>
<p id="S3.PX1.P9">Next, we investigate the frequency of certain paths being included across all specifications of the multiverse. We present a network multiverse plot in <xref ref-type="fig" rid="x1-160022">Figure 2</xref> to visualize this. The upper row shows the percentages of individual fitted models in the reference fit containing a specific edge. In the lower row, the plot shows the proportion of all individual models across all specifications that included a specific edge. For example, roughly <inline-formula><mml:math id="math-29" display="inline"><mml:mn>10</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> of all individual models across all specifications contained the path from <italic>Stress</italic> to <italic>Negative Affect</italic>. We do not show paths that were included in less than <inline-formula><mml:math id="math-30" display="inline"><mml:mn>5</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> of all models. Overall, the results of the multiverse are very similar to the reference fit. Temporal paths occurred less frequently than contemporaneous ones. Each possible effect was included at least once in any model across the multiverse.</p><fig id="x1-160022" position="float" orientation="portrait"><label>Figure 2</label><caption><title>Multiverse Network for the Personality Dataset</title>
<p id="S3.PX1.P10"><italic>Note.</italic> The upper row of the plot shows the percentage of non-zero individual paths in the reference fit (heterogeneity across individuals). The lower row shows the percentage of non-zero edges across all specifications and individual models (indicating heterogeneity across individuals and specifications). Autoregressive paths are omitted. The sign of an effect is ignored. Thickness and transparency of the arrows are scaled to the maximum edge size.</p></caption><graphic mimetype="image" mime-subtype="png" xlink:href="meth.15665-f2.png" position="float" orientation="portrait"/></fig>
<p id="S3.PX1.P11">To compare individual path estimates, we computed the absolute difference of all paths of all individual fitted models within a certain specification relative to the reference model. We then calculated the average of all differences and of all non-zero differences across all individuals. The average mean non-zero difference across all specifications was <inline-formula><mml:math id="math-31" display="inline"><mml:mn>0.125</mml:mn></mml:math></inline-formula> (SD = <inline-formula><mml:math id="math-32" display="inline"><mml:mn>0.031</mml:mn></mml:math></inline-formula>), while the average mean difference was <inline-formula><mml:math id="math-33" display="inline"><mml:mn>0.012</mml:mn></mml:math></inline-formula> (SD = <inline-formula><mml:math id="math-34" display="inline"><mml:mn>0.005</mml:mn></mml:math></inline-formula>). The average non-zero edge weight in the reference model was <inline-formula><mml:math id="math-35" display="inline"><mml:mn>0.320</mml:mn></mml:math></inline-formula>. This means that when an edge weight in any multiverse specification was different from the edge weight in the reference fit, they differed by an average absolute value of <inline-formula><mml:math id="math-36" display="inline"><mml:mn>0.125</mml:mn></mml:math></inline-formula>.</p>
	<p id="S3.PX1.P12">To aid interpretation, we use network density (the sum of all absolute edge weights) as a summary statistic and compare it against the reference fit. A clear trend emerged: The more cutoffs are required for convergence (coded here as ‘strict’), the denser the resulting networks were. A similar, although weaker, trend emerged for the four fit index cutoffs. We present a specification curve analysis as well as detailed results for out-strength centrality and different fit statistics are provided in the supplement (<xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>). For some individuals (<inline-formula><mml:math id="math-37" display="inline"><mml:mn>12</mml:mn></mml:math></inline-formula> for the temporal and <inline-formula><mml:math id="math-38" display="inline"><mml:mn>6</mml:mn></mml:math></inline-formula> for the contemporaneous network), the most central node was identical to the reference model in less than one-third of all specifications.</p></sec></sec>
<sec id="s3_2" sec-type="Emotion|Dataset"><title>Emotion Dataset</title>
<p id="S3.PX1.P13">Similar to the original analyses, all but one individual fitted model in the reference analysis converged successfully. In the reference model for the emotion dataset, only one effect (contemporaneous effect from <italic>Happy</italic> to <italic>Enthusiastic</italic>) surpassed the threshold to be included as a group effect. In general, individual temporal networks were relatively sparse compared to the contemporaneous networks. Besides autoregressive effects (included in all models by default), the most common temporal effect was the path from <italic>Stressed</italic> to <italic>Relaxed</italic> (estimated for <inline-formula><mml:math id="math-39" display="inline"><mml:mn>10</mml:mn></mml:math></inline-formula> participants). Two subgroups with <inline-formula><mml:math id="math-40" display="inline"><mml:mn>53</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="math-41" display="inline"><mml:mn>51</mml:mn></mml:math></inline-formula> members were obtained with five and four contemporaneous paths, respectively. One path was not shared among the two subgroups, whereas the other four contemporaneous paths connected the same variable pairs, but in different directions. <italic>Happy</italic> (temporal network) and <italic>Stressed</italic> (both networks) were the most frequent central nodes.</p>
<sec id="s3_2_1" sec-type="Multiverse|Group|Level"><title>Multiverse: Group Level</title>
<p id="S3.PX1.P14">In <inline-formula><mml:math id="math-42" display="inline"><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>335</mml:mn></mml:math></inline-formula> specifications, one individual-level model did not converge (the same one as in the original analysis). In <inline-formula><mml:math id="math-43" display="inline"><mml:mn>63</mml:mn></mml:math></inline-formula> specifications, the models of two individuals did not converge. In all other specifications, all models converged. In about half of the specifications, one group-level effect was estimated, matching the reference model. In the other half of the specifications, four or five group-level effects were estimated (each with roughly the same proportion). With stricter group cutoffs, these effects often became subgroup effects, which can be explored in our shiny application. Homogeneity of the multiverse models ranged between <inline-formula><mml:math id="math-44" display="inline"><mml:mn>6.4</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="math-45" display="inline"><mml:mn>65.4</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> (<italic>M</italic> = <inline-formula><mml:math id="math-46" display="inline"><mml:mn>24.2</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula>). The reference model had a homogeneity of <inline-formula><mml:math id="math-47" display="inline"><mml:mn>7.9</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula>, so most of the multiverse specifications resulted in a higher homogeneity.</p></sec>
<sec id="s3_2_2" sec-type="Multiverse|Subgroup|Level"><title>Multiverse: Subgroup Level</title>
<p id="S3.PX1.P15">Three subgroups were found in all specifications, but three different subgroup configurations emerged. A subgroup comprised of a single individual emerged in all specifications. The other two subgroups were either of size <inline-formula><mml:math id="math-48" display="inline"><mml:mn>31</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="math-49" display="inline"><mml:mn>73</mml:mn></mml:math></inline-formula>, <inline-formula><mml:math id="math-50" display="inline"><mml:mn>65</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="math-51" display="inline"><mml:mn>39</mml:mn></mml:math></inline-formula>, or <inline-formula><mml:math id="math-52" display="inline"><mml:mn>53</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="math-53" display="inline"><mml:mn>51</mml:mn></mml:math></inline-formula> (as in the original study), respectively. Each of these alternatives occurred <inline-formula><mml:math id="math-54" display="inline"><mml:mn>972</mml:mn></mml:math></inline-formula> times. Therefore, the ARI and VI took three distinct values. We only report the ARI here due to its interpretability. In half of the specifications, it was <inline-formula><mml:math id="math-55" display="inline"><mml:mn>1</mml:mn></mml:math></inline-formula>, indicating perfect similarity. In the other specifications, it was either <inline-formula><mml:math id="math-56" display="inline"><mml:mn>0.09</mml:mn></mml:math></inline-formula> or <inline-formula><mml:math id="math-57" display="inline"><mml:mn>0.12</mml:mn></mml:math></inline-formula>. As stated before, an ARI of at least <inline-formula><mml:math id="math-58" display="inline"><mml:mn>0.80</mml:mn></mml:math></inline-formula> is sometimes used as a threshold for similarity (<xref ref-type="bibr" rid="Xbib7">Gates et al., 2019</xref>). Hence, roughly half of the subgroup solutions are very dissimilar to the original study.</p>
<p id="S3.PX1.P16">Due to an oversight, we initially used a subgroup threshold of <inline-formula><mml:math id="math-59" display="inline"><mml:mn>50</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> instead of the original value of <inline-formula><mml:math id="math-60" display="inline"><mml:mn>51</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula>. This small change led to <inline-formula><mml:math id="math-61" display="inline"><mml:mn>128</mml:mn></mml:math></inline-formula> paths being different compared to the original study, as the addition of a subgroup effect had downstream effects. All results reported here refer to the analysis using <inline-formula><mml:math id="math-62" display="inline"><mml:mn>51</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula>.</p></sec>
<sec id="s3_2_3" sec-type="Multiverse|Individual|Level"><title>Multiverse: Individual Level</title>
<p id="S3.PX1.P17">The average absolute individual adjacency matrix difference was <inline-formula><mml:math id="math-63" display="inline"><mml:mn>8.872</mml:mn></mml:math></inline-formula> (<italic>SD</italic> = <inline-formula><mml:math id="math-64" display="inline"><mml:mn>3.286</mml:mn></mml:math></inline-formula>). This means that, on average, almost nine paths differed in their presence between the multiverse and the reference model. <xref ref-type="fig" rid="x1-200013">Figure 3</xref> shows a specification curve plot illustrating these differences. Larger differences on the right side of the plot indicate larger average deviations. A clear effect of group thresholds emerged, where more liberal group thresholds resulted in larger average differences. In contrast, more liberal subgroup cutoffs resulted both in relatively strong positive and negative differences (right and left side of <xref ref-type="fig" rid="x1-200013">Figure 3</xref>).</p><fig id="x1-200013" position="float" orientation="portrait"><label>Figure 3</label><caption><title>Specification Curve of Adjacency Differences for the Emotion Dataset</title>
<p id="S3.PX1.P18"><italic>Note.</italic> Panel A is ordered by the size of the differences of adjacency matrices per specification. Panel B shows the color-coded levels of different specifications. The y-axis has a different scale than <xref ref-type="fig" rid="x1-160011">Figure 1</xref>.</p></caption><graphic mimetype="image" mime-subtype="png" xlink:href="meth.15665-f3.png" position="float" orientation="portrait"/></fig>
<p id="S3.PX1.P19"><xref ref-type="fig" rid="x1-200024">Figure 4</xref> shows a multiverse network plot illustrating the frequency with which different paths were found across the multiverse. The plot indicates some discrepancies between the multiverse specifications compared to the reference model, especially regarding the directionality of the contemporaneous paths. As for the personality dataset, paths were included more frequently in the contemporaneous than in the temporal network.</p><fig id="x1-200024" position="float" orientation="portrait"><label>Figure 4</label><caption><title>Multiverse Network for the Emotion Dataset</title>
<p id="S3.PX1.P20"><italic>Note.</italic> See <xref ref-type="fig" rid="x1-160022">Figure 2</xref> for an explanation of the plot. ‘HighCorrPositive’ and ‘HighCorrNegative’ refer to the aggregate Positive and Negative Affect variables, respectively, to keep the notation consistent with the original paper.</p></caption><graphic mimetype="image" mime-subtype="png" xlink:href="meth.15665-f4.png" position="float" orientation="portrait"/></fig>
	<p id="S3.PX1.P21">We again calculated the average absolute differences in edge weights. The average difference when excluding paths that are zero across all specifications was <inline-formula><mml:math id="math-65" display="inline"><mml:mn>0.202</mml:mn></mml:math></inline-formula> (<italic>SD</italic> = <inline-formula><mml:math id="math-66" display="inline"><mml:mn>0.059</mml:mn></mml:math></inline-formula>), while the average mean difference of all paths (including zero paths) was <inline-formula><mml:math id="math-67" display="inline"><mml:mn>0.030</mml:mn></mml:math></inline-formula> (<italic>SD</italic> = <inline-formula><mml:math id="math-68" display="inline"><mml:mn>0.011</mml:mn></mml:math></inline-formula>), both larger than in the personality dataset. This indicates that, when an edge was different for an individual, it differed by roughly <inline-formula><mml:math id="math-69" display="inline"><mml:mn>0.2</mml:mn></mml:math></inline-formula> on average. Given the average absolute nonzero edge weight of <inline-formula><mml:math id="math-70" display="inline"><mml:mn>0.358</mml:mn></mml:math></inline-formula> in the reference model, this is a substantial difference.</p>
	<p id="S3.PX1.P22">For network density, we found that while most specifications are within <inline-formula><mml:math id="math-71" display="inline"><mml:mn>10</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula> of the reference density, some differences are close to <inline-formula><mml:math id="math-72" display="inline"><mml:mn>30</mml:mn><mml:mi>%</mml:mi></mml:math></inline-formula>. The clearest pattern emerged for the number of fit indices exceeding a cutoff for convergence, where a stricter setting led to denser networks. A similar tendency emerged for stricter settings of the RMSEA and the CFI. A visualization of network density differences as well as results for centrality and fit indices are available in the supplement (<xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>). For <inline-formula><mml:math id="math-73" display="inline"><mml:mn>30</mml:mn></mml:math></inline-formula> (temporal network) and <inline-formula><mml:math id="math-74" display="inline"><mml:mn>29</mml:mn></mml:math></inline-formula> (contemporaneous network) individuals, the most central node was identical in less than one-third of the specifications.</p></sec></sec>
<sec id="s4" sec-type="Discussion"><title>Discussion</title>
<p id="S4.PX1.P1">We introduced multiverse analysis to multivariate psychological time series modeling and used it to evaluate the robustness of previous empirical research to alternative plausible model specifications. Results were generally robust at the group level. However, fitted individual-level networks were more sensitive to alternative specifications.</p>
<sec id="s4_1" sec-type="Implications|Applied|Research"><title>Implications for Applied Research</title>
<p id="S4.PX1.P2">Users can already change group and subgroup thresholds when using the standard GIMME package (<xref ref-type="bibr" rid="Xbib16">Lane et al., 2019</xref>). In our reanalysis, this choice had no relevant impact on the personality dataset. In the emotion dataset, however, changing the (sub-)group thresholds had a visible impact on the model outcome. Even a slight change from 50% to 51% had a small but notable effect on the estimated model. The average number of observations in the emotion dataset was relatively small. This is consistent with evidence from simulation studies (<xref ref-type="bibr" rid="Xbib16">Lane et al., 2019</xref>, <xref ref-type="bibr" rid="Xbib19">Nestler &amp; Humberg, 2021</xref>) showing that results of GIMME with relatively few time points should be interpreted cautiously.</p>
<p id="S4.PX1.P3">In both data sets, varying the number of fit indices needed to establish convergence from one to three had a substantial inﬂuence on the results. For example, the density of networks, a popular network characteristic (<xref ref-type="bibr" rid="Xbib3">Bringmann et al., 2022</xref>), differed substantially depending on the fit index convergence criterion. The most central node, sometimes used as an indicator for therapeutic targets (<xref ref-type="bibr" rid="Xbib3">Bringmann et al., 2022</xref>), also differed across many specifications for some individuals. Even for summary statistics that may be interpreted as relatively stable across the multiverse (e.g., differences in adjacency matrices), we found considerable interindividual differences in this stability. While model results may be robust overall, they may differ strongly for some individuals. The implications of our results therefore depend on the level of analysis that one is interested in.</p>
<p id="S4.PX1.P4">Results at the group level generally seem to be more robust than those at the individual level. Especially if models such as GIMME are interpreted at the individual level, for instance, for developing individualized treatment rules (<xref ref-type="bibr" rid="Xbib20">Ong et al., 2022</xref>), the inherent uncertainty in model fitting should be considered. A multiverse analysis not only guards against the overinterpretation of chance findings but may also increase the trustworthiness of results if conclusions remain robust.</p>
<p id="S4.PX1.P5">In practice, researchers could thus pre-specify the preferred main analysis and the main outcome(s) of interest together with certain robustness criteria. For example, a primary goal of a study may concern the subgrouping of individuals into clusters. In this case, researchers should focus on the robustness of subgroup solutions by defining the percentage of specifications in the multiverse yielding identical or highly similar subgroups. The primary interest may alternatively be in estimating the association of a network metric such as network density with an external covariate or outcome. To assess the robustness of this association across the multiverse, researchers can adapt the changes we have made to the GIMME package. Of course, one can also modify existing analysis options or other pre-processing choices considered to be relevant. This recommendation applies not only to GIMME analyses but also to other dynamic network analyses whenever multiple decisions need to be made in the modeling process. Instead of exploring a very large multiverse as in this article, researchers can start small with one or two parameters of interest and then expand based on the computational resources available.</p></sec>
<sec id="s4_2" sec-type="Implications|Methodological|Research"><title>Implications for Methodological Research</title>
<p id="S4.PX1.P6">The interpretation of models such as GIMME, containing many parameters at multiple levels, can already be challenging when obtaining only a single model output. Fitting a few thousand such models increases the diﬃculties in interpretation. We proposed several approaches for aggregating, visualizing, and summarizing relevant parameters at the group, subgroup, and individual levels. In doing so, we provide guidance and inspiration for future multiverse analyses in multivariate time series analyses.</p>
	<p id="S4.PX1.P7">Our results have implications for methodological research on GIMME more broadly. We showed that slightly changing arbitrary model-search criteria can markedly alter the resulting estimates. While default GIMME has shown good performance in simulation studies (e.g., <xref ref-type="bibr" rid="Xbib8">Gates et al., 2017</xref>, <xref ref-type="bibr" rid="Xbib11">Hoekstra et al., 2022</xref>, <xref ref-type="bibr" rid="Xbib16">Lane et al., 2019</xref>), substantive conclusions based on GIMME may not always be robust to relatively small changes in the algorithm. The simulation study of some selected scenarios in the supplementary material (<xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>) illustrates that the impact of these arbitrary modeling choices on the performance of GIMME can be small and depends on context. In some scenarios, such as data sets with few observations, more liberal cutoffs might be justified to increase sensitivity. Hence, we have no principled reasons to generally prefer one set of algorithmic decisions for GIMME.</p>
<p id="S4.PX1.P8">This highlights a larger point: Even if future simulations provide evidence about the performance of particular model specifications, researchers should not overestimate the certainty of recommendations derived from simulation studies with a limited range of settings (<xref ref-type="bibr" rid="Xbib22">Siepe et al., 2023</xref>). In simulation studies, researchers always have to focus on a subset of data-generating scenarios (e.g., a specific number of observations or the strength of a true effect) which are then repeatedly used to simulate data sets with random noise. Such simulations often require idealized assumptions, such as multivariate normality or some forms of homogeneity and independence. In many applied settings, especially when dealing with complex time-series data, it is often unclear to what extent such assumptions are met and how much their violation would affect the results of a study. Multiverse analyses provide a remedy for these limitations by using the data at hand with all their peculiarities. Instead of relying on simulation results on how certain analysis decisions affect the results in general, one can directly check whether different preprocessing steps and alternative modeling decisions matter for the analysis at hand. While simulation studies can thus answer broad questions about average performance in reasonable settings, multiverse analyses can show the robustness of results for a specific data set. Exploring the robustness of results and conclusions for specific empirical data sets will therefore remain an important task.</p></sec>
<sec id="s4_3" sec-type="Limitations"><title>Limitations</title>
<p id="S4.PX1.P9">Beyond the choices we studied, there are many other important decisions in modeling time series data. For example, the variables in the original personality analysis were apparently not detrended. As an exploratory analysis, we applied a commonly used detrending procedure to the dataset. This led to the detection of seven instead of three subgroups in the original analysis, indicating that other choices in the analysis pipeline may be consequential. However, the decision of whether and how to detrend is not arbitrary and therefore not necessarily appropriate for a multiverse analysis.</p>
<p id="S4.PX1.P10">Our multiverse analysis here was a case study using a single modeling framework on two datasets that differed in a number of characteristics. Nevertheless, the results of our analysis do not necessarily generalize to new datasets or other modeling approaches. This again emphasizes the utility of future multiverse analyses.</p>
<p id="S4.PX1.P11">Besides the multiverse approach, GIMME analyses themselves have several limitations (e.g., <xref ref-type="bibr" rid="Xbib15">Kullar et al., 2024</xref>, <xref ref-type="bibr" rid="Xbib26">Wright et al., 2019)</xref>. These include the required resources in terms of length and frequency of assessments, the selection of relevant variables for individuals, and the assumption of stationarity. Also, while there is great promise in using models such as GIMME in applied clinical settings (<xref ref-type="bibr" rid="Xbib27">Wright &amp; Woods, 2020</xref>), it is not yet clear which parameters or summary statistics will be most useful (<xref ref-type="bibr" rid="Xbib3">Bringmann et al., 2022</xref>).</p></sec></sec>
<sec id="s5" sec-type="Conclusion"><title>Conclusion</title>
<p id="S5.PX1.P1">We conducted a multiverse analysis to re-analyze two previously published analyses that used GIMME. Overall, we found satisfactory robustness of group-level results against alternative model specifications. However, some individual-level results varied substantially between different specifications. Overall, multiverse analysis is a valuable tool for investigating researchers’ degrees of freedom in complex time series analyses.</p></sec></sec>
</body>
<back>
	
	<fn-group content-type="author-contribution">
		<fn fn-type="con">
			<p><italic>Björn S. Siepe</italic>: Conceptualization, Methodology, Formal Analysis, Investigation, Software, Visualization, Writing—Original Draft Preparation, Writing—Review &amp; Editing. <italic>Daniel W. Heck</italic>: Conceptualization, Methodology, Supervision, Writing—Review &amp; Editing
			</p>
		</fn>
	</fn-group>
	
	
	
<ref-list><title>References</title>
<ref id="Xbib1"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Beltz</surname>, <given-names>A. M.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Gates</surname>, <given-names>K. M.</given-names></string-name> (<year>2017</year>). <article-title>Network mapping with GIMME</article-title>. <source>Multivariate behavioral research</source>, <volume>52</volume>(<issue>6</issue>), <fpage>789</fpage>–<lpage>804</lpage>. <pub-id pub-id-type="doi">10.1080/00273171.2017.1373014</pub-id></mixed-citation></ref>
	<ref id="Xbib2"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Borsboom</surname>, <given-names>D.</given-names></string-name>, <string-name name-style="western"><surname>Deserno</surname>, <given-names>M. K.</given-names></string-name>, <string-name name-style="western"><surname>Rhemtulla</surname>, <given-names>M.</given-names></string-name>, <string-name name-style="western"><surname>Epskamp</surname>, <given-names>S.</given-names></string-name>, <string-name name-style="western"><surname>Fried</surname>, <given-names>E. I.</given-names></string-name>, <string-name name-style="western"><surname>McNally</surname>, <given-names>R. J.</given-names></string-name>, <string-name name-style="western"><surname>Robinaugh</surname>, <given-names>D. J.</given-names></string-name>, <string-name name-style="western"><surname>Perugini</surname>, <given-names>M.</given-names></string-name>, <string-name name-style="western"><surname>Dalege</surname>, <given-names>J.</given-names></string-name>, <string-name name-style="western"><surname>Costantini</surname>, <given-names>G.</given-names></string-name>, <string-name name-style="western"><surname>Isvoranu</surname>, <given-names>A.-M.</given-names></string-name>, <string-name name-style="western"><surname>Wysocki</surname>, <given-names>A. C.</given-names></string-name>, <string-name name-style="western"><surname>van Borkulo</surname>, <given-names>C. D.</given-names></string-name>, <string-name name-style="western"><surname>van Bork</surname>, <given-names>R</given-names></string-name>., &amp; <string-name name-style="western"><surname>Waldorp</surname>, <given-names>L. J</given-names></string-name>.  (<year>2021</year>). <article-title>Network analysis of multivariate data in psychological science</article-title>. <source>Nature Reviews Methods Primers</source>, <volume>1</volume>(<issue>1</issue>), <elocation-id>58</elocation-id>. <pub-id pub-id-type="doi">10.1038/s43586-021-00055-w</pub-id></mixed-citation></ref>
<ref id="Xbib3"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Bringmann</surname>, <given-names>L. F.</given-names></string-name>, <string-name name-style="western"><surname>Albers</surname>, <given-names>C.</given-names></string-name>, <string-name name-style="western"><surname>Bockting</surname>, <given-names>C.</given-names></string-name>, <string-name name-style="western"><surname>Borsboom</surname>, <given-names>D.</given-names></string-name>, <string-name name-style="western"><surname>Ceulemans</surname>, <given-names>E.</given-names></string-name>, <string-name name-style="western"><surname>Cramer</surname>, <given-names>A.</given-names></string-name>, <string-name name-style="western"><surname>Epskamp</surname>, <given-names>S.</given-names></string-name>, <string-name name-style="western"><surname>Eronen</surname>, <given-names>M. I.</given-names></string-name>, <string-name name-style="western"><surname>Hamaker</surname>, <given-names>E.</given-names></string-name>, <string-name name-style="western"><surname>Kuppens</surname>, <given-names>P.</given-names></string-name>, <string-name name-style="western"><surname>Lutz</surname>, <given-names>W.</given-names></string-name>, <string-name name-style="western"><surname>McNally</surname>, <given-names>R. J.</given-names></string-name>, <string-name name-style="western"><surname>Molenaar</surname>, <given-names>P.</given-names></string-name>, <string-name name-style="western"><surname>Tio</surname>, <given-names>P.</given-names></string-name>, <string-name name-style="western"><surname>Voelkle</surname>, <given-names>M. C.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Wichers</surname>, <given-names>M.</given-names></string-name> (<year>2022</year>). <article-title>Psychopathological networks: Theory, methods and practice</article-title>. <source>Behaviour Research and Therapy</source>, <volume>149</volume>, <elocation-id>104011</elocation-id>. <pub-id pub-id-type="doi">10.1016/j.brat.2021.104011</pub-id></mixed-citation></ref>
<ref id="Xbib4"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Bringmann</surname>, <given-names>L. F.</given-names></string-name>, <string-name name-style="western"><surname>Elmer</surname>, <given-names>T.</given-names></string-name>, <string-name name-style="western"><surname>Epskamp</surname>, <given-names>S.</given-names></string-name>, <string-name name-style="western"><surname>Krause</surname>, <given-names>R. W.</given-names></string-name>, <string-name name-style="western"><surname>Schoch</surname>, <given-names>D.</given-names></string-name>, <string-name name-style="western"><surname>Wichers</surname>, <given-names>M.</given-names></string-name>, <string-name name-style="western"><surname>Wigman</surname>, <given-names>J. T.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Snippe</surname>, <given-names>E.</given-names></string-name> (<year>2019</year>). <article-title>What do centrality measures measure in psychological networks?</article-title> <source>Journal of Abnormal Psychology</source>, <volume>128</volume>(<issue>8</issue>), <fpage>892</fpage>–<lpage>903</lpage>. <pub-id pub-id-type="doi">10.1037/abn0000446</pub-id></mixed-citation></ref>
	<ref id="Xbib5"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Dejonckheere</surname>, <given-names>E.</given-names></string-name>, <string-name name-style="western"><surname>Mestdagh</surname>, <given-names>M.</given-names></string-name>, <string-name name-style="western"><surname>Houben</surname>, <given-names>M.</given-names></string-name>, <string-name name-style="western"><surname>Erbas</surname>, <given-names>Y.</given-names></string-name>, <string-name name-style="western"><surname>Pe</surname>, <given-names>M.</given-names></string-name>, <string-name name-style="western"><surname>Koval</surname>, <given-names>P.</given-names></string-name>, <string-name name-style="western"><surname>Brose</surname>, <given-names>A.</given-names></string-name>, <string-name name-style="western"><surname>Bastian</surname>, <given-names>B.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Kuppens</surname>, <given-names>P.</given-names></string-name> (<year>2018</year>). <article-title>The bipolarity of affect and depressive symptoms</article-title>. <source>Journal of Personality and Social Psychology</source>, <volume>114</volume>(<issue>2</issue>), <fpage>323</fpage>–<lpage>341</lpage>. <pub-id pub-id-type="doi">10.1037/pspp0000186</pub-id></mixed-citation></ref>
<ref id="Xbib6"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Del Giudice</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Gangestad</surname>, <given-names>S. W.</given-names></string-name> (<year>2021</year>). <article-title>A traveler’s guide to the multiverse: Promises, pitfalls, and a framework for the evaluation of analytic decisions</article-title>. <source>Advances in Methods and Practices in Psychological Science</source>, <volume>4</volume>(<issue>1</issue>). <pub-id pub-id-type="doi">10.1177/2515245920954925</pub-id></mixed-citation></ref>
<ref id="Xbib7"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Gates</surname>, <given-names>K. M.</given-names></string-name>, <string-name name-style="western"><surname>Fisher</surname>, <given-names>Z. F.</given-names></string-name>, <string-name name-style="western"><surname>Arizmendi</surname>, <given-names>C.</given-names></string-name>, <string-name name-style="western"><surname>Henry</surname>, <given-names>T. R.</given-names></string-name>, <string-name name-style="western"><surname>Duffy</surname>, <given-names>K. A.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Mucha</surname>, <given-names>P. J.</given-names></string-name> (<year>2019</year>). <article-title>Assessing the robustness of cluster solutions obtained from sparse count matrices</article-title>. <source>Psychological Methods</source>, <volume>24</volume>(<issue>6</issue>), <fpage>675</fpage>–<lpage>689</lpage>. <pub-id pub-id-type="doi">10.1037/met0000204</pub-id></mixed-citation></ref>
<ref id="Xbib8"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Gates</surname>, <given-names>K. M.</given-names></string-name>, <string-name name-style="western"><surname>Lane</surname>, <given-names>S. T.</given-names></string-name>, <string-name name-style="western"><surname>Varangis</surname>, <given-names>E.</given-names></string-name>, <string-name name-style="western"><surname>Giovanello</surname>, <given-names>K.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Guiskewicz</surname>, <given-names>K.</given-names></string-name> (<year>2017</year>). <article-title>Unsupervised classification during time-series model building</article-title>. <source>Multivariate Behavioral Research</source>, <volume>52</volume>(<issue>2</issue>), <fpage>129</fpage>–<lpage>148</lpage>. <pub-id pub-id-type="doi">10.1080/00273171.2016.1256187</pub-id></mixed-citation></ref>
<ref id="Xbib9"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Gates</surname>, <given-names>K. M.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Molenaar</surname>, <given-names>P. C. M.</given-names></string-name> (<year>2012</year>). <article-title>Group search algorithm recovers effective connectivity maps for individuals in homogeneous and heterogeneous samples</article-title>. <source>NeuroImage</source>, <volume>63</volume>(<issue>1</issue>), <fpage>310</fpage>–<lpage>319</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2012.06.026</pub-id></mixed-citation></ref>
<ref id="Xbib10"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Hall</surname>, <given-names>B. D.</given-names></string-name>, <string-name name-style="western"><surname>Liu</surname>, <given-names>Y.</given-names></string-name>, <string-name name-style="western"><surname>Jansen</surname>, <given-names>Y.</given-names></string-name>, <string-name name-style="western"><surname>Dragicevic</surname>, <given-names>P.</given-names></string-name>, <string-name name-style="western"><surname>Chevalier</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Kay</surname>, <given-names>M.</given-names></string-name> (<year>2022</year>). <article-title>A survey of tasks and visualizations in multiverse analysis reports</article-title>. <source>Computer Graphics Forum</source>, <volume>41</volume>(<issue>1</issue>), <fpage>402</fpage>–<lpage>426</lpage>. <pub-id pub-id-type="doi">10.1111/cgf.14443</pub-id></mixed-citation></ref>
<ref id="Xbib11"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Hoekstra</surname>, <given-names>R. H. A.</given-names></string-name>, <string-name name-style="western"><surname>Epskamp</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Borsboom</surname>, <given-names>D.</given-names></string-name> (<year>2022</year>). <article-title>Heterogeneity in individual network analysis: reality or illusion?</article-title> <source>Multivariate Behavioral Research</source>, <volume>58</volume>(<issue>4</issue>), <fpage>762</fpage>–<lpage>786</lpage>. <pub-id pub-id-type="doi">10.1080/00273171.2022.2128020</pub-id></mixed-citation></ref>
<ref id="Xbib12"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Hu</surname>, <given-names>L.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Bentler</surname>, <given-names>P. M.</given-names></string-name> (<year>1999</year>). <article-title>Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives</article-title>. <source>Structural Equation Modeling: A Multidisciplinary Journal</source>, <volume>6</volume>(<issue>1</issue>), <fpage>1</fpage>–<lpage>55</lpage>. <pub-id pub-id-type="doi">10.1080/10705519909540118</pub-id></mixed-citation></ref>
<ref id="Xbib13"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Hubert</surname>, <given-names>L.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Arabie</surname>, <given-names>P.</given-names></string-name> (<year>1985</year>). <article-title>Comparing partitions</article-title>. <source>Journal of Classification</source>, <volume>2</volume>(<issue>1</issue>), <fpage>193</fpage>–<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1007/BF01908075</pub-id></mixed-citation></ref>
<ref id="Xbib14"><mixed-citation publication-type="journal"><string-name name-style="western"><surname>Jordan</surname>, <given-names>D. G.</given-names></string-name>, <string-name name-style="western"><surname>Winer</surname>, <given-names>E. S.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Salem</surname>, <given-names>T.</given-names></string-name> (<year>2020</year>). <article-title>The current status of temporal network analysis for clinical science: Considerations as the paradigm shifts?</article-title> <source>Journal of Clinical Psychology</source>, <volume>76</volume>(<issue>9</issue>), <fpage>1591</fpage>–<lpage>1612</lpage>. <pub-id pub-id-type="doi">10.1002/jclp.22957</pub-id></mixed-citation></ref>
	<ref id="Xbib15"><mixed-citation publication-type="author"><string-name name-style="western"><surname>Kullar</surname>, <given-names>M.</given-names></string-name>, <string-name name-style="western"><surname>Carter</surname>, <given-names>S.</given-names></string-name>, <string-name name-style="western"><surname>Hitchcock</surname>, <given-names>C.</given-names></string-name>, <string-name name-style="western"><surname>Whittaker</surname>, <given-names>S.</given-names></string-name>, <string-name name-style="western"><surname>Wright</surname>, <given-names>A. G. C.</given-names></string-name>, <string-name name-style="western"><surname>Dalgleish</surname>, <given-names>T.</given-names></string-name> (<year>2024</year>). <article-title>Patterns of emotion-network dynamics are orthogonal to mood disorder status: An experience sampling investigation</article-title>. <source>Emotion</source>, <volume>24</volume> (<issue>1</issue>), <fpage>116</fpage>–<lpage>129</lpage>. <pub-id pub-id-type="doi">10.1037/emo0001245</pub-id></mixed-citation></ref>
<ref id="Xbib16"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Lane</surname>, <given-names>S. T.</given-names></string-name>, <string-name name-style="western"><surname>Gates</surname>, <given-names>K. M.</given-names></string-name>, <string-name name-style="western"><surname>Pike</surname>, <given-names>H. K.</given-names></string-name>, <string-name name-style="western"><surname>Beltz</surname>, <given-names>A. M.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Wright</surname>, <given-names>A. G.</given-names></string-name> (<year>2019</year>). <article-title>Uncovering general, shared, and unique temporal patterns in ambulatory assessment data</article-title>. <source>Psychological Methods</source>, <volume>24</volume>(<issue>1</issue>), <fpage>54</fpage>–<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1037/met0000192</pub-id></mixed-citation></ref>
<ref id="Xbib17"><mixed-citation publication-type="other"><string-name name-style="western"><surname>McNeish</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Wolf</surname>, <given-names>M. G.</given-names></string-name> (<year>2023</year>). <article-title>Dynamic fit index cutoffs for confirmatory factor analysis models</article-title>. <source>Psychological Methods</source>, <volume>28</volume>(<issue>1</issue>), <fpage>61</fpage>–<lpage>88</lpage>. <pub-id pub-id-type="doi">10.1037/met0000425.supp</pub-id></mixed-citation></ref>
	<ref id="Xbib18"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Meilă</surname>, <given-names>M.</given-names></string-name> (<year>2007</year>). <article-title>Comparing clusterings—an information based distance</article-title>. <source>Journal of Multivariate Analysis</source>, <volume>98</volume>(<issue>5</issue>), <fpage>873</fpage>–<lpage>895</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmva.2006.11.013</pub-id></mixed-citation></ref>
<ref id="Xbib19"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Nestler</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Humberg</surname>, <given-names>S.</given-names></string-name> (<year>2021</year>). <article-title>Gimme’s ability to recover group-level path coefficients and individual-level path coefficients</article-title>. <source>Methodology</source>, <volume>17</volume>(<issue>1</issue>), <fpage>58</fpage>–<lpage>91</lpage>. <pub-id pub-id-type="doi">10.5964/meth.2863</pub-id></mixed-citation></ref>
<ref id="Xbib20"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Ong</surname>, <given-names>C. W.</given-names></string-name>, <string-name name-style="western"><surname>Hayes</surname>, <given-names>S. C.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Hofmann</surname>, <given-names>S. G.</given-names></string-name> (<year>2022</year>). <article-title>A process-based approach to cognitive behavioral therapy: A theory-based case illustration</article-title>. <source>Frontiers in Psychology</source>, <volume>13</volume>, <elocation-id>1002849</elocation-id>. <pub-id pub-id-type="doi">10.3389/fpsyg.2022.1002849</pub-id></mixed-citation></ref>
	<ref id="Xbib21"><mixed-citation publication-type="other"><collab>R Core Team</collab> (<year>2023</year>). <source>R: A language and environment for statistical computing</source> <comment>(Version 4.3.2)</comment>. R Project for Statistical Computing. <ext-link ext-link-type="uri" xlink:href="https://www.R-project.org/">https://www.R-project.org/</ext-link></mixed-citation></ref>
<ref id="Xbib22"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Siepe</surname>, <given-names>B. S.</given-names></string-name>, <string-name name-style="western"><surname>Bartoš</surname>, <given-names>F.</given-names></string-name>, <string-name name-style="western"><surname>Morris</surname>, <given-names>T.</given-names></string-name>, <string-name name-style="western"><surname>Boulesteix</surname>, <given-names>A.-L.</given-names></string-name>, <string-name name-style="western"><surname>Heck</surname>, <given-names>D. W.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Pawel</surname>, <given-names>S.</given-names></string-name> (<year>2023</year>). <source>Simulation studies for methodological research in psychology: A standardized template for planning, preregistration, and reporting</source>. PsyArXiv Preprints. <pub-id pub-id-type="doi">10.31234/osf.io/ufgy6</pub-id></mixed-citation></ref>

	<ref id="Xbib22.25"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Siepe</surname>, <given-names>B. S.</given-names></string-name>,  &amp; <string-name name-style="western"><surname>Heck</surname>, <given-names>D. W.</given-names></string-name> (<year>2025a</year>). <source>Dynamic network multiverse</source>. [Preregistration, Code, Data, Supplementary Materials]. OSF. <ext-link ext-link-type="uri" xlink:href="https://osf.io/xvrz5/">https://osf.io/xvrz5/</ext-link></mixed-citation></ref>
	<ref id="Xbib22.50"><mixed-citation publication-type="data"><string-name name-style="western"><surname>Siepe</surname>, <given-names>B. S.</given-names></string-name>,  &amp; <string-name name-style="western"><surname>Heck</surname>, <given-names>D. W.</given-names></string-name> (<year>2025b</year>). <italic>Network Multiverse Shiny App</italic>. [Shiny App: Hosted]. Shinyapps.io. <ext-link ext-link-type="uri" xlink:href="https://tinyurl.com/netshiny">https://tinyurl.com/netshiny</ext-link></mixed-citation></ref>
	<ref id="Xbib22.75"><mixed-citation publication-type="data"><string-name name-style="western"><surname>Siepe</surname>, <given-names>B. S.</given-names></string-name>,  &amp; <string-name name-style="western"><surname>Heck</surname>, <given-names>D. W.</given-names></string-name> (<year>2025c</year>). <source>Network Multiverse Shiny App</source>. [Shiny App: Download]. GitHub. <ext-link ext-link-type="uri" xlink:href="https://tinyurl.com/git-shiny">https://tinyurl.com/git-shiny</ext-link></mixed-citation></ref>
	<ref id="Xbib22.80"><mixed-citation publication-type="data"><string-name name-style="western"><surname>Siepe</surname>, <given-names>B. S.</given-names></string-name>,  &amp; <string-name name-style="western"><surname>Heck</surname>, <given-names>D. W.</given-names></string-name> (<year>2025d</year>). <source>Fork of Group Iterative Multiple Model Estimation for multiverse analysis</source>. [R package: GIMME modified package]. GitHub. <ext-link ext-link-type="uri" xlink:href="https://github.com/bsiepe/mv-gimme">https://github.com/bsiepe/mv-gimme</ext-link></mixed-citation></ref>
	
<ref id="Xbib23"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Simonsohn</surname>, <given-names>U.</given-names></string-name>, <string-name name-style="western"><surname>Simmons</surname>, <given-names>J. P.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Nelson</surname>, <given-names>L. D.</given-names></string-name> (<year>2020</year>). <article-title>Specification curve analysis</article-title>. <source>Nature Human Behaviour</source>, <volume>4</volume>(<issue>11</issue>), <fpage>1208</fpage>–<lpage>1214</lpage>. <pub-id pub-id-type="doi">10.1038/s41562-020-0912-z</pub-id></mixed-citation></ref>
<ref id="Xbib24"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Steegen</surname>, <given-names>S.</given-names></string-name>, <string-name name-style="western"><surname>Tuerlinckx</surname>, <given-names>F.</given-names></string-name>, <string-name name-style="western"><surname>Gelman</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Vanpaemel</surname>, <given-names>W.</given-names></string-name> (<year>2016</year>). <article-title>Increasing transparency through a multiverse analysis</article-title>. <source>Perspectives on Psychological Science</source>, <volume>11</volume>(<issue>5</issue>), <fpage>702</fpage>–<lpage>712</lpage>. <pub-id pub-id-type="doi">10.1177/1745691616658637</pub-id></mixed-citation></ref>
<ref id="Xbib25"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Weermeijer</surname>, <given-names>J.</given-names></string-name>, <string-name name-style="western"><surname>Lafit</surname>, <given-names>G.</given-names></string-name>, <string-name name-style="western"><surname>Kiekens</surname>, <given-names>G.</given-names></string-name>, <string-name name-style="western"><surname>Wampers</surname>, <given-names>M.</given-names></string-name>, <string-name name-style="western"><surname>Eisele</surname>, <given-names>G.</given-names></string-name>, <string-name name-style="western"><surname>Kasanova</surname>, <given-names>Z.</given-names></string-name>, <string-name name-style="western"><surname>Vaessen</surname>, <given-names>T.</given-names></string-name>, <string-name name-style="western"><surname>Kuppens</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Myin-Germeys</surname>, <given-names>I.</given-names></string-name> (<year>2022</year>). <article-title>Applying multiverse analysis to experience sampling data: Investigating whether preprocessing choices affect robustness of conclusions</article-title>. <source>Behavior Research Methods</source>, <volume>54</volume>(<issue>6</issue>), <fpage>2981</fpage>–<lpage>2992</lpage>. <pub-id pub-id-type="doi">10.3758/s13428-021-01777-1</pub-id></mixed-citation></ref>
<ref id="Xbib26"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Wright</surname>, <given-names>A. G. C.</given-names></string-name>, <string-name name-style="western"><surname>Gates</surname>, <given-names>K. M.</given-names></string-name>, <string-name name-style="western"><surname>Arizmendi</surname>, <given-names>C.</given-names></string-name>, <string-name name-style="western"><surname>Lane</surname>, <given-names>S. T.</given-names></string-name>, <string-name name-style="western"><surname>Woods</surname>, <given-names>W. C.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Edershile</surname>, <given-names>E. A.</given-names></string-name> (<year>2019</year>). <article-title>Focusing personality assessment on the person: Modeling general, shared, and person specific processes in personality and psychopathology</article-title>. <source>Psychological Assessment</source>, <volume>31</volume>(<issue>4</issue>), <fpage>502</fpage>–<lpage>515</lpage>. <pub-id pub-id-type="doi">10.1037/pas0000617</pub-id></mixed-citation></ref>
<ref id="Xbib27"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Wright</surname>, <given-names>A. G. C.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Woods</surname>, <given-names>W. C.</given-names></string-name> (<year>2020</year>). <article-title>Personalized models of psychopathology</article-title>. <source>Annual Review of Clinical Psychology</source>, <volume>16</volume>, <fpage>49</fpage>–<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-clinpsy-102419-125032</pub-id></mixed-citation></ref>
<ref id="Xbib28"><mixed-citation publication-type="other"><string-name name-style="western"><surname>Wright</surname>, <given-names>A. G. C.</given-names></string-name>, &amp; <string-name name-style="western"><surname>Zimmermann</surname>, <given-names>J.</given-names></string-name> (<year>2019</year>). <article-title>Applied ambulatory assessment: Integrating idiographic and nomothetic principles of measurement</article-title>. <source>Psychological Assessment</source>, <volume>31</volume>(<issue>12</issue>), <fpage>1467</fpage>–<lpage>1480</lpage>. <pub-id pub-id-type="doi">10.1037/pas0000685</pub-id></mixed-citation></ref>
</ref-list>
	<sec sec-type="data-availability" id="das"><title>Data Availability</title>
		<p>The preregistration and all code, data, and supplementary materials are available at <xref ref-type="bibr" rid="Xbib22.25">Siepe and Heck (2025a)</xref></p>
	</sec>	

	
	<sec sec-type="supplementary-material" id="sp1"><title>Supplementary Materials</title>
		<p>For this article, the following Supplementary Materials are available:
			<list list-type="bullet">
				<list-item><p>Code. (<xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>)</p></list-item>
				<list-item><p>Data. (<xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>)</p></list-item>
				<list-item><p>Preregistration. (<xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>)</p></list-item>
				<list-item><p>Study materials. (<xref ref-type="bibr" rid="Xbib22.25">Siepe &amp; Heck, 2025a</xref>)</p></list-item>
			</list></p>
	</sec>
	
	
			

<fn-group>
<fn fn-type="financial-disclosure"><p>The authors have no funding to report.</p></fn>
</fn-group>
<fn-group>
<fn fn-type="conflict"><p>The authors have declared that no competing interests exist.</p></fn>
</fn-group>
<ack>
<p>We thank Monica Kullar and Aidan Wright for sharing their data and code online and permitting us to reuse it. We thank Lea Schumacher and Matthias Kloft for their helpful comments on an earlier version of this manuscript.</p>
</ack>
</back>
</article>
