首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The purpose of this study is to provide guidance on a process for including latent class predictors in regression mixture models. We first examine the performance of current practice for using the 1-step and 3-step approaches where the direct covariate effect on the outcome is omitted. None of the approaches show adequate estimates of model parameters. Given that Step 1 of the 3-step approach shows adequate results in class enumeration, we suggest using an alternative approach: (a) decide the number of latent classes without predictors of latent classes, and (b) bring the latent class predictors into the model with the inclusion of hypothesized direct covariate effects. Our simulations show that this approach leads to good estimates for all model parameters. The proposed approach is demonstrated by using empirical data to examine the differential effects of family resources on students’ academic achievement outcome. Implications of the study are discussed.  相似文献   

2.
Factor mixture modeling (FMM) has been increasingly used to investigate unobserved population heterogeneity. This study examined the issue of covariate effects with FMM in the context of measurement invariance testing. Specifically, the impact of excluding and misspecifying covariate effects on measurement invariance testing and class enumeration was investigated via Monte Carlo simulations. Data were generated based on FMM models with (1) a zero covariate effect, (2) a covariate effect on the latent class variable, and (3) covariate effects on both the latent class variable and the factor. For each population model, different analysis models that excluded or misspecified covariate effects were fitted. Results highlighted the importance of including proper covariates in measurement invariance testing and evidenced the utility of a model comparison approach in searching for the correct specification of covariate effects and the level of measurement invariance. This approach was demonstrated using an empirical data set. Implications for methodological and applied research are discussed.  相似文献   

3.
Taxometric procedures such as MAXEIG and factor mixture modeling (FMM) are used in latent class clustering, but they have very different sets of strengths and weaknesses. Taxometric procedures, popular in psychiatric and psychopathology applications, do not rely on distributional assumptions. Their sole purpose is to detect the presence of latent classes. The procedures capitalize on the assumption that, due to mean differences between two classes, item covariances within class are smaller than item covariances between the classes. FMM goes beyond class detection and permits the specification of hypothesis-based within-class covariance structures ranging from local independence to multidimensional within-class factor models. In principle, FMM permits the comparison of alternative models using likelihood-based indexes. These advantages come at the price of distributional assumptions. In addition, models are often highly parameterized and susceptible to misspecifications of the within-class covariance structure.

Following an illustration with an empirical data set of binary depression items, the MAXEIG procedure and FMM are compared in a simulation study focusing on class detection and the assignment of subjects to the latent classes. FMM generally outperformed MAXEIG in terms of class detection and class assignment. Substantially different class sizes negatively impacted the performance of both approaches, whereas low class separation was much more problematic for MAXEIG than for the FMM.  相似文献   

4.
When conducting longitudinal research, the investigation of between-individual differences in patterns of within-individual change can provide important insights. In this article, we use simulation methods to investigate the performance of a model-based exploratory data mining technique—structural equation model trees (SEM trees; Brandmaier, Oertzen, McArdle, & Lindenberger, 2013)—as a tool for detecting population heterogeneity. We use a latent-change score model as a data generation model and manipulate the precision of the information provided by a covariate about the true latent profile as well as other factors, including sample size, under the possible influences of model misspecifications. Simulation results show that, compared with latent growth curve mixture models, SEM trees might be very sensitive to model misspecification in estimating the number of classes. This can be attributed to the lower statistical power in identifying classes, resulting from smaller differences of parameters prescribed by the template model between classes.  相似文献   

5.
Current practices for growth mixture modeling emphasize the importance of the proper parameterization and number of classes, but the impact of these decisions on latent class composition and the substantive implications has not been thoroughly addressed. Using measures of behavior from 575 middle school students, we compared the results of several multilevel growth mixture models. Results indicated a dramatic shift in class assignment as the models allowed class-varying parameters, with different substantive interpretations and resulting typologies. This research suggests that using variability as a criterion for class differences in a behavior typology can dramatically impact latent class membership. This study describes decisions and results from testing for noninvariance, with particular emphasis on how decisions about the nature of within-person variance can affect resulting subgroups and model parameters.  相似文献   

6.
For some time, there have been differing recommendations about how and when to include covariates in the mixture model building process. Some have advocated the inclusion of covariates after enumeration, whereas others recommend including them early on in the modeling process. These conflicting recommendations have led to inconsistent practices and unease in trusting modeling results. In an attempt to resolve this discord, we conducted a Monte Carlo simulation to examine the impact of covariate exclusion and misspecification of covariate effects on the enumeration process. We considered population and analysis models with both direct and indirect paths from the covariates to the latent class indicators. As expected, misspecified covariate effects most commonly led to the overextraction of classes. Findings suggest that the number of classes could be reliably determined using the unconditional latent class model, thus our recommendation is that class enumeration be done prior to the inclusion of covariates.  相似文献   

7.
Mixture Rasch models have been used to study a number of psychometric issues such as goodness of fit, response strategy differences, strategy shifts, and multidimensionality. Although these models offer the potential for improving understanding of the latent variables being measured, under some conditions overextraction of latent classes may occur, potentially leading to misinterpretation of results. In this study, a mixture Rasch model was applied to data from a statewide test that was initially calibrated to conform to a 3‐parameter logistic (3PL) model. Results suggested how latent classes could be explained and also suggested that these latent classes might be due to applying a mixture Rasch model to 3PL data. To support this latter conjecture, a simulation study was presented to demonstrate how data generated to fit a one‐class 2‐parameter logistic (2PL) model required more than one class when fit with a mixture Rasch model.  相似文献   

8.
Relatively few empirical studies have examined the performance of enumeration indices in correctly identifying longitudinal population heterogeneity when present, and these studies have produced mixed results. In light of these findings, Muthén (2003) suggested that the inclusion of time-invariant (antecedent) and time-varying (concurrent) covariates, as well as allowing the extracted growth mixture classes to predict a distal outcome (consequent covariate), could improve the performance of enumeration indices. This Monte Carlo simulation study examined the performance of a large set of enumeration indices within these generalized growth mixture model (GGMM) conditions, as suggested by Muthén, by manipulating 4 design conditions: (a) sample size, (b) separation distance between adjacent latent growth trajectories, (c) growth trajectory membership proportions, and (d) the amount of mixture model variance explained by the covariates. The findings suggest: (a) at smaller (N = 500) sample sizes, enumeration indices were much more likely to select an incorrect model than the correct one, (b) at larger (N = 3,000) sample sizes, correct model identification occurred only in the largest separation distance and trajectory membership equality conditions, and (c) the inclusion of covariates had a negligible effect at smaller sample sizes, but covariate inclusion had a more favorable effect on data generation model identification in the largest sample size, largest trajectory separation distance, and equal trajectory membership proportions conditions. Suggestions for researchers and future empirical study possibilities are discussed.  相似文献   

9.
Simulation studies involving mixture models inevitably aggregate parameter estimates and other output across numerous replications. A primary issue that arises in these methodological investigations is label switching. The current study compares several label switching corrections that are commonly used when dealing with mixture models. A growth mixture model is used in this simulation study, and the design crosses three manipulated variables—number of latent classes, latent class probabilities, and class separation, yielding a total of 18 conditions. Within each of these conditions, the accuracy of a priori identifiability constraints, a priori training of the algorithm, and four post hoc algorithms developed by Tueller et al.; Cho; Stephens; and Rodriguez and Walker are tested to determine their classification accuracy. Findings reveal that, of all a priori methods, training of the algorithm leads to the most accurate classification under all conditions. In a case where an a priori algorithm is not selected, Rodriguez and Walker’s algorithm is an excellent choice if interested specifically in aggregating class output without consideration as to whether the classes are accurately ordered. Using any of the post hoc algorithms tested yields improvement over baseline accuracy and is most effective under two-class models when class separation is high. This study found that if the class constraint algorithm was used a priori, it should be combined with a post hoc algorithm for accurate classification.  相似文献   

10.
Growth mixture modeling (GMM) is a useful statistical method for longitudinal studies because it includes features of both latent growth modeling (LGM) and finite mixture modeling. This Monte Carlo simulation study explored the impact of ignoring 3 types of time series processes (i.e., AR(1), MA(1), and ARMA(1,1)) in GMM and manipulated the separation of the latent classes, the strength of the time series process, and whether the errors conformed to the time series process in 1 or 2 latent classes. The results showed that omitting time series processes resulted in more serious bias in parameter estimation as the distance between classes increased. However, when the class distances were small, ignoring time series processes contributed to the selection of the correct number of classes. When the GMM models correctly specified the time series process, only models with an AR(1) time series process produced unbiased parameter estimates in most conditions. It was also found that among design factors manipulated, the distance between classes prominently affected the identification of the number of classes and parameter estimation.  相似文献   

11.
Stage-sequential (or multiphase) growth mixture models are useful for delineating potentially different growth processes across multiple phases over time and for determining whether latent subgroups exist within a population. These models are increasingly important as social behavioral scientists are interested in better understanding change processes across distinctively different phases, such as before and after an intervention. One of the less understood issues related to the use of growth mixture models is how to decide on the optimal number of latent classes. The performance of several traditionally used information criteria for determining the number of classes is examined through a Monte Carlo simulation study in single- and multiphase growth mixture models. For thorough examination, the simulation was carried out in 2 perspectives: the models and the factors. The simulation in terms of the models was carried out to see the overall performance of the information criteria within and across the models, whereas the simulation in terms of the factors was carried out to see the effect of each simulation factor on the performance of the information criteria holding the other factors constant. The findings not only support that sample size adjusted Bayesian Information Criterion would be a good choice under more realistic conditions, such as low class separation, smaller sample size, or missing data, but also increase understanding of the performance of information criteria in single- and multiphase growth mixture models.  相似文献   

12.
Sometimes, test‐takers may not be able to attempt all items to the best of their ability (with full effort) due to personal factors (e.g., low motivation) or testing conditions (e.g., time limit), resulting in poor performances on certain items, especially those located toward the end of a test. Standard item response theory (IRT) models fail to consider such testing behaviors. In this study, a new class of mixture IRT models was developed to account for such testing behavior in dichotomous and polytomous items, by assuming test‐takers were composed of multiple latent classes and by adding a decrement parameter to each latent class to describe performance decline. Parameter recovery, effect of model misspecification, and robustness of the linearity assumption in performance decline were evaluated using simulations. It was found that the parameters in the new models were recovered fairly well by using the freeware WinBUGS; the failure to account for such behavior by fitting standard IRT models resulted in overestimation of difficulty parameters on items located toward the end of the test and overestimation of test reliability; and the linearity assumption in performance decline was rather robust. An empirical example is provided to illustrate the applications and the implications of the new class of models.  相似文献   

13.
The inclusion of covariates improves the prediction of class memberships in latent class analysis (LCA). Several methods for examining covariate effects have been developed over the past decade; however, researchers have limited to the comparisons of the performance among these methods in cases of the single-level LCA. The present study investigated the performance of three different methods for examining covariate effects in a multilevel setting. We conducted a simulation to compare the performance of the three methods when level-1 and level-2 covariates were simultaneously incorporated into the nonparametric multilevel latent class model to predict latent class membership at each level. The simulation results revealed that the bias-adjusted three-step maximum likelihood method performed equally well as the one-step method when the sample sizes were sufficiently large and the latent classes were distinct from each other. However, the unadjusted three-step method significantly underestimated the level-1 covariate effect in most conditions.

  相似文献   


14.
This simulation study examines the efficacy of multilevel factor mixture modeling (ML FMM) for measurement invariance testing across unobserved groups when the groups are at the between level of multilevel data. To this end, latent classes are generated with class-specific item parameters (i.e., factor loading and intercept) across the between-level classes. The efficacy of ML FMM is evaluated in terms of class enumeration, class assignment, and the detection of noninvariance. Various classification criteria such as Akaike’s information criterion, Bayesian information criterion, and bootstrap likelihood ratio tests are examined for the correct enumeration of between-level latent classes. For the detection of measurement noninvariance, free and constrained baseline approaches are compared with respect to true positive and false positive rates. This study evidences the adequacy of ML FMM. However, its performance heavily depends on the simulation factors such as the classification criteria, sample size, and the magnitude of noninvariance. Practical guidelines for applied researchers are provided.  相似文献   

15.
In this simulation study, we explored the effect of introducing covariates to a growth mixture model when covariates were also generated by a mixture model. We varied the association between the latent classes underlying the growth trajectories and the covariates, the degree of separation between the latent classes underlying the covariates, the number of covariates included, and amount of missing data in the growth data. We found that adding covariates to the growth mixture model generally hurt class recovery except where the latent classes underlying the growth trajectories and the covariates were the same or very strongly associated, and there was a large degree of separation between the classes underlying the covariates. We found that when covariates were introduced, entropy might no longer be an accurate indicator of the distinctiveness of the growth trajectory classes.  相似文献   

16.
Mixture modeling is a widely applied data analysis technique used to identify unobserved heterogeneity in a population. Despite mixture models' usefulness in practice, one unresolved issue in the application of mixture models is that there is not one commonly accepted statistical indicator for deciding on the number of classes in a study population. This article presents the results of a simulation study that examines the performance of likelihood-based tests and the traditionally used Information Criterion (ICs) used for determining the number of classes in mixture modeling. We look at the performance of these tests and indexes for 3 types of mixture models: latent class analysis (LCA), a factor mixture model (FMA), and a growth mixture models (GMM). We evaluate the ability of the tests and indexes to correctly identify the number of classes at three different sample sizes (n = 200, 500, 1,000). Whereas the Bayesian Information Criterion performed the best of the ICs, the bootstrap likelihood ratio test proved to be a very consistent indicator of classes across all of the models considered.  相似文献   

17.
This study investigates the effect of multidimensionality on extraction of latent classes in mixture Rasch models. In this study, two‐dimensional data were generated under varying conditions. The two‐dimensional data sets were analyzed with one‐ to five‐class mixture Rasch models. Results of the simulation study indicate the mixture Rasch model tended to extract more latent classes than the number of dimensions simulated, particularly when the multidimensional structure of the data was more complex. In addition, the number of extracted latent classes decreased as the dimensions were more highly correlated regardless of multidimensional structure. An analysis of the empirical multidimensional data also shows that the number of latent classes extracted by the mixture Rasch model is larger than the number of dimensions measured by the test.  相似文献   

18.
Latent Markov models with covariates can be estimated via 1-step maximum likelihood. However, this 1-step approach has various disadvantages, such as that the inclusion of covariates in the model might alter the formation of the latent states and that parameter estimation could become infeasible with large numbers of time points, responses, and covariates. This is why researchers typically prefer performing the analysis in a stepwise manner; that is, they first construct the measurement model, then obtain the latent state classifications, and subsequently study the relationship between covariates and latent state memberships. However, such a stepwise approach yields downward-biased estimates of the covariate effects on initial state and transition probabilities. This article, shows how to overcome this problem using a generalization of the bias-corrected 3-step estimation method proposed for latent class analysis (Asparouhov & Muthén, 2014; Bolck, Croon, & Hagenaars, 2004; Vermunt, 2010). We give a formal derivation of the generalization to latent Markov models and discuss how it can be used with many time points by incorporating it into a Baum–Welch type of expectation-maximization algorithm. We evaluate the method through a simulation study and illustrate it using an application on household financial portfolio change. Our study shows that the proposed correction method yields unbiased parameter estimates and accurate standard errors, except for situations with very poorly separated classes and a small sample.  相似文献   

19.
We propose a maximum likelihood framework for estimating finite mixtures of multivariate regression and simultaneous equation models with multiple endogenous variables. The proposed “semi‐parametric” approach posits that the sample of endogenous observations arises from a finite mixture of components (or latent‐classes) of unknown proportions with multiple structural relations implied by the specified model for each latent‐class. We devise an Expectation‐Maximization algorithm in a maximum likelihood framework to simultaneously estimate the class proportions, the class‐specific structural parameters, and posterior probabilities of membership of each observation into each latent‐class. The appropriate number of classes can be chosen using various information‐theoretic heuristics. A data set entailing cross‐sectional observations for a diverse sample of businesses is used to illustrate the proposed approach.  相似文献   

20.
The effects of misspecifying intercept-covariate interactions in a 4 time-point latent growth model were the focus of this investigation. The investigation was motivated by school growth studies in which students' entry-level skills may affect their rate of growth. We studied the latent interaction of intercept and a covariate in predicting growth with respect to 3 factors: sample size (100, 200, and 500), 4 levels of magnitude of interaction effect, and 3 correlation values between intercept and covariate (.3, .5, and .7). Correctly specified models were examined to determine power and Type I error rates, and misspecified models were examined to evaluate the effects on power, parameter estimation, bias, and fit indexes. Results showed that, under correctly specified models, power increased as expected with increasing sample size, larger magnitude of interaction, and larger intercept-covariate correlation. Under misspecification, omitting a non-null interaction results in significant change in the estimation of the direct effects of both covariate and intercept in both magnitude and direction, with results dependent on sign of parameter values for main effects and interaction. Including a spurious interaction does not affect estimation of direct effects of intercept and covariate but does increase standard errors. The primary problem in ignoring a non-null interaction lies in misinterpretation of the model, as interactions yield important insights into the nature of the processes being studied.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号