Product overview

From data import through to a variety of analyses, all carried out with user-friendly functionality.

SNPAlyze is efficient data mining software that extracts useful information from massive amounts of genotyping data.
This software can be used for various analyses such as Case-Control Study, Cochran-Armitage Trend Test, Linkage Disequilibrium (LD) Analysis, Haplotype Inference, Hardy-Weinberg Equilibrium test, Case-Control Haplotype Analysis, Haplotype Block Analysis, and Logistic Regression Analysis*.

In addition, by introducing Akaike’s information criterion (AIC)*1, the software eliminates most ambiguity and so delivers much more precise analyses.

Tech Specs
*HealthSketch Ver.2.5.8 or later is required (
**Only the Pro version
***Only SNP type markers

SNPAlyze data analysis flowchart

The data analysis flowchart is given below.


The main features will be introduced next.


*1 Akaike’s information criterion (AIC)

The AIC is a criterion that indicates how well a model corresponds to the observed data. The residual sum of squares (RSS) alone cannot serve this purpose, because it becomes smaller as the number of parameters included in a model increases.

Hence, SNPAlyze not only compares the size of the RSS but also takes the number of parameters into account. The model with the minimum AIC is considered the best.

AIC = -2 x (maximum log likelihood of the model) + 2 x (number of free parameters in the model)

The first term, which is based on the maximum log likelihood of the model, measures how well the model fits the data. The second term, which is based on the number of free parameters in the model, is the penalty for adding parameters (and hence for model complexity).
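As an illustration of how the two terms trade off, here is a minimal Python sketch of the formula. The function name aic, the model names, and the log likelihood values are hypothetical and chosen only for this example; none of this is part of SNPAlyze.

```python
def aic(max_log_likelihood: float, n_free_params: int) -> float:
    """AIC = -2 x (maximum log likelihood) + 2 x (number of free parameters)."""
    return -2.0 * max_log_likelihood + 2.0 * n_free_params


# Hypothetical candidates: (maximum log likelihood, number of free parameters).
# model_B fits slightly better but uses three more parameters.
candidates = {
    "model_A": (-120.0, 2),
    "model_B": (-118.5, 5),
}

scores = {name: aic(ll, k) for name, (ll, k) in candidates.items()}
best = min(scores, key=scores.get)  # the model with the minimum AIC

print(scores)
print("selected:", best)
```

In this made-up case the better fit of model_B does not outweigh its parameter penalty, so model_A is selected.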

The reliability of the determination increases as the absolute value of the AIC increases (here, the AIC refers to the difference between the AIC of a dependent model and that of an independent model). An absolute value close to zero is considered equivalent to the 5% level of significance in the chi-square test, although this correspondence depends on the degrees of freedom of the contingency table.
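The comparison between a dependent and an independent model can be sketched for a contingency table as follows. This is a minimal sketch, assuming the usual multinomial formulation for cross-classified data: the dependent model has one probability per cell, and the independent model is the product of row and column marginals. The function name contingency_aic and the genotype counts are hypothetical, and the code is not SNPAlyze's implementation.

```python
import math


def contingency_aic(table):
    """Return (AIC of dependent model, AIC of independent model) for an r x c table.

    The multinomial constant is omitted because it is identical in both models
    and cancels in the difference.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    r, c = len(row_totals), len(col_totals)

    # Dependent model: each cell has its own probability n_ij / n (r*c - 1 free parameters).
    ll_dep = sum(x * math.log(x / n) for row in table for x in row if x > 0)

    # Independent model: cell probability is the product of the row and column
    # marginal probabilities ((r - 1) + (c - 1) free parameters).
    ll_ind = sum(t * math.log(t / n) for t in row_totals if t > 0)
    ll_ind += sum(t * math.log(t / n) for t in col_totals if t > 0)

    aic_dep = -2.0 * ll_dep + 2.0 * (r * c - 1)
    aic_ind = -2.0 * ll_ind + 2.0 * (r + c - 2)
    return aic_dep, aic_ind


# Hypothetical case-control genotype counts (columns: AA, Aa, aa).
table = [
    [30, 50, 20],  # cases
    [50, 40, 10],  # controls
]
aic_dep, aic_ind = contingency_aic(table)
difference = aic_dep - aic_ind  # negative values favor the dependent model

print("AIC (dependent):  ", aic_dep)
print("AIC (independent):", aic_ind)
print("difference:       ", difference)
```

Under these assumptions the difference equals 2 x (degrees of freedom) minus the likelihood-ratio statistic of the table, which is why the correspondence with a chi-square significance level depends on the degrees of freedom.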

Reference: Sakamoto Y. and Akaike H. (1978) Analysis of Cross Classified Data by AIC, Ann. Inst. Statist. Math., 30-1, pp. 185-197.