DYNACOM Co.,Ltd.

Product overview

From data import to a wide range of analyses, all carried out with user-friendly functions.

SNPAlyze is efficient data mining software that extracts useful information from massive amounts of genotyping data.
It supports a range of analyses, including Case-Control Study, Cochran-Armitage Trend Test, Linkage Disequilibrium (LD) Analysis, Haplotype Inference, Hardy-Weinberg Equilibrium Test, Case-Control Haplotype Analysis, Haplotype Block Analysis, and Logistic Regression Analysis*.

In addition, by using Akaike’s information criterion (AIC)*1, the software removes much of the ambiguity in interpreting results and so yields more precise analyses.

Features

Genotyping Data Import
Supports file list
Automatic selection of polymorphic markers
Case-Control Study
  Set up the tabulation method for DNA polymorphism information
  Use of AIC
  Multiple testing correction
Linkage Disequilibrium Analysis
Haplotype Inference
Haplotype frequency inference
Estimation of diplotype distribution
Hardy-Weinberg Equilibrium Test
Use of FDR (Multiple testing correction)
Cochran-Armitage Trend Test
Use of the Bootstrap method
Case-Control Haplotype Analysis
Haplotype Block Analysis
Automatic selection of polymorphic markers
Integration with HealthSketch
Data exchange between SNPAlyze and HealthSketch
Logistic Regression Analysis
Handle genotyping data and all analysis data together
Principal Component Analysis
Manhattan plot

*HealthSketch Ver.2.5.8 or later is required (http://healthsketch.com/)
**Only the Pro version
***Only SNP type markers

SNPAlyze data analysis flow chart

The data analysis flowchart is given below.

*1 Akaike’s information criterion (AIC)

The AIC is a criterion that indicates how well a model corresponds to the observed data. The residual sum of squares (RSS) always becomes smaller as the number of parameters included in a model increases, so goodness of fit alone favors overly complex models.

Hence, SNPAlyze compares not only the size of the RSS but also the number of parameters. The model that gives the minimum AIC is considered the best.

AIC = -2 x (maximum log likelihood of the model) + 2 x (number of free parameters in the model)

The first term, based on the maximum log likelihood of the model, measures how well the model fits the data. The second term, based on the number of free parameters in the model, is the penalty for adding parameters (and hence for model complexity).
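
The formula above can be sketched in a few lines of Python. This is only an illustration of the definition, not SNPAlyze's own code; the function name and the log likelihood values are hypothetical.

```python
def aic(max_log_likelihood, n_free_params):
    """AIC = -2 * (maximum log likelihood) + 2 * (number of free parameters)."""
    return -2.0 * max_log_likelihood + 2.0 * n_free_params

# Hypothetical values, for illustration only.
# The complex model fits slightly better but uses three more parameters.
simple_model = aic(max_log_likelihood=-120.0, n_free_params=2)
complex_model = aic(max_log_likelihood=-119.5, n_free_params=5)

# The model with the minimum AIC is preferred.
best = min(("simple", simple_model), ("complex", complex_model), key=lambda m: m[1])
print(best)
```

In this made-up case the small gain in fit does not offset the penalty for the extra parameters, so the simpler model has the lower AIC.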

The larger the absolute value of the AIC (here, the difference in AIC between the dependent and the independent model), the more reliable the judgment. An absolute value of the AIC close to zero is considered roughly equivalent to the 5% level of significance in the chi-square test, although this correspondence depends on the degrees of freedom of the contingency table.
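
As a rough illustration of this comparison, the Python sketch below computes the AIC of a dependent (saturated) and an independent multinomial model for a contingency table. This is our reading of the approach in the reference below and not necessarily how SNPAlyze implements it. The sketch also shows, assuming the AIC difference equals twice the degrees of freedom minus the likelihood-ratio chi-square statistic, which p-value an AIC difference of exactly zero corresponds to. All function names and the sample table are hypothetical.

```python
import math

def log_likelihood(counts, probs):
    """Multinomial log likelihood (constant term omitted); empty cells contribute 0."""
    return sum(n * math.log(p) for n, p in zip(counts, probs) if n > 0)

def aic_dependent_independent(table):
    """Return (AIC of dependent model, AIC of independent model) for an r x c table."""
    r, c = len(table), len(table[0])
    total = sum(map(sum, table))
    row_tot = [sum(row) for row in table]
    col_tot = [sum(row[j] for row in table) for j in range(c)]
    cells = [table[i][j] for i in range(r) for j in range(c)]
    # Dependent model: every cell has its own probability (r*c - 1 free parameters).
    p_dep = [n / total for n in cells]
    # Independent model: cell probability = row margin * column margin
    # ((r - 1) + (c - 1) free parameters).
    p_ind = [row_tot[i] * col_tot[j] / total ** 2 for i in range(r) for j in range(c)]
    aic_dep = -2 * log_likelihood(cells, p_dep) + 2 * (r * c - 1)
    aic_ind = -2 * log_likelihood(cells, p_ind) + 2 * (r + c - 2)
    return aic_dep, aic_ind

def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df (finite series)."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 0:
        term, total = 1.0, 1.0
        for k in range(1, df // 2):
            term *= half / k
            total += term
        return math.exp(-half) * total
    chi = math.sqrt(x)
    term, total = 0.0, 0.0
    for k in range(1, (df - 1) // 2 + 1):
        term = chi if k == 1 else term * x / (2 * k - 1)
        total += term
    return math.erfc(chi / math.sqrt(2.0)) + math.sqrt(2.0 / math.pi) * math.exp(-half) * total

def p_value_at_zero_aic_difference(df):
    """An AIC difference of zero means the chi-square statistic equals 2 * df."""
    return chi2_sf(2.0 * df, df)

table = [[30, 20], [20, 30]]  # hypothetical 2 x 2 counts
aic_dep, aic_ind = aic_dependent_independent(table)
print(aic_dep - aic_ind)  # negative: the dependent model is preferred
for df in (1, 2, 7):
    print(df, round(p_value_at_zero_aic_difference(df), 3))
```

Under these assumptions, an AIC difference of zero corresponds to a p-value of about 0.157 for 1 degree of freedom, 0.135 for 2, and 0.051 for 7, which illustrates how the correspondence with the 5% level varies with the degrees of freedom.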

Reference: Sakamoto Y. and Akaike H. (1978) Analysis of Cross Classified Data by AIC, Ann. Inst. Statist. Math., 30-1, pp. 185-197.