Page 59 - tmp
P. 59
Faculty of Science Handbook, Session 2017/2018
6. Jarman, Kristin H.(2013) Art of Data Analysis : How to clustering: k-means, CART, decision trees; Artificial Neural
Answer Almost any Question Using Basic Statistics, Network; boosting; support vector machine; association
John Wiley & Sons rules mining. Modelling: descriptive and predictive
modelling. Data organization.
SIT3008 INTRODUCTION TO SURVEY SAMPLING Assessment:
Continuous Assessment: 40%
Techniques of statistical sampling with applications in the Final Examination: 60%
analysis of sample survey data. Topics include simple
random sampling, stratified sampling, systematic sampling, Medium of Instruction:
cluster sampling, two-stage sampling and ratio and English
regression estimates.
Humanity Skill:
Assessment: CS3, CT3, LL2
Continuous Assessment: 40%
Final Examination: 60% References:
1. Adriaans, P. and Zantige, D. (1996). Data Mining.
Medium of Instruction: Addison-Wesley.
English 2. Hand, D., Mannila, H. and Smyth, P. (2001). Principles
of Data Mining. MIT Press.
Humanity Skill: 3. Cios.K.J. et al. (2010). Data mining : a knowledge
CT4, LL2 discovery approach. New York : Springer-Verlag
References:
1. Scheaffer, R. L. (2006), Elementary Survey Sampling, SIT3011 BIOINFORMATICS
th
Duxbury (6 ed.).
nd
2. Thompson, S. K. (2002), Sampling, Wiley, (2 ed.). Statistical modelling of DNA/protein sequences:
3. Lohr, Sharon L. (2010), Sampling: Design and Assessing statistical significance in BLAST using the
nd
Analysis, Cengage Learning (2 ed). Gumbel distribution; DNA substitution models; Poisson and
4. Cochran, W. (1977), Sampling Techniques, Wiley (3 rd negative binomial models for gene counts; Hidden Markov
ed.). Model.
Algorithms for sequence analysis and tree
SIT3009 STATISTICAL PROCESS CONTROL construction: Dynamic programming for sequence
alignment and Viterbi decoding; neighbour-joining,
Methods and philosophy of statistical process control. UPGMA, parsimony and maximum likelihood tree-building
Control charts for variables and attributes. CUSUM and methods.
EWMA charts. Process capability analysis. Multivariate
control charts. Acceptance sampling by attributes and Analysis of high-dimensional microarray / RNA-Seq
variables. gene expression data: Statistical tests for detecting
differential expression, feature selection, visualization, and
Assessment: phenotype classification.
Continuous Assessment: 40%
Final Examination : 60% Assessment:
Continuous Assessment: 40%
Medium of Instruction: Final Examination: 60%
English
Medium of Instruction:
Humanity Skill: English
CS3, CT3
Humanity Skill:
References: CS3, CT3, LL2
1. D. C. Montgomery (2009), Introduction to Statistical
Quality Control, Wiley (6 ed). References:
th
2. R. S. Kenett and S. Zacks (1998), Modern Industrial 1.
Statistics: Design and control of quality and reliability, Jones, N.C. & Pevzner, P.A. (2004). An Introduction to
Duxbury Press. Bioinformatics Algorithms. Massachusetts: MIT Press.
3. A. J. Duncan (1986), Quality Control and industrial 2. Durbin, R., Eddy, S., Krogh, A. & Mitchison, G. (1998).
Statistics, Irwin, (5 ed). Biological Sequence Analysis: Probabilistic Models of
th
Proteins and Nucleic Acids. Cambridge: Cambridge
University Press.
SIT3010 INTRODUCTION TO DATA MINING 3. Ewens, W.J. & Grant, G.R. (2005). Statistical Methods
in Bioinformatics: An Introduction (2nd ed.). New York:
Description: Introduction to statistical methods and tools for Springer.
analysis of very large data sets and discovery of interesting 4. Pevsner, J. (2009). Bioinformatics and Functional
and unexpected relationships in the data. Genomics (2nd ed.). New York: Wiley-Blackwell.
Data preprocessing and exploration: data quality and data
cleaning. Data exploration: summarizing and visualizing SIT3012 DESIGN AND ANALYSIS OF EXPERIMENTS
data; principal component, multidimensional scaling. Data
analysis and uncertainty: handling uncertainty; statistical Philosophy related to statistical designed experiments.
inference; sampling. Analysis of variance. Experiments with Blocking factors.
Factorial experiments. Two level factorial designs. Blocking
Statistical approach to data mining and data mining and confounding system for two-level factorials. Two-level
algorithms: Regression, Validation; classification and fractional factorial designs.
57