Page 156 - buku panduan 20192020
P. 156

Faculty of Science Handbook, Session 2019/2020


               cluster  sampling,  two-stage  sampling  and  ratio  and   Medium of Instruction:
               regression estimates.                           English

               Assessment:                                     Soft Skills:
               Continuous Assessment:       40%                CS3, CTPS3
               Final Examination:           60%
                                                               References:
               Medium of Instruction:                          1.   Adriaans,  P.,  &  Zantige,  D.  (1996).  Data  mining.
               English                                             Addison-Wesley.
                                                               2.    Hand, D., Mannila, H., & Smyth, P. (2001). Principles
               Soft Skills:                                        of data mining. MIT Press.
               CS3, CTPS3                                      3.    Cios.  K.  J.  et  al.  (2010).  Data  mining:  A  knowledge
                                                                   discovery approach. New York: Springer-Verlag
               References:
               1.   Scheaffer,  R.  L.  (2006),  Elementary  survey  sampling
                    th
                   (6  ed.). Duxbury.                          SIT3011  BIOINFORMATICS
                                             nd
               2.   Thompson, S. K. (2002), Sampling (2  ed.). Wiley.
               3.   Lohr,  Sharon  L.  (2010).  Sampling:  Design  and   Statistical  modelling  of  DNA/protein  sequences:
                           nd
                   analysis (2  ed). Cengage Learning.         Assessing  statistical  significance  in  BLAST  using  the
               4.   Cochran,  W.  (1977).  Sampling  techniques  (3   ed.).   Gumbel distribution; DNA substitution models; Poisson and
                                                     rd
                   Wiley.                                      negative binomial models for gene counts; Hidden Markov
                                                               Model.

               SIT3009  STATISTICAL PROCESS CONTROL            Algorithms   for   sequence   analysis   and   tree
                                                               construction:  Dynamic  programming  for  sequence
               Methods  and  philosophy  of  statistical  process  control.   alignment   and   Viterbi   decoding;   neighbour-joining,
               Control  charts  for  variables  and  attributes.  CUSUM  and   UPGMA,  parsimony  and  maximum  likelihood  tree-building
               EWMA  charts.  Process  capability  analysis.  Multivariate   methods.
               control  charts.  Acceptance  sampling  by  attributes  and
               variables.                                      Analysis  of  high-dimensional  microarray  /  RNA-Seq
                                                               gene  expression  data:  Statistical  tests  for  detecting
               Assessment:                                     differential expression, feature selection, visualization, and
               Continuous Assessment:       40%                phenotype classification.
               Final Examination:           60%
                                                               Assessment:
               Medium of Instruction:                          Continuous Assessment:       40%
               English                                         Final Examination:           60%

               Soft Skills:                                    Medium of Instruction:
               CS3, CTPS3                                      English

               References:                                     Soft Skills:
               1.   D.  C.  Montgomery.  (2009).  Introduction  to  statistical   CS3, CTPS3
                               th
                   quality control (6  ed.). Wiley.
               2.   R.  S.  Kenett,  &  S.  Zacks.  (1998).  Modern  industrial   References:
                   statistics: Design and control of quality and reliability.
                   Duxbury Press.                              1.   Jones, N.C., & Pevzner, P.A. (2004).  An introduction
               3.    A.  J.  Duncan.  (1986).  Quality  control  and  industrial   to  bioinformatics  algorithms.  Massachusetts:  MIT
                   statistics (5  ed.). Irwin.                     Press.
                           th
                                                               2.    Durbin,  R.,  Eddy,  S.,  Krogh,  A.,  &  Mitchison,  G.
                                                                   (1998).  Biological  sequence  analysis:  Probabilistic
               SIT3010  INTRODUCTION TO DATA MINING                models  of  proteins  and  nucleic  acids.  Cambridge:
                                                                   Cambridge University Press.
               Description: Introduction to statistical methods and tools for   3.    Ewens,  W.J.,  &  Grant,  G.R.  (2005).  Statistical
                                                                                                     nd
               analysis of very large data sets and discovery of interesting   methods  in  bioinformatics:  An  introduction  (2   ed.).
               and unexpected relationships in the data.           New York: Springer.
                                                               4.    Pevsner,  J.  (2009).  Bioinformatics  and  functional
                                                                            nd
               Data preprocessing and exploration: data quality and data   genomics (2  ed.). New York: Wiley-Blackwell.
               cleaning.  Data  exploration:  summarizing  and  visualizing
               data;  principal  component,  multidimensional  scaling.  Data
               analysis  and  uncertainty:  handling  uncertainty;  statistical   SIT3012   DESIGN AND ANALYSIS OF EXPERIMENTS
               inference; sampling.
                                                               Philosophy  related  to  statistical  designed  experiments.
               Statistical  approach  to  data  mining  and  data  mining   Analysis  of  variance.  Experiments  with  Blocking  factors.
               algorithms:  Regression,  Validation;  classification  and   Factorial experiments. Two level factorial designs. Blocking
               clustering: k-means, CART, decision trees; Artificial Neural   and confounding system for two-level factorials. Two-level
               Network;  boosting;  support  vector  machine;  association   fractional factorial designs.
               rules  mining.  Modelling:  descriptive  and  predictive
               modelling. Data organization.                   Assessment:
                                                               Continuous Assessment:       40%
               Assessment:                                     Final Examination:           60%
               Continuous Assessment:       40%
               Final Examination:           60%                Medium of Instruction:
                                                               English



                                                           143
   151   152   153   154   155   156   157   158   159   160   161