Statistics II - Syllabus

Course Overview and Structure

Embark on a profound academic exploration as you delve into the Statistics II course () within the distinguished Tribhuvan university's CSIT department. Aligned with the 2074 Syllabus, this course (STA210) seamlessly merges theoretical frameworks with practical sessions, ensuring a comprehensive understanding of the subject. Rigorous assessment based on a 60 + 20 + 20 marks system, coupled with a challenging passing threshold of , propels students to strive for excellence, fostering a deeper grasp of the course content.

This 3 credit-hour journey unfolds as a holistic learning experience, bridging theory and application. Beyond theoretical comprehension, students actively engage in practical sessions, acquiring valuable skills for real-world scenarios. Immerse yourself in this well-structured course, where each element, from the course description to interactive sessions, is meticulously crafted to shape a well-rounded and insightful academic experience.


Course objectives:

To impart the theoretical as well as practical knowledge of estimation, testing of hypothesis,

application of parametric and non-parametric statistical tests, design of experiments, multiple

regression analysis, and basic concept of stochastic process with special focus to data/problems

related with computer science and information technology.


Units

Key Topics

  • Sampling Distribution
    SA-1

    The distribution of a statistic obtained from multiple samples of a population. It is a fundamental concept in inferential statistics.

  • Sampling Distribution of Mean and Proportion
    SA-2

    The distribution of the sample mean and proportion, which are used to make inferences about the population mean and proportion.

  • Central Limit Theorem
    SA-3

    A fundamental theorem in statistics that states that the sampling distribution of the mean will be approximately normal, even if the population distribution is not normal.

  • Concept of Inferential Statistics
    SA-4

    The branch of statistics that deals with making inferences about a population based on a sample of data.

  • Estimation
    SA-5

    The process of making an educated guess about a population parameter based on a sample of data.

  • Methods of Estimation
    SA-6

    Different techniques used to estimate population parameters, such as maximum likelihood estimation and method of moments.

  • Properties of Good Estimator
    SA-7

    The characteristics of a good estimator, including unbiasedness, consistency, and efficiency.

  • Determination of Sample Size
    SA-8

    The process of determining the required sample size to achieve a desired level of precision in estimation.

  • Relationship of Sample Size with Desired Level of Error
    SA-9

    The relationship between the sample size and the desired level of error in estimation, including the concept of margin of error.

Key Topics

  • Types of Statistical Hypotheses
    TE-1

    This topic covers the different types of statistical hypotheses, including null and alternative hypotheses, and their roles in hypothesis testing.

  • Power of the Test and P-Value
    TE-2

    This topic explains the concept of power of the test, p-value, and its use in decision making during hypothesis testing.

  • Steps in Testing of Hypothesis
    TE-3

    This topic outlines the steps involved in testing a hypothesis, from formulating the hypothesis to making a decision based on the test results.

  • One Sample Tests for Mean of Normal Population
    TE-4

    This topic covers one sample tests for the mean of a normal population, including tests for known and unknown variance.

  • Test for Single Proportion
    TE-5

    This topic explains how to conduct a test for a single proportion, including the test statistic and p-value calculation.

  • Test for Difference between Two Means
    TE-6

    This topic covers the test for the difference between two means, including the test statistic and p-value calculation.

  • Test for Difference between Two Proportions
    TE-7

    This topic explains how to conduct a test for the difference between two proportions, including the test statistic and p-value calculation.

  • Paired Sample T-Test
    TE-8

    This topic covers the paired sample t-test, including its application and interpretation.

  • Linkage between Confidence Interval and Testing of Hypothesis
    TE-9

    This topic explains the relationship between confidence intervals and hypothesis testing, including how to use confidence intervals to make inferences about a population.

Key Topics

  • Chi-Square Test
    NO-1

    The Chi-Square test is a statistical test used to determine whether there is a significant association between two categorical variables. It is used to test the independence of two variables or to test whether the observed frequencies of a categorical variable match the expected frequencies.

  • Order Statistics
    NO-2

    Order statistics is a branch of statistics that deals with the arrangement of data in order of magnitude. It is used to describe the distribution of data and to make inferences about the population.

  • Run Test
    NO-3

    The Run test is a non-parametric test used to determine whether a sequence of data is random or not. It is used to test for randomness in a sequence of binary data.

  • Sign Test
    NO-4

    The Sign test is a non-parametric test used to compare the median of two related samples. It is used to test whether the median of one sample is significantly different from the median of another sample.

  • Wilcoxon Matched Pairs Signed Ranks Test
    NO-5

    The Wilcoxon Matched Pairs Signed Ranks test is a non-parametric test used to compare the median of two related samples. It is used to test whether the median of one sample is significantly different from the median of another sample.

  • Mann-Whitney U Test
    NO-6

    The Mann-Whitney U test is a non-parametric test used to compare the median of two independent samples. It is used to test whether the median of one sample is significantly different from the median of another sample.

  • Median Test
    NO-7

    The Median test is a non-parametric test used to compare the median of two or more samples. It is used to test whether the median of one sample is significantly different from the median of another sample.

  • Kolmogorov Smirnov Test (One Sample Case)
    NO-8

    The Kolmogorov Smirnov test is a non-parametric test used to compare the distribution of a sample to a known distribution. It is used to test whether the distribution of a sample is significantly different from a known distribution.

Key Topics

  • Multiple Correlation
    MU-1

    Introduction to multiple correlation, its concept, and application in statistics. Exploring the relationship between multiple variables.

  • Partial Correlation
    MU-2

    Understanding partial correlation, its concept, and application in statistics. Analyzing the relationship between two variables while controlling for other variables.

  • Introduction to Multiple Linear Regression
    MU-3

    Basic concepts and principles of multiple linear regression, including model formulation and estimation. Understanding the relationship between multiple independent variables and a dependent variable.

  • Hypothesis Testing of Multiple Regression
    MU-4

    Testing hypotheses in multiple regression, including significance testing and confidence intervals. Evaluating the overall fit and significance of the regression model.

  • Test of Significance of Regression
    MU-5

    Testing the overall significance of the regression model, including F-test and p-value interpretation. Determining whether the regression model is a good fit to the data.

  • Test of Individual Regression Coefficient
    MU-6

    Testing the significance of individual regression coefficients, including t-test and p-value interpretation. Evaluating the contribution of each independent variable to the regression model.

  • Model Adequacy Tests
    MU-7

    Evaluating the goodness of fit and adequacy of the multiple regression model, including residual analysis and diagnostic plots. Identifying potential issues and limitations of the model.

Key Topics

  • Designing Databases
    DE-1

    This topic covers the fundamentals of designing databases, including the relational database model, normalization, and transforming E-R diagrams into relations.

  • Designing Forms and Reports
    DE-2

    This topic focuses on designing forms and reports, including formatting and assessing usability to create effective user interfaces.

  • Designing Interfaces and Dialogues
    DE-3

    This topic explores the design of interfaces and dialogues, including interaction methods and devices, and designing interfaces and dialogues in graphical environments.

  • Implementation Issues
    DE-4

    Addressing common challenges and considerations that arise during the implementation phase of software development.

  • Open-Source Development
    DE-5

    Exploring the principles, benefits, and best practices of open-source software development.

  • Estimation of Missing Values
    DE-6

    Methods for estimating missing values in experimental designs, including CRD and RBD.

  • Advantages and Disadvantages of CRD and RBD
    DE-7

    Discussion of the benefits and drawbacks of Completely Randomized Design (CRD) and Randomized Block Design (RBD).

  • Latin Square Design (LSD)
    DE-8

    A type of experimental design that accounts for two blocking factors to reduce variability.

  • Statistical Analysis of LSD
    DE-9

    Methods for analyzing data from a Latin Square Design (LSD) experiment.

  • Efficiency of LSD relative to RBD
    DE-10

    Comparison of the efficiency of Latin Square Design (LSD) and Randomized Block Design (RBD).

  • Advantages and Disadvantages of LSD
    DE-11

    Discussion of the benefits and drawbacks of Latin Square Design (LSD).

Key Topics

  • State Management on Stateless HTTP
    ST-1

    Understanding state management in stateless HTTP protocol and its implications on ASP.NET Core application development.

  • Server-side Strategies
    ST-2

    Exploring server-side strategies for state management in ASP.NET Core applications, including Session State, TempData, and Using HttpContext.

  • Session State
    ST-3

    Using Session State to store and manage user data in ASP.NET Core applications.

  • TempData
    ST-4

    Using TempData to store and manage temporary data in ASP.NET Core applications.

  • Using HttpContext
    ST-5

    Using HttpContext to access and manage HTTP request and response data in ASP.NET Core applications.

  • Cache
    ST-6

    Using Cache to store and manage frequently accessed data in ASP.NET Core applications.

  • Client-side Strategies
    ST-7

    Exploring client-side strategies for state management in ASP.NET Core applications, including Cookies, Query Strings, and Hidden Fields.

Lab works


S. No.

Title of the practical problems 
(Using any statistical software such as SPSS, STATA etc. whichever 
convenient). 

No. of

practical

problems
1Sampling distribution, random number generation, and computation of 
sample size 
1
2Methods of estimation(including interval estimation) 
1
3
Parametric tests (covering most of the tests) 
3
4Non-parametric test(covering most of the tests) 
3
5Partial correlation 
1
6Multiple regression 
1
7Design of Experiments 
3
9Stochastic process 
2

Total number of practical problems 
      15