Proportion of students who complete primary, lower secondary, and upper secondary education at national and state level.
0 viewsΒ·0 downloads
This dataset is derived from administrative records maintained by the Ministry of Education Malaysia via the Education Management Information System (EMIS). Completion rates are calculated by comparing the number of students who complete a given education level with the total cohort that started that level.
Because Malaysia's dropout rates are extremely low, completion rates may exceed 100% due to a small number of students repeating grades or transferring between states. Furthermore, it should be noted that the data refers to government schools only.
β
Proportion of students who complete primary, lower secondary, and upper secondary education at national and state level.
Name in Dataset | Variable | Definition |
---|---|---|
date (Date) | Date | Date in YYYY-MM-DD format, with MM-DD set to 01-01 as the data is at annual frequency |
state (Categorical) | State | One of 16 states, or Malaysia for national-level data |
stage (Categorical) | School Stage | Either primary (Standards 1 to 6), lower secondary (Form 1 to 3), or upper secondary (Form 4 to 5) |
sex (Categorical) | Sex | Either both sexes, male, or female |
completion (Float) | Completion Rate | Proportion of students who complete that stage of education, relative to the total cohort that started that stage |
01 Sept 2024, 12:00
N/A
This data is made open under the Creative Commons Attribution 4.0 International License (CC BY 4.0). A copy of the license is available Here.
Full Dataset (CSV)
Recommended for individuals seeking an Excel-friendly format.
0
Full Dataset (Parquet)
Recommended for data scientists seeking to work with data via code.
0
Connect directly to the data with Python.
# If not already installed, do: pip install pandas fastparquet
import pandas as pd
URL_DATA = 'https://storage.data.gov.my/education/completion_school_state.parquet'
df = pd.read_parquet(URL_DATA)
if 'date' in df.columns: df['date'] = pd.to_datetime(df['date'])
print(df)
The following code is an example of how to make an API query to retrieve the data catalogue mentioned above. You can use different programming languages by switching the code accordingly. For a complete guide on possible query parameters and syntax, please refer to the official Open API Documentation.
import requests
import pprint
url = "https://api.data.gov.my/data-catalogue?id=completion_school_state&limit=3"
response_json = requests.get(url=url).json()
pprint.pprint(response_json)
Department of Statistics Malaysia
Β© 2024 Public Sector Open Data
Open Data
data.gov.my