A novel feature engineering algorithm for air quality datasets
Dublin Core | PKP Metadata Items | Metadata for this Document | |
1. | Title | Title of document | A novel feature engineering algorithm for air quality datasets |
2. | Creator | Author's name, affiliation, country | Raja Sher Afgun Usmani; Taylor’s University; Malaysia |
2. | Creator | Author's name, affiliation, country | Wan Nurul Farah Binti Wan Azmi; Taylor’s University; Malaysia |
2. | Creator | Author's name, affiliation, country | Akibu Mahmoud Abdullahi; Taylor’s University; Malaysia |
2. | Creator | Author's name, affiliation, country | Ibrahim Abaker Targio Hashem; Taylor’s University; Malaysia |
2. | Creator | Author's name, affiliation, country | Thulasyammal Ramiah Pillai; Taylor’s University; Malaysia |
3. | Subject | Discipline(s) | computer science; feature engineering; air quality; data science |
3. | Subject | Keyword(s) | Air pollution; Feature engineering; Data cleaning; Air quality; Air quality monitoring station; Data science |
4. | Description | Abstract | Feature engineering (FE) is one of the most important steps in data science research. FE provides useful features to be used later in the study. Due to climate change, the research focus is moving towards air quality estimation and the impacts of air pollution on health in Malaysia. Malaysia has 66 air quality monitoring (AQM) stations, and the air quality data for research is provided in an excel worksheet format by the Department of Environment, Malaysia. The data generated by the AQM stations is in a raw custom format, and it is virtually impossible to clean and engineer this data manually due to the sheer number of files. Hence, we propose a novel feature engineering algorithm to transform and combine this data into a useable format. The results show that the proposed feature engineering algorithm was able to efficiently extract and combine the hourly and daily values for pollutant and meteorological variables in useful row format. This algorithm will help all the researchers using the data from the AQM station in Malaysia as well as other countries using the same AQM station. The implementation of the feature engineering algorithm is also available to use at GitHub (https://github.com/rajasherafgun/featureengineeringaq) under AFL-3.0 license. |
5. | Publisher | Organizing agency, location | Institute of Advanced Engineering and Science |
6. | Contributor | Sponsor(s) | Taylor's University, Malaysia; Department of Environment, Malaysia |
7. | Date | (YYYY-MM-DD) | 2020-09-01 |
8. | Type | Status & genre | Peer-reviewed Article |
8. | Type | Type | |
9. | Format | File format | |
10. | Identifier | Uniform Resource Identifier | https://ijeecs.iaescore.com/index.php/IJEECS/article/view/21875 |
10. | Identifier | Digital Object Identifier (DOI) | http://doi.org/10.11591/ijeecs.v19.i3.pp1444-1451 |
11. | Source | Title; vol., no. (year) | Indonesian Journal of Electrical Engineering and Computer Science; Vol 19, No 3: September 2020 |
12. | Language | English=en | en |
14. | Coverage | Geo-spatial location, chronological period, research sample (gender, age, etc.) | |
15. | Rights | Copyright and permissions |
Copyright (c) 2020 Institute of Advanced Engineering and Science![]() This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. |