Indexing metadata

A novel feature engineering algorithm for air quality datasets


 
Dublin Core PKP Metadata Items Metadata for this Document
 
1. Title Title of document A novel feature engineering algorithm for air quality datasets
 
2. Creator Author's name, affiliation, country Raja Sher Afgun Usmani; Taylor’s University; Malaysia
 
2. Creator Author's name, affiliation, country Wan Nurul Farah Binti Wan Azmi; Taylor’s University; Malaysia
 
2. Creator Author's name, affiliation, country Akibu Mahmoud Abdullahi; Taylor’s University; Malaysia
 
2. Creator Author's name, affiliation, country Ibrahim Abaker Targio Hashem; Taylor’s University; Malaysia
 
2. Creator Author's name, affiliation, country Thulasyammal Ramiah Pillai; Taylor’s University; Malaysia
 
3. Subject Discipline(s) computer science; feature engineering; air quality; data science
 
3. Subject Keyword(s) Air pollution; Feature engineering; Data cleaning; Air quality; Air quality monitoring station; Data science
 
4. Description Abstract Feature engineering (FE) is one of the most important steps in data science research. FE provides useful features to be used later in the study. Due to climate change, the research focus is moving towards air quality estimation and the impacts of air pollution on health in Malaysia. Malaysia has 66 air quality monitoring (AQM) stations, and the air quality data for research is provided in an excel worksheet format by the Department of Environment, Malaysia. The data generated by the AQM stations is in a raw custom format, and it is virtually impossible to clean and engineer this data manually due to the sheer number of files. Hence, we propose a novel feature engineering algorithm to transform and combine this data into a useable format. The results show that the proposed feature engineering algorithm was able to efficiently extract and combine the hourly and daily values for pollutant and meteorological variables in useful row format. This algorithm will help all the researchers using the data from the AQM station in Malaysia as well as other countries using the same AQM station. The implementation of the feature engineering algorithm is also available to use at GitHub (https://github.com/rajasherafgun/featureengineeringaq) under AFL-3.0 license.
 
5. Publisher Organizing agency, location Institute of Advanced Engineering and Science
 
6. Contributor Sponsor(s) Taylor's University, Malaysia; Department of Environment, Malaysia
 
7. Date (YYYY-MM-DD) 2020-09-01
 
8. Type Status & genre Peer-reviewed Article
 
8. Type Type
 
9. Format File format PDF
 
10. Identifier Uniform Resource Identifier https://ijeecs.iaescore.com/index.php/IJEECS/article/view/21875
 
10. Identifier Digital Object Identifier (DOI) http://doi.org/10.11591/ijeecs.v19.i3.pp1444-1451
 
11. Source Title; vol., no. (year) Indonesian Journal of Electrical Engineering and Computer Science; Vol 19, No 3: September 2020
 
12. Language English=en en
 
14. Coverage Geo-spatial location, chronological period, research sample (gender, age, etc.)
 
15. Rights Copyright and permissions Copyright (c) 2020 Institute of Advanced Engineering and Science
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.