首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Unifying telescope and microscope: A multi-lens framework with open data for modeling emerging events
Institution:1. AGH University of Science and Technology, 30 Mickiewicza Ave, Kraków 30-059, Poland;2. VSB Technical University of Ostrava, 17. listopadu 2172/15, Ostrava-Poruba 708 00, Czech Republic;1. School of Information and Communication Engineering, Hunan Institute of Science and Technology, Hunan, China;2. Machine Vision & Artificial Intelligence Research Center, Hunan Institute of Science and Technology, Hunan, China;1. School of Economics and Management, Chang''an University, Xi''an 710064, China;2. Computer & Information Sciences Department, Universiti Teknologi PETRONAS, 32610 Seri Iskandar, Perak, Malaysia;3. Institute of IR4.0, Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia;4. College of Engineering, Al Ain University, Al Ain, United Arab Emirates;5. Department of Mathematics, College of Science, Tafila Technical University, Tafila, Jordan
Abstract:Open data is becoming ubiquitous as governments, companies, and even individuals have the option to offer more or less unrestricted access to their non-sensitive data. The benefits of open data, such as accessibility and transparency, have motivated and enabled a large number of research studies and applications in both academia and industry. However, each open data only offers a single perspective, and its potential inherent limitations (e.g., demographic biases) may lead to poor decisions and misjudgments. This paper discusses how to create and use multiple digital lenses empowered by open data, including census data (macro lens), search logs (meso lens), and social data (micro lens), to investigate general real-world events. To reveal the unique angles and perspectives brought by each open lens, we summarize and compare the underpinning open data from eleven dimensions, such as utility, data volume, dynamic variability, and demographic fairness. Then, we propose an easy-to-use and generalized open data driven framework, which automatically retrieves multi-source data, extracts features, and trains machine learning models for the event specified by answering what, when, and where questions. With low labor efforts, the framework’s generalization and automation capabilities guarantee an instant investigation of general events and phenomena, such as disasters, sports events, and political activities. We also conduct two case studies, i.e., the COVID-19 pandemic and Great American Eclipse (see Appendix), to demonstrate its feasibility and effectiveness at different time granularities.
Keywords:Open data  Data retrieval  Data fusion  Model fusion  Google Trends  Twitter
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号