REAL-TIME DATA WAREHOUSING: PERFORMANCE INSIGHTS OF SEMI-STREAM JOINS USING MONGODB
Keywords:
Real-time data warehousing, MongoDB, semi-stream joins, ETL (Extraction-Transformation-Loading), NoSQL databases, structured dataAbstract
The speed of processing of MongoDB for real-time data warehousing is examined in this work, with a special emphasis
on semi-stream join processing during the extraction, transformation, and loading (ETL) stage. Decision-making can
be slowed down by traditional data warehouses' frequent problems with timely updates and rapid data retrieval. With
speedier data access and ongoing updates, real-time data warehousing seeks to overcome these problems. We assess
MongoDB's efficiency in processing structured and unstructured data streams, and our tests show that it can handle
high-velocity data while maintaining constant memory and CPU utilisation. The results show that MongoDB works
effectively in real-time data contexts, handling different kinds and amounts of data without causing appreciable
performance reduction.
