• Five Themes

    Underpinning MySIG’s research themes on big data is our formidable range of expertise, comprising outstanding problem- and curiosity-driven research by individuals and teams. Research in the following themes takes place across many of the University of Malaya’s research clusters. These themes are evolving and not mutually exclusive, as befits an institution seeking to apply the breadth of its research to major national and global problems.


We focus our research on five major themes:

Big Data Databases and Analytics

We are developing new software platforms for storing and processing massive amounts of data and for applying analytics beyond what conventional relational systems can do. We see a “sea change” happening as analysis moves from simple SQL aggregation to much more complex routines for data clustering, predictive modeling, and complex statistics. These routines rest on linear algebra operations, which are defined on arrays rather than tables, so relational systems handle them poorly. We are therefore focused on building array-oriented DBMSes. In addition, we are investigating graph-based DBMSes for social-network-style analysis.
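To make the array-versus-table point concrete, the following Python sketch multiplies two matrices twice: once over a relational-style encoding of (row, column, value) tuples, which amounts to a join followed by a grouped sum, and once directly on nested arrays. The function names are hypothetical and chosen only for illustration; this is not MySIG code and does not represent any particular DBMS.

```python
from collections import defaultdict


def to_tuples(matrix):
    """Encode a dense matrix as relational-style (row, col, value) tuples."""
    return [(i, j, v) for i, row in enumerate(matrix) for j, v in enumerate(row)]


def matmul_relational(a, b):
    """Multiply matrices stored as (row, col, value) tuples.

    Roughly equivalent to:
      SELECT a.row, b.col, SUM(a.value * b.value)
      FROM a JOIN b ON a.col = b.row
      GROUP BY a.row, b.col
    """
    # Index b by its row so the join on a.col = b.row is a lookup.
    b_by_row = defaultdict(list)
    for r, c, v in b:
        b_by_row[r].append((c, v))

    # Join, then accumulate the grouped sums.
    out = defaultdict(float)
    for i, k, av in a:
        for j, bv in b_by_row[k]:
            out[(i, j)] += av * bv
    return dict(out)


def matmul_array(a, b):
    """Multiply matrices stored as nested lists (array-style access)."""
    inner, cols = len(b), len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(len(a))
    ]


A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
product_rel = matmul_relational(to_tuples(A), to_tuples(B))
product_arr = matmul_array(A, B)
```

Both versions compute the same product, but the tuple encoding has to rebuild the array structure through a join and a group-by on every operation, whereas the array version addresses elements by position.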

Big Data Math and Algorithms

We are designing and implementing algorithms for linear algebra, signal processing, search, and machine learning that scale to tens or hundreds of machines and petabytes of data. To date, most algorithm work has focused on complexity issues, assuming that the data is resident in main memory and that the algorithm runs on a single thread. Our focus here is therefore on algorithm development for parallel execution and for data that does not necessarily fit in main memory.
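As a small illustration of an algorithm that neither needs all of its data in memory nor has to run on a single thread, the Python sketch below computes a mean and population variance from per-chunk summaries that are merged pairwise. The merge step uses the standard formula for combining partial means and sums of squared deviations. The function names and the chunked input are assumptions made for this example, not MySIG code.

```python
from typing import Iterable, Sequence, Tuple

# A summary is (count, mean, sum of squared deviations from the mean).
Summary = Tuple[int, float, float]


def summarize(chunk: Sequence[float]) -> Summary:
    """Summarize one chunk that is small enough to hold in memory."""
    n = len(chunk)
    if n == 0:
        return (0, 0.0, 0.0)
    mean = sum(chunk) / n
    m2 = sum((x - mean) ** 2 for x in chunk)
    return (n, mean, m2)


def merge(a: Summary, b: Summary) -> Summary:
    """Combine two summaries without revisiting the underlying data."""
    na, mean_a, m2a = a
    nb, mean_b, m2b = b
    n = na + nb
    if n == 0:
        return (0, 0.0, 0.0)
    delta = mean_b - mean_a
    mean = mean_a + delta * nb / n
    m2 = m2a + m2b + delta * delta * na * nb / n
    return (n, mean, m2)


def mean_and_variance(chunks: Iterable[Sequence[float]]) -> Tuple[float, float]:
    """Return (mean, population variance) over all chunks, one chunk at a time."""
    total: Summary = (0, 0.0, 0.0)
    for chunk in chunks:
        total = merge(total, summarize(chunk))
    n, mean, m2 = total
    if n == 0:
        raise ValueError("no data")
    return mean, m2 / n


mean, variance = mean_and_variance([[1.0, 2.0], [3.0, 4.0, 5.0]])
```

Because `merge` depends only on the two summaries, chunks could be summarized on different machines and the results combined in any order.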

Big Data Visualization

We are designing visualizations and interfaces that allow users to interact with massive data sets on displays ranging from phones to video walls. We assume that a DBMS sits behind such a visualization program. When the visualization system runs a query, it may get back a fire hose of data that it was not expecting, so visualizations have to scale to large amounts of data. We also have to find ways to speed up visualization systems through prefetching and caching.

Big Data Architecture

We are working to understand how next-generation hardware innovations, such as many-core chips, non-volatile random-access memories, and reconfigurable hardware, affect the design of data processing systems. A significant fraction of computing cycles goes to supporting Big Data, so it is important to optimize computer architectures for this task. This extends to memory systems as well as specialized chips, such as collections of GPUs.

Streaming Big Data

We are building data processing systems that facilitate rapid ingest and processing of data streams. Behind every Big Data problem is a “Big Velocity” problem that requires data ingest and data conditioning at high rates, including the ability to aggregate data at high speed and load it into database management systems.
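One common way to aggregate a stream before loading it into a database is a tumbling window, which groups events into fixed, non-overlapping time intervals. The Python sketch below is a minimal illustration under stated assumptions: events arrive as (timestamp, key, value) tuples in non-decreasing timestamp order, and the function name is hypothetical. It is not a description of MySIG's systems.

```python
from typing import Dict, Iterable, Iterator, Tuple

Event = Tuple[int, str, float]  # (timestamp in seconds, key, value)


def tumbling_window_sums(
    events: Iterable[Event], window_seconds: int
) -> Iterator[Tuple[int, Dict[str, float]]]:
    """Yield (window_start, per-key sums) for each non-empty window.

    Assumes events are ordered by timestamp, so a window can be
    emitted as soon as an event from a later window arrives.
    """
    current_start = None
    sums: Dict[str, float] = {}
    for ts, key, value in events:
        start = (ts // window_seconds) * window_seconds
        if current_start is None:
            current_start = start
        elif start != current_start:
            # The previous window is complete; hand it downstream.
            yield current_start, sums
            current_start, sums = start, {}
        sums[key] = sums.get(key, 0.0) + value
    if current_start is not None:
        yield current_start, sums


stream = [(0, "a", 1.0), (3, "b", 2.0), (4, "a", 1.5), (10, "a", 4.0)]
windows = list(tumbling_window_sums(stream, window_seconds=5))
```

Only the sums for the current window are held in memory, so each batch that reaches the database is already reduced to one row per key per window.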


Join Us

Malaysia Special Interest Groups (MySIG) on Big Data are open to all who are interested. To join a Special Interest Group and gain access to the collaborative materials it produces, call us at +603-79676342 or email your resume to


Contact Us

  • Faculty of Computer Science & Information Technology,
  • University of Malaya,
  • 50603 Kuala Lumpur, Malaysia
  • Phone: +603-79676342      Fax: 03-79579249
  • Email:
