Paper ReviewComputer SystemsDesign Science Research

HTAP: The End of the Analytics-Transactions Divide in Enterprise Databases

For decades, enterprises maintained separate databases for transactions (OLTP) and analytics (OLAP). HTAP systems promise to unify themโ€”processing real-time transactions and complex analytics on the same data store. Kim et al. show how application-database co-design makes this practical.

By Sean K.S. Shin
This blog summarizes research trends based on published paper abstracts. Specific numbers or findings may contain inaccuracies. For scholarly rigor, always consult the original papers cited in each post.

Enterprise data architecture has been built on a fundamental separation for three decades: transactional systems (OLTP) that process real-time business operationsโ€”orders, payments, inventory updatesโ€”and analytical systems (OLAP) that process complex queries for business intelligenceโ€”trends, forecasts, anomaly detection. Data flows from OLTP to OLAP through ETL (Extract, Transform, Load) pipelines that introduce latency ranging from hours to days.

This separation made engineering sense when memory was expensive and workloads were predictable. But in 2025, the business demand is for real-time analytics on live transactional dataโ€”detecting fraud as it happens, adjusting pricing based on current demand, optimizing supply chains in response to live inventory levels. The ETL delay that was acceptable for monthly reporting is unacceptable for real-time decision-making.

Hybrid Transactional/Analytical Processing (HTAP) systems promise to eliminate this delay by processing both workload types on the same data store. Kim et al. demonstrate how this unification requires not just database innovation but co-innovation between applications and databasesโ€”a collaboration that challenges traditional boundaries between application development and database engineering.

The Virtual Data Model Approach

Kim et al.'s central contribution is the concept of a virtual data model that bridges the semantic gap between application logic and database operations. Traditional applications interact with databases through SQL queries that specify how to retrieve data. Virtual data models specify what data the application needs, allowing the database to optimize how it is retrieved based on its knowledge of data layout, indexing, and workload patterns.

This inversionโ€”from application-driven query specification to database-driven query optimizationโ€”is particularly valuable for HTAP because the optimal query strategy depends on whether the workload is transactional or analytical:

  • Transactional queries access specific rows by primary keyโ€”they benefit from row-oriented storage and index lookup
  • Analytical queries scan large column rangesโ€”they benefit from columnar storage and parallel scan
An HTAP system with a virtual data model can transparently choose the optimal access strategy for each query, maintaining a dual representation (row-store for transactions, column-store for analytics) and routing queries to the appropriate representation without application awareness.

Cloud-Native HTAP

Sundaramoorthy et al. extend the HTAP concept to cloud-native environments, where the additional challenge is dynamic resource allocation. Cloud databases must scale transaction processing and analytical processing independentlyโ€”a spike in analytical queries should not degrade transactional performance, and vice versa.

Their autonomous scaling approach uses workload classification to dynamically allocate compute resources between OLTP and OLAP workloads. Machine learning models predict workload patterns from historical data and pre-allocate resources before demand spikes occurโ€”a proactive approach that avoids the latency of reactive autoscaling.

Claims and Evidence

<
ClaimEvidenceVerdict
HTAP eliminates the ETL delay for real-time analyticsArchitectural demonstration with SAP HANA-style systemsโœ… Supported
Application-database co-design improves HTAP performanceKim et al. demonstrate virtual data model benefitsโœ… Supported
Cloud-native HTAP can independently scale OLTP and OLAPSundaramoorthy et al. propose autonomous scaling frameworkโš ๏ธ Framework described, limited deployment evidence
HTAP replaces the need for separate OLTP and OLAP systemsSome workloads still benefit from separation; HTAP adds complexityโš ๏ธ Depends on workload
Current HTAP systems handle all enterprise workload patternsComplex analytical workloads may still require dedicated OLAPโš ๏ธ Workload-dependent

Open Questions

  • Workload interference: Even with resource isolation, HTAP systems must prevent analytical queries from degrading transactional performance. What level of isolation is sufficient, and what is the overhead?
  • Consistency semantics: Should analytical queries see the absolute latest transactional state, or is slight staleness acceptable? The answer affects both performance (strong consistency is expensive) and business logic (some analytics require exact consistency).
  • Migration path: Enterprises have massive investments in separate OLTP/OLAP infrastructure. What is the practical migration path to HTAP, and how do you maintain operations during transition?
  • Cost comparison: HTAP systems require more sophisticated (and expensive) database technology than separate OLTP and OLAP. Under what conditions does the elimination of ETL infrastructure offset the higher database cost?
  • Query optimization complexity: HTAP query optimizers must handle a much wider range of workload types than either OLTP or OLAP optimizers alone. Does this increased complexity lead to more optimization errors?
  • What This Means for Your Research

    For database researchers, HTAP co-design (Kim et al.) opens a research direction that requires expertise in both database internals and application architectureโ€”a combination that traditional database research rarely demands. The virtual data model concept is generalizable beyond HTAP to any setting where applications and databases must co-evolve.

    For enterprise architects, HTAP represents a genuine architectural simplificationโ€”fewer moving parts, no ETL pipelines, real-time analytics capability. But adoption requires careful evaluation of workload patterns, consistency requirements, and the organizational readiness to manage a more sophisticated database platform.

    For the data engineering community, the HTAP trend has implications for ETL tool providers, data warehouse vendors, and the practitioners who build and maintain data pipelines. If HTAP succeeds at scale, a significant portion of current data engineering infrastructure becomes unnecessaryโ€”a creative destruction that the industry should anticipate.

    References (2)

    [1] Kim, K., Kim, H., Lee, J. et al. (2025). Enterprise Application-Database Co-Innovation for Hybrid Transactional/Analytical Processing: A Virtual Data Model and Its Query Optimization Needs. ACM SIGMOD.
    [2] Sundaramoorthy, P., Parikh, M., Keshireddy, S. (2025). Adaptive Resource Management in Cloud-Native Databases: A Study of Autonomous Scaling and Query Optimization. IEEE ICCES.

    Explore this topic deeper

    Search 290M+ papers, detect research gaps, and find what hasn't been studied yet.

    Click to remove unwanted keywords

    Search 7 keywords โ†’