InfluxDB / Features / Lakehouse / Warehouse Open Data Access
Lakehouse / Warehouse Open Data Access
Unlock real-time analytics with InfluxDB and plug into your existing data lakehouse using Apache Iceberg.
Built for data teams
What is open data access and how does it work?
Bring specialized time series data handling and real-time analytics to your operations data and enable zero-copy, no-ETL data sharing and interoperability with your existing data lakehouses and warehouses. Bridge the gap between real-time operations and analytical data tools, including lakehouses, by virtualizing data access to InfluxDB with Apache Iceberg.
InfluxDB offers high-performance data ingestion, real-time querying, and built-in functions for time series analysis. It persists data on commodity storage in an open file format known as Apache Parquet, and its catalog is abstracted to enable data access virtualization via an open table format, such as Apache Iceberg, Delta-sharing, etc.
Real-time operational analytics
InfluxDB’s columnar, in-memory tier enables sub-second query responses so you can power real-time use cases like operational event analytics, threat monitoring, gaming analytics, and more.
Hybrid data persistence
Time series data at scale can accumulate quickly, leading to massive datasets with cardinality concerns. InfluxDB is optimized for efficient storage and partitioning strategies to handle time series data at any scale and cardinality. Leverage InfluxDB for time series operational workloads while using data access virtualization to train AI/ML models and run advanced analytics in your existing data lakehouses.
Lower total cost of ownership
Data access virtualization allows direct data access to Parquet files without any data movement or need to hold multiple copies of the data, which helps lower costs by reducing replication, transfer, and storage costs. The lack of any ETL increases operational efficiency, so you can do more while using fewer resources.
Customers
Startups and Fortune 500 enterprises are building applications with InfluxDB.
Frederik Van Leekwyck, Business Development and Marketing Manager, Factry.io
Looking for The Most Efficient Way to Get Started with InfluxDB?
Whether you’re looking for cost savings, lower management overhead while maintaining high availability, or to optimize efficiency, InfluxDB can help. Find the Best Way to Start
How Time Series Databases and Data Lakes Work Together
Imagine you're working with streams of data that requires rapid analysis and storage for long-term insights. This is where the powerful duo of time series databases (TSDBs) and data lakes can help. Explore Article
Why Choose a Purpose-Built Time Series Database?
Details on what makes InfluxDB different from other propose-built solutions and a dive into horizontal use cases built with time series data. Download Paper
Time Series Analytics
Ready to optimize your time series workloads? Ensure you have the basics right first. Download Paper
Data Lakehouses Explained
Read a comprehensive guide explaining data lakehouses, a new data management architecture that combines concepts from data lakes and data warehouses. Explore Article
Easy Data Collection with Telegraf
Telegraf is a plugin-driven server agent written in Go for collecting metrics & data on the system. Download the latest Telegraf for free! Learn More
Real-Time Analytics
Engineered to give developers nanosecond precision when collecting and querying time series data. Learn More