InfluxDB / Features / Lakehouse / Warehouse Open Data Access

Lakehouse / Warehouse Open Data Access

Unlock real-time analytics with InfluxDB and plug into your existing data lakehouse using Apache Iceberg.

Built for data teams


What is open data access and how does it work?

Bring specialized time series data handling and real-time analytics to your operations data and enable zero-copy, no-ETL data sharing and interoperability with your existing data lakehouses and warehouses. Bridge the gap between real-time operations and analytical data tools, including lakehouses, by virtualizing data access to InfluxDB with Apache Iceberg.

InfluxDB offers high-performance data ingestion, real-time querying, and built-in functions for time series analysis. It persists data on commodity storage in an open file format known as Apache Parquet, and its catalog is abstracted to enable data access virtualization via an open table format, such as Apache Iceberg, Delta-sharing, etc.

lakehouse-flat
DataLake-Warehouse

Real-time operational analytics

InfluxDB’s columnar, in-memory tier enables sub-second query responses so you can power real-time use cases like operational event analytics, threat monitoring, gaming analytics, and more.

Hybrid data persistence

Time series data at scale can accumulate quickly, leading to massive datasets with cardinality concerns. InfluxDB is optimized for efficient storage and partitioning strategies to handle time series data at any scale and cardinality. Leverage InfluxDB for time series operational workloads while using data access virtualization to train AI/ML models and run advanced analytics in your existing data lakehouses.

Lower total cost of ownership

Data access virtualization allows direct data access to Parquet files without any data movement or need to hold multiple copies of the data, which helps lower costs by reducing replication, transfer, and storage costs. The lack of any ETL increases operational efficiency, so you can do more while using fewer resources.

DataLake-Warehouse-1800px
DataLake-Warehouse-Mobile-760px

Customers


Startups and Fortune 500 enterprises are building applications with InfluxDB.

Before Factry, VEEMO had to log in via remote dekstop into each individual SCADA system per wind farm to have a look at how the turbines were doing. InfluxDB is extremely easy to setup, requires no external dependencies, had a SQL-like query syntax, and is fully open source.

Frederik Van Leekwyck, Business Development and Marketing Manager, Factry.io
BENCHMARKS

Looking for The Most Efficient Way to Get Started with InfluxDB?

Whether you’re looking for cost savings, lower management overhead while maintaining high availability, or to optimize efficiency, InfluxDB can help. Find the Best Way to Start

BLOG

How Time Series Databases and Data Lakes Work Together

Imagine you're working with streams of data that requires rapid analysis and storage for long-term insights. This is where the powerful duo of time series databases (TSDBs) and data lakes can help. Explore Article

TECHNICAL PAPER

Why Choose a Purpose-Built Time Series Database?

Details on what makes InfluxDB different from other propose-built solutions and a dive into horizontal use cases built with time series data. Download Paper

TECHNICAL PAPER

Time Series Analytics

Ready to optimize your time series workloads? Ensure you have the basics right first. Download Paper

BLOG

Data Lakehouses Explained

Read a comprehensive guide explaining data lakehouses, a new data management architecture that combines concepts from data lakes and data warehouses. Explore Article

INTEGRATIONS

Easy Data Collection with Telegraf

Telegraf is a plugin-driven server agent written in Go for collecting metrics & data on the system. Download the latest Telegraf for free! Learn More

GET STARTED

Real-Time Analytics

Engineered to give developers nanosecond precision when collecting and querying time series data. Learn More

Talk with an Expert