InfluxDB / Features / Lakehouse / Warehouse Open Data Access

Integrate Time Series with Data Lakes and Warehouses

InfluxDB 3’s Apache Iceberg integration brings a new level of power and efficiency to your data stack.
Purpose-built for complex time series data with easy integration into your broader data ecosystem.

Talk with an Expert

Open data access for time series

Efficiently store, transfer, and analyze time series data with open data access. InfluxDB uses Apache Parquet for high-compression storage and provides an Iceberg integration for seamless interoperability with data lakes and warehouses—no ETL required. By combining Parquet, Iceberg, and object storage, InfluxDB ensures all ingested data is directly accessible to other big data query engines.

Improve AI/ML Models

Combine time series data with historical data for better AI/ML model training and anomaly detection.

Eliminate ETL Overhead

Avoid costly data replication and complex data pipelines by making time series data available directly in lakehouses.

Lower Total Cost of Ownership

Slash storage costs by eliminating the need to keep high-resolution time series data in lakehouses.

Build better AI/ML models with high-volume time series data and historical datasets

Traditional lakehouses can't keep up with time series data, driving up costs. InfluxDB's Iceberg integration allows data lakes and warehouses direct access to time series data in InfluxDB, combining live and historical data without rewriting it to your warehouse. The result? Faster performance, lower costs, and more precise AI/ML models and anomaly detection for deeper insights.

Seamlessly integrate with your stack

Plug into your existing data lakehouse using Apache Iceberg

Built for data teams

What is open data access and how does it work?

Bring specialized time series data handling and real-time analytics to your operations data and enable zero-copy, no-ETL data sharing and interoperability with your existing data lakehouses and warehouses. Bridge the gap between real-time operations and analytical data tools, including lakehouses, by virtualizing data access to InfluxDB with Apache Iceberg.

InfluxDB offers high-performance data ingestion, real-time querying, and built-in functions for time series analysis. It persists data on commodity storage in an open file format known as Apache Parquet, and its catalog is abstracted to enable data access virtualization via an open table format, such as Apache Iceberg, Delta-sharing, etc.

Real-time operational analytics

InfluxDB’s columnar, in-memory tier enables sub-second query responses so you can power real-time use cases like operational event analytics, threat monitoring, gaming analytics, and more.

Hybrid data persistence

Time series data at scale can accumulate quickly, leading to massive datasets with cardinality concerns. InfluxDB is optimized for efficient storage and partitioning strategies to handle time series data at any scale and cardinality. Leverage InfluxDB for time series operational workloads while using data access virtualization to train AI/ML models and run advanced analytics in your existing data lakehouses.

Lower total cost of ownership

Data access virtualization allows direct data access to Parquet files without any data movement or need to hold multiple copies of the data, which helps lower costs by reducing replication, transfer, and storage costs. The lack of any ETL increases operational efficiency, so you can do more while using fewer resources.

Customers

Startups and Fortune 500 enterprises are building applications with InfluxDB.

Before Factry, VEEMO had to log in via remote dekstop into each individual SCADA system per wind farm to have a look at how the turbines were doing. InfluxDB is extremely easy to setup, requires no external dependencies, had a SQL-like query syntax, and is fully open source.

Frederik Van Leekwyck, Business Development and Marketing Manager, Factry.io

BENCHMARKS

Looking for The Most Efficient Way to Get Started with InfluxDB?

Whether you’re looking for cost savings, lower management overhead while maintaining high availability, or to optimize efficiency, InfluxDB can help. <a class="video-link has-text-white" href="/lp/oss-vs-new-engine/" target="_blank">Find the Best Way to Start</a>

BLOG

How Time Series Databases and Data Lakes Work Together

Imagine you're working with streams of data that requires rapid analysis and storage for long-term insights. This is where the powerful duo of time series databases (TSDBs) and data lakes can help. <a href="/blog/TSDB-data-lakes-together/">Explore Article</a>

TECHNICAL PAPER

Why Choose a Purpose-Built Time Series Database?

Learn what makes InfluxDB different from other purpose-built solutions and dive into use cases based on time series data. <a class="video-link-old has-text-white-old" href="/time-series-technical-paper-2/"> Download Paper</a>

TECHNICAL PAPER

Time Series Analytics

Ready to optimize your time series workloads? Ensure you have the basics right first. <a href="/what-is-time-series-data/">Download Paper</a>

BLOG

Data Lakehouses Explained

Read a comprehensive guide explaining data lakehouses, a new data management architecture that combines concepts from data lakes and data warehouses. <a href="/blog/data-lakehouses-explained/">Explore Article</a>

INTEGRATIONS

Easy Data Collection with Telegraf

Telegraf is a plugin-driven server agent written in Go for collecting metrics & data on the system. Download the latest Telegraf for free! <a href="/time-series-platform/telegraf/">Learn More</a>

GET STARTED

Real-Time Analytics

Engineered to give developers nanosecond precision when collecting and querying time series data. <a href="/use-cases/real-time-analytics/">Learn More</a>

Integrate Time Series with Data Lakes and Warehouses

Open data access for time series

Improve AI/ML Models

Eliminate ETL Overhead

Lower Total Cost of Ownership

Build better AI/ML models with high-volume time series data and historical datasets

Seamlessly integrate with your stack

Built for data teams

What is open data access and how does it work?

Real-time operational analytics

Hybrid data persistence

Lower total cost of ownership

Customers

Looking for The Most Efficient Way to Get Started with InfluxDB?

How Time Series Databases and Data Lakes Work Together

Why Choose a Purpose-Built Time Series Database?

Time Series Analytics

Data Lakehouses Explained

Easy Data Collection with Telegraf

Real-Time Analytics

Talk with an Expert

Start using InfluxDB 3 powered by Apache Iceberg

Integrate InfluxDB 3 with Apache Iceberg + Snowflake

Virtual and Live InfluxDB Events

Product Training from InfluxDB University

Product & Solutions

Developers

Company

Integrate Time Series with Data Lakes and Warehouses

Open data access for time series

Improve AI/ML Models

Eliminate ETL Overhead

Lower Total Cost of Ownership

Build better AI/ML models with high-volume time series data and historical datasets

Seamlessly integrate with your stack

Built for data teams

What is open data access and how does it work?

Real-time operational analytics

Hybrid data persistence

Lower total cost of ownership

Customers

Looking for The Most Efficient Way to Get Started with InfluxDB?

How Time Series Databases and Data Lakes Work Together

Why Choose a Purpose-Built Time Series Database?

Time Series Analytics

Data Lakehouses Explained

Easy Data Collection with Telegraf

Real-Time Analytics

Talk with an Expert

Start using InfluxDB 3 powered by Apache Iceberg

Integrate InfluxDB 3 with Apache Iceberg + Snowflake

Virtual and Live InfluxDB Events

Product Training from InfluxDB University

Product & Solutions

Developers

Company

Follow Us