DNS and Thanos Integration

Powerful performance with an easy integration, powered by Telegraf, the open source data connector built by InfluxData.

info

This is not the recommended configuration for real-time query at scale. For query and compression optimization, high-speed ingest, and high availability, you may want to consider DNS and InfluxDB.

5B+

Telegraf downloads

#1

Time series database
Source: DB Engines

1B+

Downloads of InfluxDB

2,800+

Contributors

Table of Contents

Powerful Performance, Limitless Scale

Collect, organize, and act on massive volumes of high-velocity data. Any data is more valuable when you think of it as time series data. with InfluxDB, the #1 time series platform built to scale with Telegraf.

See Ways to Get Started

Input and output integration overview

The DNS plugin enables users to monitor and gather statistics on DNS query times, facilitating performance analysis of DNS resolutions.

This plugin sends metrics from Telegraf to Thanos using the Prometheus remote write protocol over HTTP, allowing efficient and scalable ingestion into Thanos Receive components.

Integration details

DNS

This plugin gathers DNS query times in milliseconds, utilizing the capabilities of DNS queries similar to the Dig command. It provides a means to monitor and analyze DNS performance by measuring the response time from specified DNS servers, allowing network administrators and engineers to ensure optimal DNS resolution times. The plugin can be configured to target specific servers and customize the types of records queried, encompassing various DNS features such as resolving domain names to IP addresses, or retrieving details from specific records as needed, while also clearly reporting on the success or failure of each query, alongside relevant metadata.

Thanos

Telegraf’s HTTP plugin can send metrics directly to Thanos via its Remote Write-compatible Receive component. By setting the data format to prometheusremotewrite, Telegraf can serialize metrics into the same protobuf-based format used by native Prometheus clients. This setup enables high-throughput, low-latency metric ingestion into Thanos, facilitating centralized observability at scale. It is particularly useful in hybrid environments where Telegraf is collecting metrics from systems outside Prometheus’ native reach, such as SNMP devices, Windows hosts, or custom apps, and streams them directly to Thanos for long-term storage and global querying.

Configuration

DNS

[[inputs.dns_query]]
  servers = ["8.8.8.8"]

  # network = "udp"

  # domains = ["."]

  # record_type = "A"

  # port = 53

  # timeout = "2s"

  # include_fields = []
  

Thanos

[[outputs.http]]
  ## Thanos Receive endpoint for remote write
  url = "http://thanos-receive.example.com/api/v1/receive"

  ## HTTP method
  method = "POST"

  ## Data format set to Prometheus remote write
  data_format = "prometheusremotewrite"

  ## Optional headers (authorization, etc.)
  # [outputs.http.headers]
  #   Authorization = "Bearer YOUR_TOKEN"

  ## Optional TLS configuration
  # tls_ca = "/path/to/ca.pem"
  # tls_cert = "/path/to/cert.pem"
  # tls_key = "/path/to/key.pem"
  # insecure_skip_verify = false

  ## Request timeout
  timeout = "10s"

Input and output integration examples

DNS

  1. Monitor DNS Performance for Multiple Servers: By deploying the DNS plugin, a user can simultaneously monitor the performance of different DNS servers, such as Google DNS and Cloudflare DNS, by specifying them in the servers array. This scenario enables comparisons of response times and reliability across different DNS providers, assisting in selecting the best option based on empirical data.

  2. Analyze Query Times for High-Traffic Domains: Integrate the plugin to measure response times specifically for high-traffic domains relevant to an organization’s operations, such as internal services or customer-facing sites. By focusing on performance metrics for these domains, organizations can proactively address latency issues, ensuring service reliability and improving user experiences.

  3. Alerting on DNS Timeouts: Utilize the plugin in combination with alerting systems to notify administrators whenever a DNS query exceeds a defined timeout threshold. This setup can help in proactive troubleshooting of networking issues or server misconfigurations, fostering a rapid response to potential downtime scenarios.

  4. Gather Historical Data for Performance Trends: Use the plugin to collect historical data on DNS query times over extended periods. This data can be used to analyze trends and patterns in DNS performance, enabling better capacity planning, identifying periodic issues, and justifying infrastructure upgrades or changes to DNS architectures.

Thanos

  1. Agentless Cloud Monitoring: Deploy Telegraf agents across cloud VMs to collect system and application metrics, then stream them directly into Thanos using Remote Write. This provides centralized observability without requiring Prometheus nodes at each location.

  2. Scalable Windows Host Monitoring: Use Telegraf on Windows machines to collect OS-level metrics and send them via Remote Write to Thanos Receive. This enables observability across heterogeneous environments with native Prometheus support only on Linux.

  3. Cross-Region Metrics Federation: Telegraf agents in multiple geographic regions can push data to region-local Thanos Receivers using this plugin. From there, Thanos can deduplicate and query metrics globally, reducing latency and network egress costs.

  4. Integrating Third-Party Data into Thanos: Collect metrics from custom telemetry sources such as REST APIs or proprietary logs using Telegraf inputs and forward them to Thanos via Remote Write. This brings non-native data into a Prometheus-compatible, long-term analytics pipeline.

Feedback

Thank you for being part of our community! If you have any general feedback or found any bugs on these pages, we welcome and encourage your input. Please submit your feedback in the InfluxDB community Slack.

Powerful Performance, Limitless Scale

Collect, organize, and act on massive volumes of high-velocity data. Any data is more valuable when you think of it as time series data. with InfluxDB, the #1 time series platform built to scale with Telegraf.

See Ways to Get Started

Related Integrations

HTTP and InfluxDB Integration

The HTTP plugin collects metrics from one or more HTTP(S) endpoints. It supports various authentication methods and configuration options for data formats.

View Integration

Kafka and InfluxDB Integration

This plugin reads messages from Kafka and allows the creation of metrics based on those messages. It supports various configurations including different Kafka settings and message processing options.

View Integration

Kinesis and InfluxDB Integration

The Kinesis plugin allows for reading metrics from AWS Kinesis streams. It supports multiple input data formats and offers checkpointing features with DynamoDB for reliable message processing.

View Integration