Setup
Capturing web logs for Honeycomb requires:
- installing our agent, honeytail
- configuring it to parse your NGINX logs correctly
- launching honeytail
Install the Agent
Download and install the latest honeytail by running:
- deb-amd64
- deb-arm64
- rpm
- bin-linux-amd64
- bin-linux-arm64
- bin-darwin-amd64
- source
Download the honeytail_1.10.0_amd64.deb package. Verify the package. Install the package.

The packages install honeytail, its config file /etc/honeytail/honeytail.conf, and some start scripts.
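The exact commands depend on your package source, but the deb flow might look like this sketch (the download URL lives on the Honeycomb site and is not reproduced here):

```shell
# After downloading honeytail_1.10.0_amd64.deb from the downloads page:
sha256sum honeytail_1.10.0_amd64.deb   # compare against the published checksum to verify
sudo dpkg -i honeytail_1.10.0_amd64.deb
```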
Build honeytail from source if you need it in an unpackaged form or for ad-hoc use.

In /etc/honeytail/honeytail.conf, set:
- ParserName to nginx
- WriteKey to your API key, available from the account page
- LogFiles to the path for the log file you want to ingest. For NGINX, this is typically /var/log/nginx/access.log.
- Dataset to the name of the dataset you wish to create with this log file.
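Putting those options together, the relevant honeytail.conf entries might look like this sketch (the section name and the nginx-access dataset name are assumptions; substitute your own values):

```ini
; Sketch of /etc/honeytail/honeytail.conf -- section name is an assumption
[Required Options]
ParserName = nginx
WriteKey = YOUR_API_KEY
LogFiles = /var/log/nginx/access.log
Dataset = nginx-access
```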
Identify Log Locations + Formats
Make sure to run through Optional Configuration below before running honeytail, in order to get the richest metadata out of your web traffic and into your logs.
In addition to the standard configuration captured in /etc/honeytail/honeytail.conf, you will want to set the two options in the Nginx Parser Options section:
- ConfigFile: the path to your NGINX config file, or whichever part of it contains the definition for the log format
- LogFormatName: the name of the log format used to produce the NGINX access log file
For example, say your NGINX config file lives at /etc/nginx/nginx.conf and has the following snippet:
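A log_format definition of that shape might look like this sketch (field list abbreviated):

```nginx
http {
    log_format my_favorite_format '$remote_addr - $remote_user [$time_local] '
                                  '"$request" $status $bytes_sent';
    access_log /var/log/nginx/access.log my_favorite_format;
}
```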
ConfigFile should be set to /etc/nginx/nginx.conf and your LogFormatName value should be set to my_favorite_format.
Alternatively, configure honeytail to read the NGINX logs using command-line parameters:
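A one-shot invocation might look like this sketch (--nginx.conf is assumed to be the flag for the parser's config-file option, and the nginx-access dataset name is an example):

```shell
honeytail --parser=nginx \
  --writekey=YOUR_API_KEY \
  --file=/var/log/nginx/access.log \
  --dataset=nginx-access \
  --nginx.conf=/etc/nginx/nginx.conf \
  --nginx.format=my_favorite_format
```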
Launch the Agent
Start up a honeytail process using upstart or systemd, or by launching the process by hand.
- upstart
- systemd
- manual
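Assuming the start scripts installed by the packages, launching might look like the following sketch (the -c config-file flag for the manual invocation is an assumption):

```shell
# systemd
sudo systemctl start honeytail

# upstart
sudo start honeytail

# manual, reading the standard config file (-c flag assumed)
honeytail -c /etc/honeytail/honeytail.conf
```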
Backfilling Archived Logs
In addition to getting current logs flowing, you can backfill old logs into Honeycomb to kickstart your dataset. By running honeytail from the command line, you can import old logs separately from tailing your current logs.
Adding the --backfill flag to honeytail adjusts a number of settings to make it appropriate for backfilling old data, such as stopping when it gets to the end of the log file instead of the default behavior of waiting for new content (like tail).
The specific locations on your system may vary from ours, but once you fill in your system’s values instead of our examples, you can backfill using this command:
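A backfill invocation might look like this sketch (the rotated-log path and the nginx-access dataset name are examples):

```shell
honeytail --parser=nginx \
  --writekey=YOUR_API_KEY \
  --file=/var/log/nginx/access.log.1 \
  --dataset=nginx-access \
  --nginx.conf=/etc/nginx/nginx.conf \
  --nginx.format=my_favorite_format \
  --backfill
```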
honeytail does not unzip log files, so you will need to uncompress them before backfilling.
The easiest way is to pipe the uncompressed logs to STDIN: zcat *.gz | honeytail --file - --backfill --all-the-other-flags.

Troubleshooting
Check out honeytail Troubleshooting for debugging tips.
Optional Configuration
Nginx logs can be an incredibly powerful, high-level view of your system, especially if they are configured correctly and enriched with custom, application-specific information about each request. Below are two simple ways to pack those logs with more useful metadata.

Missing Default Options
Nginx comes with some fairly powerful optional log fields that are not included by default. This is the log_format we recommend for any configuration file (note the extra quotes around some fields):
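Based on the field list below, such a log_format might look like this sketch. (The combined name follows the text below; note that nginx rejects redefining its built-in combined format, so in practice you may need to pick a different name.)

```nginx
log_format combined '$remote_addr - $remote_user [$time_local] "$request" '
                    '$status $bytes_sent $request_length $request_time '
                    '"$http_referer" "$http_user_agent" '
                    '"$http_x_forwarded_for" "$http_x_forwarded_proto" '
                    '"$http_authorization" $host $server_name $request_id';
```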
These fields are not part of the default access_log line, but by defining a log_format (combined, in the example above) and specifying the format name (--nginx.format=combined), you will be able to take advantage of all of these additional fields.
Make sure that all fields that start with $http_ are quoted in your log_format:
- $bytes_sent: the size of the response sent back to the client, including headers
- $host: the requested Host header, identifying how your server was addressed
- $http_authorization: authorization headers, for associating logs with individual users (must be quoted)
- $http_referer: the referring site, if the client followed a link to your site (must be quoted)
- $http_user_agent: the User-Agent header, useful in identifying your clients (must be quoted)
- $http_x_forwarded_for: the origin IP address, if running behind a load balancer (must be quoted)
- $http_x_forwarded_proto: the origin protocol, if terminating TLS in front of nginx (must be quoted)
- $remote_addr: the IP address of the host making the connection to nginx
- $remote_user: the user name supplied when/if using basic authentication
- $request_id: an nginx-generated unique ID for every request (only available in nginx version 1.11 and later)
- $request_length: the length of the client's request, including headers and body
- $request_time: the time (in seconds, with millisecond resolution) the server took to respond to the request
- $request: the HTTP method, request path, and protocol version
- $server_name: the hostname of the machine accepting the request
- $status: the HTTP status code returned for this request
Embedding Custom Response Headers
Nginx can also be configured to extract custom request and response headers. Of the two, response headers are the more powerful in this case: they can carry application-specific IDs or timers back through to the nginx log. Having all of the information pertinent to a single request available in a single log line can be an incredibly powerful tool in diagnosing the origin of a problem in your system. To include a specific response header in your access.log, add an $upstream_http_ variable to your log_format; the response header values will be written out and ingested by our nginx parser!
Make sure to put quotes around these variables to capture any embedded spaces.
For example, an X-RateLimit-Remaining header can be output by adding $upstream_http_x_ratelimit_remaining to the log_format line.
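That example, sketched as a log_format (other fields abbreviated):

```nginx
log_format my_favorite_format '$remote_addr - $remote_user [$time_local] '
                              '"$request" $status $bytes_sent '
                              '"$upstream_http_x_ratelimit_remaining"';
```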
See the nginx docs for more about extracting metadata from the HTTP response or request.
As with other fields which may output strings (for example, $http_user_agent), be careful when logging strings: add an extra set of double quotes around values which might contain spaces, in order to ensure correct parsing.
A final trick: sometimes, response headers may be set for logging that should not be exposed back to the user.
In this case, the proxy_hide_header directive may be used to strip out specific headers by name:
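A sketch of that configuration (the app_backend upstream name is hypothetical):

```nginx
location / {
    proxy_pass http://app_backend;
    # $upstream_http_x_ratelimit_remaining remains available to the log_format,
    # but the header is stripped from the response sent back to the client
    proxy_hide_header X-RateLimit-Remaining;
}
```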
Scrubbing Personally Identifiable Information
While we believe strongly in the value of being able to track down the precise query causing a problem, we understand the concerns of exporting log data which may contain sensitive user information. With that in mind, we recommend using honeytail's nginx parser, but adding a --scrub_field=sensitive_field_name flag to hash the concrete sensitive_field_name value, or --drop_field=sensitive_field_name to drop it altogether and prevent it from being sent to Honeycomb's servers.
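For example, hashing the remote_user field (chosen here purely as an illustration) might look like this sketch:

```shell
honeytail --parser=nginx \
  --writekey=YOUR_API_KEY \
  --file=/var/log/nginx/access.log \
  --dataset=nginx-access \
  --scrub_field=remote_user
```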
Find more information about dropping or scrubbing sensitive fields.
Parsing URL Patterns
honeytail can break URLs up into their component parts, storing extra information in additional columns.
This behavior is turned on by default for the request field on nginx datasets, but can become more useful with a little bit of guidance from you.
See honeytail’s documentation for details on configuring our agent to parse URL strings.