Knowledge + Practice

Splunk Core Certified Power User SPLK-1003 (SPLK-1003) — Questions 76–150

500 questions total · 7pages · All types, answers revealed

Take a mock exam Exam hub

Page 2 of 7

76

MCQhard

A dashboard uses a timechart to show CPU usage over 24 hours. The time range selector is set to 'Last 7 days'. The chart displays data only for the last 24 hours. Which visualization setting is MOST likely causing this?

A.The chart's 'Time range override' is set to 24 hours.

B.The 'Max rows' is set to 24.

C.The 'Span' is set to 1 hour.

D.The data source only retains 24 hours.

AnswerA

A time range override on the panel overrides the dashboard selector.

Why this answer

The 'Time range override' setting on a visualization explicitly overrides the dashboard's global time picker. When set to a fixed duration like 24 hours, the chart ignores the 'Last 7 days' selection and only displays data for that specific window. This explains why the chart shows only the last 24 hours despite the dashboard time range being 7 days.

Exam trap

Splunk often tests the distinction between 'Time range override' (which changes the time window) and 'Span' (which changes the bucket size), leading candidates to confuse a bucket interval setting with a time range restriction.

How to eliminate wrong answers

Option B is wrong because 'Max rows' limits the number of data points displayed (e.g., 24 rows), not the time window; it could still show 24 rows spanning 7 days if the span is large enough. Option C is wrong because 'Span' controls the bucket interval (e.g., 1-hour buckets) but does not restrict the overall time range; a 1-hour span over 7 days would produce 168 data points, not truncate to 24 hours. Option D is wrong because the question states the data source retains 24 hours, but the dashboard time range is set to 7 days; if the data source only had 24 hours, the chart would show nothing beyond that, but the issue is that the chart shows exactly 24 hours, not that data is missing—this points to a visualization override, not a data retention limit.

Full explanation →

77

Multi-Selectmedium

Which TWO of the following statements about lookup tables in Splunk are true?

Select 2 answers

A.Lookups can only add new fields to events; they cannot replace existing field values.

B.External lookups can only retrieve data from CSV files.

C.KV Store lookups require defining a collection in collections.conf.

D.CSV lookups can be updated by replacing the CSV file without restarting Splunk.

E.Lookup files can only be stored in $SPLUNK_HOME/etc/system/lookups/.

AnswersC, D

KV Store lookups are backed by a collection that must be defined in collections.conf.

Why this answer

CSV lookups can be updated by replacing the file; Splunk automatically reloads them. KV Store lookups require a collection definition in collections.conf. The other options are false: lookup files can be stored in app directories, external lookups can connect to various data sources via scripts, and lookups can replace field values if configured with overwrite=true.

Full explanation →

78

Multi-Selecteasy

Which THREE of the following are valid Splunk search commands for determining the number of distinct values of a field?

Select 3 answers

A.| stats count(field)

B.| stats dc(field)

C.| stats distinct_count(field)

D.| stats values(field)

E.| dedup field | stats count

AnswersB, D, E

dc() is a stats function that returns distinct count.

Why this answer

Option B is correct because `| stats dc(field)` uses the `dc` (distinct count) function to return the exact number of unique values for the specified field. This is the direct and most efficient command for counting distinct values in Splunk.

Exam trap

The trap here is that candidates often confuse `count(field)` with `dc(field)`, not realizing that `count` tallies events while `dc` tallies unique values, and that `distinct_count` is not a valid Splunk command.

Full explanation →

79

Multi-Selectmedium

Which TWO of the following are best practices when creating lookup table files?

Select 2 answers

A.Use header rows with spaces in field names for readability.

B.Ensure the lookup file is in CSV format with the .csv extension.

C.Enable automatic lookup refresh for frequently updated files.

D.Place the lookup file in the $SPLUNK_HOME/etc/apps/search/lookups directory.

E.Use full paths in the lookup definition.

AnswersB, D

CSV with header row is the standard and recommended format.

Why this answer

B and C are correct. Using CSV format with a header row is standard (B). Placing the file in the app's lookups directory ensures proper access (C).

A is wrong because spaces in field names cause issues. D is unnecessary (full paths are not recommended). E is not a best practice for file creation; automatic refresh is configured separately.

Full explanation →

80

Matchingmedium

Match each Splunk component to its function.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Indexes and stores incoming data

Distributes search requests and merges results

Sends data to indexers or other forwarders

Manages configuration of forwarders

Manages license usage across the deployment

Why these pairings

These are core components of a Splunk distributed environment.

Full explanation →

81

MCQmedium

A Splunk admin notices that a search using the transaction command takes too long. To debug, they want to see how events are grouped into transactions before the transaction command runs. Which command can be added to the search pipeline before transaction to inspect the grouping?

A.map

B.fields

C.transaction

D.streamstats

AnswerD

streamstats can create a session ID or other grouping field, enabling preview of transaction grouping.

Why this answer

Option B is correct because streamstats can be used to compute a transaction ID based on a time window or field change, allowing you to see how events would be grouped. Options A, C, and D do not help inspect grouping.

Full explanation →

82

MCQeasy

A Splunk administrator wants to reduce maintenance effort when the same search logic is used in multiple saved searches. Which approach is most effective?

A.Define a macro that encapsulates the common search logic and reference the macro in each saved search.

B.Use the Common Information Model (CIM) to normalize the data and then search using data model commands.

C.Create a summary index that contains the output of the common logic and have each saved search reference that summary index.

D.Enable report acceleration on each saved search to improve performance.

AnswerA

Macros promote reuse and centralize changes.

Why this answer

Option C is correct because macros allow reusable search components. Option A (summary index) increases complexity. Option B (post-process) is for data models.

Option D (report acceleration) is for speeding up reports, not reuse.

Full explanation →

83

Multi-Selectmedium

Which TWO of the following are valid ways to define macro arguments in Splunk? (Select exactly 2.)

Select 2 answers

A.`$arg$` without default

B.Using `| eval` to set argument values inside the macro

C.Defining arguments list in `args` field: `args = arg1, arg2`

D.`$arg$` with default value: `$arg$=default$`

E.Numeric argument: `$1$` in definition and usage

AnswersC, D

The `args` field specifies the names of positional arguments.

Why this answer

Options B and D are correct. Option A is not valid because `$arg$` alone does not assign a default. Option C uses numeric syntax correctly with `$1$`.

Option E is not a standard argument syntax.

Full explanation →

84

Multi-Selectmedium

Which TWO search commands can be used to calculate a running total (cumulative sum) of a field over time?

Select 2 answers

A.delta

B.accum

C.transaction

D.streamstats

E.eventstats

AnswersB, D

Built-in command for cumulative sum.

Why this answer

B is correct because the `accum` command calculates a cumulative sum of a specified numeric field across all events in the search result order, adding each event's value to the running total. This directly implements a running total without requiring any additional options.

Exam trap

Splunk often tests the distinction between `accum` and `eventstats`, where candidates mistakenly choose `eventstats` thinking it computes running totals, but it actually computes a single aggregate over all events and appends that same value to every event.

Full explanation →

85

MCQmedium

An analyst is using the transaction command to group events by a field that has high cardinality (millions of unique values). The search is taking too long and consuming too much memory. Which approach should be taken to improve performance?

A.Reduce the cardinality of the field by using a derived field with fewer values.

B.Use the 'maxspan' option to narrow the time window.

C.Use the 'mvlist' option to reduce field storage.

D.Use the 'maxevents' option to limit number of events per transaction.

AnswerA

Correct: Reduces open transactions and improves performance.

Why this answer

Option D is correct. High cardinality causes many open transactions; reducing cardinality (e.g., using a derived field) improves performance. Options A and B help but are not the primary cause; Option C is not a valid option.

Full explanation →

86

MCQeasy

An analyst wants to find the top 5 users who have the highest total bytes transferred. The data has fields 'user' and 'bytes'. Which search should be used?

A.| stats max(bytes) as max_bytes by user | sort - max_bytes | head 5

B.| stats sum(bytes) as total_bytes by user | sort - total_bytes | head 5

C.| sort - bytes | head 5 | table user, bytes

D.| top limit=5 user

AnswerB

This correctly sums bytes per user, sorts descending, and takes top 5.

Why this answer

Option B is correct because it uses `stats sum(bytes) as total_bytes by user` to aggregate the total bytes transferred per user, then sorts the results in descending order with `sort - total_bytes`, and finally limits the output to the top 5 users with `head 5`. This directly answers the requirement for the highest total bytes transferred.

Exam trap

The trap here is that candidates often confuse `max` with `sum` for total calculations, or mistakenly think sorting raw events and taking the top 5 yields user-level totals, when in fact aggregation by user is required first.

How to eliminate wrong answers

Option A is wrong because `stats max(bytes)` returns the single largest byte value per user, not the sum of all bytes transferred, so it does not calculate total bytes. Option C is wrong because it sorts individual events by bytes and takes the first 5, which gives the 5 events with the highest bytes, not the top 5 users by total bytes; it also does not aggregate per user. Option D is wrong because `top limit=5 user` counts the frequency of user occurrences, not the sum of bytes transferred, so it answers a different question.

Full explanation →

87

MCQmedium

A user wants to create a trellis chart with multiple panels, each showing data for a different department. What attribute should be used to split the visualization?

A.Overlay

B.Split by

C.Drilldown

D.Color

AnswerB

Split by creates separate panels for each value of the field.

Why this answer

In trellis charts, the 'Split by' attribute divides the visualization into multiple panels based on a field value. Overlay, color, and drilldown are not used for splitting.

Full explanation →

88

Multi-Selectmedium

Which TWO of the following statements about the 'transaction' command are true? (Choose two.)

Select 2 answers

A.It can add a 'duration' field to the result events.

B.It requires events to be sorted by _time in descending order.

C.It can only be used with 'startswith' and 'endswith' options.

D.It requires at least one field in the 'by' clause.

E.It groups events that are logically related based on common field values and time proximity.

AnswersA, E

Correct: transaction automatically adds a duration field if enabled.

Why this answer

Option A is correct because the 'transaction' command automatically adds a 'duration' field to the result events, which represents the time difference between the first and last event in the transaction. This is a built-in behavior of the command, not an optional setting.

Exam trap

Splunk often tests the misconception that the 'transaction' command requires a 'by' clause or that it only works with 'startswith'/'endswith', when in fact it can group events purely by time proximity using 'maxspan' and 'maxpause'.

Full explanation →

89

MCQhard

A Splunk administrator notices that a `transaction` command used for correlating VPN login and logout events is consuming excessive memory and causing search timeouts. The transaction groups events by `user` with `maxspan=12h` and `maxpause=30m`. The VPN logs contain millions of events per day. Which design change would most effectively reduce resource consumption while maintaining the ability to correlate logins and logouts within the same session?

A.Remove the maxpause option from the transaction command to simplify grouping.

B.Reduce maxspan to 4h to limit the time window for grouping events.

C.Replace the transaction command with a stats command using earliest and latest functions on the event type.

D.Add maxevents=2 to the transaction command to limit each transaction to exactly two events.

AnswerC

Using `stats earliest(_time) as login, latest(_time) as logout by user` is much more memory efficient and still captures session boundaries.

Why this answer

Option C is correct because replacing `transaction` with `stats` using `earliest` and `latest` eliminates the in-memory event buffering that causes memory exhaustion. `transaction` holds all events in memory until the transaction boundary (maxspan/maxpause) is reached, which is extremely expensive for millions of VPN events. `stats` processes events in a streaming fashion, computing the first and last timestamps per user without storing the full event list, drastically reducing memory and avoiding timeouts.

Exam trap

Splunk often tests the misconception that reducing time windows or event counts in `transaction` solves memory issues, but the real trap is that `transaction` always buffers events in memory, whereas `stats` is a streaming command that avoids this bottleneck entirely.

How to eliminate wrong answers

Option A is wrong because removing `maxpause` would cause the transaction to never close on idle gaps, leading to even larger in-memory buffers and worse memory consumption. Option B is wrong because reducing `maxspan` to 4h only limits the time window but does not address the fundamental issue of `transaction` buffering all events per user in memory; it may still cause memory exhaustion with high event volumes. Option D is wrong because `maxevents=2` assumes exactly one login and one logout per session, but VPN logs may have multiple login attempts or reconnections; this would break correlation for legitimate multi-event sessions and still not reduce memory if events arrive out of order or within the pause window.

Full explanation →

90

MCQeasy

Which visualization type is best for comparing the proportion of each error type to the total errors over time?

A.Pie chart (with multiple series)

B.Column chart

C.Line chart

D.Area chart

AnswerD

Stacked area chart effectively shows proportion over time.

Why this answer

Option D is correct because an area chart (especially stacked) shows proportions over time. Option A (column) can be used but area is more common for proportion over time. Option B (line) does not show proportion.

Option C (pie) is not time-based.

Full explanation →

91

Multi-Selectmedium

Which TWO statements about lookups in Splunk are correct? (Choose two.)

Select 2 answers

A.The lookup command can be used to add fields from a lookup table to search results

B.External lookups can only be used with CSV files

C.KV store lookups support real-time updates during a search

D.CSV lookups can be updated in real-time by adding rows to the CSV file

E.Automatic lookups are configured in macros.conf

AnswersA, C

Correct; lookup command enriches events.

Why this answer

Option A is correct because the `lookup` command in Splunk is specifically designed to enrich search results by adding fields from a lookup table (such as CSV, KV Store, or external lookups) based on matching field values. This is a fundamental use case for lookups in Splunk, allowing you to augment raw event data with additional context from a static or dynamic source.

Exam trap

The trap here is that candidates often confuse the capabilities of CSV lookups (static, loaded at search start) with KV Store lookups (dynamic, real-time), and mistakenly assume that CSV files can be updated live or that automatic lookups are configured in macros.conf instead of transforms.conf.

Full explanation →

92

MCQhard

An analyst observes that a lookup command with a large lookup file is causing the search to timeout. The lookup is used to extract additional fields based on a field value. What is the most effective immediate solution?

A.Change the match_type to WILDCARD to be more flexible.

B.Reduce the lookup file size by removing unused columns or rows.

C.Use the inputlookup command to load the entire table.

D.Increase the search timeout in limits.conf.

AnswerB

Smaller files process faster, and removing unnecessary data reduces lookup time.

Why this answer

Option C is correct because reducing the lookup file size (by removing unused columns or rows) directly reduces the amount of data processed, improving performance. Options A, B, D are less effective or exacerbate the issue.

Full explanation →

93

MCQmedium

A SOC analyst is investigating a security incident. They use `transaction src_ip` to group firewall events. The search returns too many single-event transactions. The analyst suspects that some events should be grouped but are not because the IP address is used by different sessions. Which option can help ensure events are grouped only if they occur close in time?

A.maxspan=1h

B.maxevents=2

C.fields=src_ip

D.maxpause=5m

AnswerD

maxpause ensures events are only grouped if they occur within 5 minutes of each other, reducing false grouping.

Why this answer

The maxpause option sets the maximum time gap allowed between consecutive events in a transaction. Using a short maxpause helps ensure that events from different sessions are not incorrectly grouped.

Full explanation →

94

Multi-Selecthard

Which two methods can reduce the resource consumption of a large CSV lookup in Splunk? (Choose 2)

Select 2 answers

A.Convert the CSV to a gzip file and reference it

B.Increase the max_match parameter

C.Use a time-based lookup to limit matches by time

D.Use the lookup command to only return required fields

E.Use a KV Store lookup with a smaller data set

AnswersD, E

Limiting output fields reduces data transfer and processing.

Why this answer

Option D is correct because using the `lookup` command with a field list (e.g., `| lookup mylookup.csv field1 OUTPUT field2 field3`) reduces resource consumption by only loading the specified output fields into memory, rather than the entire CSV lookup. This minimizes memory usage and I/O overhead, especially for large CSV files with many columns.

Exam trap

The trap here is that candidates often assume compressing the CSV file (Option A) would reduce resource consumption, but Splunk cannot read compressed CSV files natively, making this an invalid optimization.

Full explanation →

95

Multi-Selecteasy

Which three of the following are benefits of using the Common Information Model (CIM)? (Choose THREE.)

Select 3 answers

A.Ensures consistent field naming across data sources.

B.Provides pre-built data models for common domains.

C.Reduces the need for custom field extractions.

D.Automatically generates reports for compliance.

E.Allows sharing of dashboards and searches across environments.

AnswersA, B, E

Correct: CIM defines common field names for normalization.

Why this answer

Options A, C, and E are correct. CIM ensures consistent field naming, allows sharing of dashboards across environments, and provides pre-built data models. It does not automatically generate compliance reports, and while it reduces the need for custom extractions, it does not eliminate them entirely.

Full explanation →

96

MCQhard

A search is used to calculate the 95th percentile of response times for each application, and then to find applications where the 95th percentile exceeds 5000 ms. The current search is: `index=perf sourcetype=app_response | stats perc95(response_time) by app | where perc95(response_time) > 5000` This search fails with an error. What is the most likely reason?

A.The stats command cannot use perc95 with a by clause.

B.The stats command requires a rename of the output field.

C.The field name in the where clause must match exactly, including parentheses. Use `where 'perc95(response_time)' > 5000`

D.The perc95 function is not a valid stats function.

AnswerC

In Splunk, the resulting field is named with the function and parentheses, so it must be quoted or escaped.

Why this answer

Option C is correct because the `where` clause in SPL treats field names containing special characters (like parentheses) as literal strings. Without quoting, `perc95(response_time)` is parsed as a function call rather than a field name, causing a syntax error. Wrapping the field name in single quotes (`'perc95(response_time)'`) tells Splunk to treat it as a literal field reference.

Exam trap

Splunk often tests the nuance that field names generated by `stats` with functions like `perc95()` must be quoted in subsequent commands like `where` or `eval` to avoid being misinterpreted as function calls.

How to eliminate wrong answers

Option A is wrong because `perc95` is a valid percentile function in the `stats` command and can be used with a `by` clause to compute percentiles per group. Option B is wrong because the `stats` command does not require renaming the output field; the field is automatically named `perc95(response_time)` and can be referenced directly. Option D is wrong because `perc95` is a valid stats function in Splunk, used to calculate the 95th percentile of a numeric field.

Full explanation →

97

MCQeasy

A security analyst wants to create a visualization that shows the count of failed login attempts per user over the last 7 days. Which visualization type is most appropriate?

A.Pie chart showing proportion of attempts per user

B.Line chart with time on x-axis and count on y-axis

C.Scatter chart showing correlation between user and time

D.Bar chart with users on x-axis and count on y-axis

AnswerB

Line charts effectively display changes over time.

Why this answer

A line chart is best for showing trends over time. A pie chart shows proportions at a single point, a bar chart can show comparisons but not trends, and a scatter chart is for correlations.

Full explanation →

98

MCQhard

An administrator notices that a search using the timechart command returns data for every 15-minute bucket even when no events exist, creating many null values. How can this behavior be suppressed?

A.Use timechart limit=0

B.Use timechart usenull=f

C.Use timechart partial=false

D.Use timechart cont=false

AnswerD

Suppresses continuous time bins with no data.

Why this answer

Option D is correct: cont=false suppresses continuous time bins, showing only buckets with data. Option A (limit=0) affects number of series, not gaps. Option B (usenull=f) is not a valid timechart option.

Option C (partial=false) controls partial buckets at edges.

Full explanation →

99

Matchingmedium

Match each Splunk report type to its description.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Displays results in a tabular format

Visualizes data as a chart (e.g., bar, line, pie)

Shows statistical summaries like count, avg, sum

A collection of panels with visualizations

Triggers actions based on search results

Why these pairings

Reports help present and monitor data in Splunk.

Full explanation →

100

MCQmedium

A SOC manager wants to plot locations of security incidents on a map using latitude and longitude fields. Which visualization type should be used in a Splunk dashboard?

A.Cluster map

B.Choropleth map

C.Pie chart

D.Scatter chart

AnswerA

Cluster map visualizes point data on a map.

Why this answer

A cluster map is the correct choice because it is specifically designed to plot individual geographic coordinates (latitude/longitude) as points on a map, and it automatically groups nearby points into clusters to improve readability when many incidents are present. In Splunk, the cluster map visualization uses the `geostats` command or directly ingests lat/lng fields to render markers, making it ideal for showing security incident locations without requiring predefined region boundaries.

Exam trap

Splunk often tests the distinction between cluster maps (point-based) and choropleth maps (region-based), leading candidates to mistakenly choose choropleth when they see 'map' and 'latitude/longitude' in the same question.

How to eliminate wrong answers

Option B (Choropleth map) is wrong because it requires a field that maps to predefined geographic regions (e.g., country, state, ZIP code) and shades entire areas based on aggregated data, not individual latitude/longitude points. Option C (Pie chart) is wrong because it is a categorical chart used to show proportions of a whole, not spatial data. Option D (Scatter chart) is wrong because it plots numeric X/Y coordinates on a Cartesian plane, not geographic latitude/longitude on a map projection, and lacks map tile layers or clustering behavior.

Full explanation →

101

Multi-Selecthard

Refer to the exhibit. An analyst runs a search over access_combined events and notices that some events are not getting the region_name and region_code fields. Which TWO changes could resolve this issue? (Choose two.)

Select 2 answers

A.Increase max_matches to a higher value

B.Check whether the field used for the lookup (clientip) is correctly extracted in the events

C.Remove the LOOKUP- stanza and use the lookup command in each search

D.Verify that the lookup file regions.csv contains matching values for all clientip values in the events

E.Set default_match to the desired default values for all fields

AnswersB, D

If clientip field is not extracted, lookup cannot match.

Why this answer

Option B is correct because if the `clientip` field is not properly extracted from the events, the lookup against `regions.csv` will fail to match, resulting in missing `region_name` and `region_code` fields. The lookup command or LOOKUP- stanza relies on a correctly extracted field to perform the key-value match; without it, no enrichment occurs.

Exam trap

Splunk often tests the distinction between lookup configuration issues (missing key field or missing CSV data) versus lookup performance settings (like `max_matches`), leading candidates to incorrectly choose options that adjust lookup behavior rather than fixing the data or extraction problem.

Full explanation →

102

MCQmedium

An analyst runs a search that includes a subsearch: `index=web [search index=web status=500 | fields url | dedup url | limit 5]`. The main search returns no results even though the subsearch returns 5 URLs. What is the most likely issue?

A.The `limit 5` should be inside the subsearch command, not after dedup.

B.The main search index should be different from the subsearch index.

C.The subsearch returns multiple values but the main search does not properly use them as a filter.

D.The subsearch should not use `fields url` because that causes loss of the search command.

AnswerC

The subsearch returns a list of URLs, but the main search must use the result in a way that matches the field; typically one would use `[search ... | table url | format]` to ensure correct formatting.

Why this answer

The subsearch returns a list of 5 URLs, but the main search `index=web [subsearch]` uses these results as a filter. By default, a subsearch returns its results as a single string (or multiple values) that are passed to the main search's `search` command. If the subsearch returns multiple values (e.g., `/page1 /page2 ...`), the main search interprets them as a single search string unless properly formatted with quotes or an `OR` operator.

Since the subsearch does not use `format` or `return` to structure the output, the main search likely receives an invalid or empty filter, yielding no results.

Exam trap

The trap here is that candidates assume the subsearch results are automatically used as a proper filter, but Splunk requires explicit formatting (e.g., `return` or `format`) to ensure multiple values are correctly combined with OR logic in the main search.

How to eliminate wrong answers

Option A is wrong because `limit 5` after `dedup` is valid; it limits the deduplicated results to 5, and the subsearch correctly returns 5 URLs. Option B is wrong because the main search and subsearch can use the same index; there is no requirement for different indexes, and using the same index is common for filtering. Option D is wrong because `fields url` is a valid command that retains only the `url` field, which is necessary for the subsearch to return URL values; it does not cause loss of the search command.

Full explanation →

103

MCQhard

A lookup table must be updated multiple times per minute from an external API. Which lookup method provides the best performance for search-time enrichment?

A.CSV file lookup updated by a scheduled script

B.Index-time lookup using a transformation that queries the API during indexing

C.Scripted lookup that calls the API on each search

D.KV Store lookup updated via REST API

AnswerD

KV Store supports rapid updates and efficient indexed lookups at search time.

Why this answer

Option D is correct because KV Store lookups can be updated via REST API calls without reindexing and provide fast search-time lookups. Option A is incorrect because file lookups require rewriting the file and reloading. Option B is incorrect because index-time lookups require reindexing.

Option C is incorrect because scripted lookups call the API on every search, causing severe performance issues.

Full explanation →

104

MCQhard

A data engineer has defined a CIM data model for 'Network_Traffic'. They have also created field aliases using `| fieldaliases` to map custom fields like `src_ip` and `dest_ip` to the CIM fields. When running searches against the data model, some events do not appear. The engineer verified that the tags are correctly applied. What is the most likely remaining issue?

A.The field aliases must be defined within the data model itself, not via `| fieldaliases`.

B.The data model search is using a time range that excludes the events.

C.The custom fields are not indexed, so they cannot be used in data models.

D.The tags are applied only to a subset of events.

AnswerA

Data model acceleration uses the data model's field definitions; `| fieldaliases` is a search-time command and does not impact accelerated data models.

Why this answer

Option C is correct: field aliases defined with `| fieldaliases` are not applied to data model searches; instead, data model acceleration uses fields defined in the data model itself. Option A: tags are for inclusion, not mapping. Option B: time range would affect all events equally.

Option D: lookup table would not solve missing fields.

Full explanation →

105

Multi-Selectmedium

Which TWO of the following statements about the 'stats' command are true?

Select 2 answers

A.It can be used with a BY clause to group results.

B.The count() function must always include a field argument.

C.It creates one event per input event by default.

D.It can be used to modify individual field values in raw events.

E.It can produce multiple output columns by using multiple stats functions.

AnswersA, E

The BY clause allows grouping by one or more fields.

Why this answer

Option A is correct because the 'stats' command in Splunk supports a BY clause that allows you to group results by one or more fields, similar to a SQL GROUP BY. This enables you to compute aggregate statistics (e.g., count, sum, avg) for each distinct value of the specified field(s), which is a core feature for summarizing event data.

Exam trap

Splunk often tests the misconception that 'stats' works like 'eval' or 'rex' to modify raw events, or that count() always requires a field, when in fact count() without a field is a valid and common usage.

Full explanation →

106

MCQmedium

An analyst creates a timechart to display the average CPU usage over time for multiple hosts. The chart shows a single line representing the overall average instead of separate lines per host. What is the most likely cause?

A.The search uses 'timechart avg(cpu) by host' but the host field is null.

B.The search uses 'timechart span=1h avg(cpu)' without splitting.

C.The visualization type is set to 'Column' instead of 'Line'.

D.The search uses 'timechart avg(cpu) over date' without a by clause.

AnswerB

Without a 'by' clause, timechart aggregates all events into one series.

Why this answer

Option C is correct because the timechart command without a 'by' clause produces a single series showing the overall average. Options A, B, and D would produce different issues but not the described symptom.

Full explanation →

107

Matchingmedium

Match each Splunk macro to its definition.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

A reusable search snippet without arguments

A reusable search snippet with arguments

A search within a search, enclosed in brackets

A macro that performs a lookup

A macro that evaluates an expression

Why these pairings

Macros simplify complex searches by reusing common patterns.

Full explanation →

108

MCQmedium

An analyst wants to group events from different sourcetypes (web_access and error_log) into a single transaction when they share the same 'request_id' field and occur within 1 minute. Which search correctly accomplishes this?

A.index=* | transaction request_id maxspan=1m

B.index=main | join type=inner request_id [search sourcetype=error_log]

C.sourcetype=web_access | transaction request_id maxspan=1m | append [search sourcetype=error_log | transaction request_id maxspan=1m]

D.(sourcetype=web_access OR sourcetype=error_log) | transaction request_id maxspan=1m

AnswerD

Correctly combines sourcetypes and groups by request_id within 1 minute.

Why this answer

Using OR to combine sourcetypes and then transaction with maxspan=1m groups events by request_id across both sourcetypes, with a total time limit of 1 minute.

Full explanation →

109

Multi-Selecthard

Which THREE are best practices for creating lookups in Splunk?

Select 3 answers

A.Use case-insensitive matching by default for user-facing lookups.

B.Use wildcard matching in lookups for better performance.

C.Use a time-based lookup for static reference data.

D.Define lookups in a lookup definition (transforms.conf).

E.Store lookup files in the lookups directory of the app.

AnswersA, D, E

Reduces user errors.

Why this answer

Option A is correct because using case-insensitive matching by default in user-facing lookups ensures that users do not need to worry about the case of the lookup key, which improves usability and reduces errors. In Splunk, this is configured via the `LOOKUP` command or in transforms.conf with the `case_sensitive_match = false` setting, which is the default behavior for most lookup types. This best practice avoids confusion when users query data with varying case formats, such as user names or hostnames.

Exam trap

The trap here is that candidates often confuse performance optimization with convenience, mistakenly thinking wildcard matching is a best practice, when in reality it is a performance anti-pattern that should be avoided in favor of exact or CIDR-based matching.

Full explanation →

110

MCQmedium

A security analyst needs to find all events where the field 'user' has a value that is either 'admin' or 'root', but the search is returning too many results from a noisy source. Which search best filters the events to only include those where the 'user' field exactly matches 'admin' or 'root'?

A.user="admin" OR user="root"

B.user=*admin* OR user=*root*

C.user IN ("admin", "root")

D.user=admin OR user=root

AnswerC

The IN operator matches fields exactly against the listed values, avoiding substring issues.

Why this answer

Option C is correct because the `IN` operator in Splunk's Search Processing Language (SPL) performs an exact match against a list of values, ensuring that only events where the `user` field is exactly 'admin' or 'root' are returned. This is the most precise and efficient way to filter for multiple exact values without introducing wildcard behavior or relying on implicit field-value parsing that may include surrounding whitespace or punctuation.

Exam trap

Splunk often tests the distinction between exact match operators (`=`, `IN`) and wildcard patterns (`*`), trapping candidates who assume that `user=admin` (unquoted) or `user="admin"` (quoted) will always perform an exact match, when in fact they can behave differently depending on the field's data type and the presence of special characters.

How to eliminate wrong answers

Option A is wrong because `user="admin" OR user="root"` uses quoted strings, which in Splunk can sometimes be interpreted as literal strings rather than field-value comparisons, potentially causing the search to match substrings or fail if the field contains extra characters. Option B is wrong because `user=*admin* OR user=*root*` uses wildcards (`*`), which match any value containing 'admin' or 'root' anywhere in the field, returning many irrelevant events (e.g., 'superadmin', 'rootuser'). Option D is wrong because `user=admin OR user=root` uses unquoted values, which Splunk may treat as search terms rather than exact field-value pairs, leading to unintended matches or parsing errors.

Full explanation →

111

MCQhard

A large e-commerce company runs Splunk Enterprise on a single indexer cluster with four indexers. They have been experiencing slow search performance during peak hours, especially for searches that cover the last 24 hours. The environment uses a default search time range of 'Last 30 days'. The team has noticed that searches often time out or return partial results. They have also observed high CPU usage on the search head during peak times. The company's data volume is approximately 500 GB per day across various sources. They have implemented some search acceleration for data models, but the issue persists. The security team needs to run ad-hoc searches for threat hunting that cover multiple sourcetypes over the last 7 days. Additionally, the search head has a memory limit that is sometimes reached. The security team's searches are complex and involve joins and subsearches. The existing acceleration only covers a few data models. The team is looking for a quick win that does not require significant infrastructure changes. Which course of action would most effectively improve search performance without compromising data completeness?

A.Implement a data model for all sourcetypes and enforce using tstats for all searches

B.Reduce the default time range to 'Last 7 days' and encourage users to specify shorter time ranges

C.Increase the number of indexers to distribute the load

D.Use the search head clustering feature to distribute search load across multiple search heads

AnswerB

Immediately reduces data scanned for most searches, a quick win.

Why this answer

Option B is correct because reducing the default time range from 'Last 30 days' to 'Last 7 days' directly reduces the data scanned by searches, which is the most effective quick win without infrastructure changes. Since the environment has high CPU usage on the search head and searches often time out, limiting the default time range reduces the load on the indexers and search head, improving performance for the majority of searches. This change does not compromise data completeness because users can still specify longer time ranges when needed.

Exam trap

Splunk often tests the misconception that adding more hardware (indexers or search heads) is the only way to improve performance, when in fact optimizing search time ranges and using acceleration appropriately can provide a quicker and more cost-effective solution.

How to eliminate wrong answers

Option A is wrong because implementing a data model for all sourcetypes and enforcing tstats for all searches would require significant upfront effort and may not cover ad-hoc threat hunting searches that use joins and subsearches, which tstats cannot directly support. Option C is wrong because increasing the number of indexers requires significant infrastructure changes and is not a quick win; it also does not address the search head memory limit or the default time range issue. Option D is wrong because search head clustering distributes search load across multiple search heads, but it requires additional hardware and configuration, and does not reduce the amount of data scanned per search; it also does not address the root cause of high CPU usage on the existing search head.

Full explanation →

112

MCQeasy

Refer to the exhibit. The search returns results quickly but shows zero events for some src values. What does `summariesonly=t` imply?

A.It restricts tstats to only use data from accelerated data model summaries.

B.It causes tstats to search the index directly.

C.It makes the search run faster but less accurate.

D.It forces tstats to return all events, including those not in summaries.

AnswerA

This is the purpose of summariesonly=t; if acceleration is not complete, some data may be missing.

Why this answer

Option A is correct because `summariesonly=t` in a `tstats` command restricts the search to only use data from accelerated data model summaries, ignoring raw event data. This is why the search returns quickly but shows zero events for some `src` values—those values are not present in the accelerated summaries, so they are excluded from the results.

Exam trap

Splunk often tests the misconception that `summariesonly=t` makes searches 'faster but less accurate,' when in reality it strictly limits the data source to summaries, and accuracy depends on summary completeness, not the option itself.

How to eliminate wrong answers

Option B is wrong because `summariesonly=t` does not cause `tstats` to search the index directly; instead, it explicitly avoids the index and relies solely on summary data. Option C is wrong because while `summariesonly=t` can make the search run faster, it does not inherently make it 'less accurate'—accuracy depends on whether the summaries are complete and up-to-date; the option itself does not introduce inaccuracy. Option D is wrong because `summariesonly=t` does the opposite—it forces `tstats` to return only events present in summaries, not all events including those not in summaries.

Full explanation →

113

MCQhard

A search uses the transaction command with maxevents=1000 and maxspan=1h. The search is slow and memory-intensive. Which modification can reduce resource usage while still grouping related events?

A.Use eventstats with a time window.

B.Reduce maxevents to 100.

C.Use tstats instead of transaction.

D.Increase maxspan to 2h.

AnswerB

Reducing the maximum events per transaction lowers memory consumption.

Why this answer

Reducing maxevents to 100 limits the number of events that the transaction command groups into a single transaction, which directly reduces memory consumption and processing overhead. This modification still allows related events to be grouped together, but with a smaller batch size, making the search less resource-intensive while preserving the core grouping logic.

Exam trap

Splunk often tests the misconception that reducing maxevents will break event grouping, when in fact it only limits the number of events per transaction, still allowing related events to be grouped as long as they fall within the maxspan and other criteria.

How to eliminate wrong answers

Option A is wrong because eventstats with a time window does not group events into transactions; it only computes statistical aggregations over a sliding window and cannot correlate events into a single logical group. Option C is wrong because tstats is designed for statistical queries on indexed fields and cannot perform event grouping or transaction-style correlation across raw events. Option D is wrong because increasing maxspan to 2h would allow the transaction to span a longer time window, potentially including more events and increasing memory usage, which is the opposite of reducing resource usage.

Full explanation →

114

MCQeasy

An analyst runs `index=web status=500 | top 10 uri` and gets results. Which statement is true about the 'top' command's behavior?

A.It returns the 10 URIs that appeared most recently.

B.It computes the average latency per URI and shows the top 10.

C.It sums a numeric field per URI and shows the top 10 sums.

D.It requires a 'by' clause to specify the field to group by.

E.It counts the number of events per URI and displays the 10 with the highest count.

AnswerE

Correct: top counts events per field value.

Why this answer

The `top` command in Splunk counts the occurrences of each distinct value of a specified field (here, `uri`) and returns the values with the highest counts. By default, it returns the top 10 results, so option E correctly describes that it counts events per URI and displays the 10 with the highest count.

Exam trap

The trap here is that candidates often confuse `top` with commands like `sort` or `stats`, thinking it sorts by time or computes averages, when in fact it purely counts event frequencies per field value.

How to eliminate wrong answers

Option A is wrong because `top` does not consider recency; it counts total occurrences, not the most recent events. Option B is wrong because `top` does not compute averages; it counts events, and latency is not involved. Option C is wrong because `top` does not sum numeric fields; it counts events per distinct value.

Option D is wrong because `top` does not require a `by` clause; it automatically groups by the field specified as its argument (e.g., `uri`).

Full explanation →

115

Multi-Selectmedium

Which TWO of the following are valid methods to create a lookup table in Splunk?

Select 2 answers

A.Use the eval command to create a lookup definition

B.Use the stats command to aggregate data into a lookup

C.Upload a CSV file through the Splunk Web interface

D.Use the outputlookup command in a search

E.Use the inputlookup command in a search

AnswersC, D

This creates a file-based lookup table.

Why this answer

Option C is correct because Splunk Web provides a direct interface to upload a CSV file and define it as a lookup table, automatically creating the lookup definition and file. Option D is correct because the `outputlookup` command in a search writes search results to a CSV file in the lookups directory, effectively creating a lookup table from dynamic data.

Exam trap

The trap here is that candidates confuse commands that *use* lookups (like `inputlookup`) with commands that *create* lookups, or assume that aggregation commands like `stats` inherently produce a lookup file.

Full explanation →

116

MCQhard

A Splunk admin notices that a time-based lookup (defined in transforms.conf with time_range=TRUE) is not returning correct results for events outside the lookup's time boundaries. The lookup file contains rows with a valid time range. What is the most likely cause?

A.The lookup is defined as an automatic lookup and runs at index time

B.The lookup table has max_matches set to 1, limiting matches

C.The lookup file does not contain a time field

D.The event time is outside the time range defined in the lookup

AnswerD

Time-based lookups only match events whose _time falls within the row's time range.

Why this answer

When a time-based lookup is configured with time_range=TRUE in transforms.conf, Splunk restricts lookup matches to events whose timestamps fall within the time range specified in the lookup file. If an event's timestamp is outside that range, the lookup will not return any results, even if other fields match. This is the most likely cause because the question explicitly states that events are outside the lookup's time boundaries.

Exam trap

The trap here is that candidates may think the lookup file is missing a time field (Option C) when the question explicitly states it contains valid time ranges, or they may confuse time-based lookup behavior with other lookup types that do not enforce time boundaries.

How to eliminate wrong answers

Option A is wrong because automatic lookups run at search time, not index time; index-time lookups are configured with `lookup_index` and are not time-based. Option B is wrong because `max_matches=1` limits the number of results per event but does not prevent matches for events outside the time range; it would still return a match if the time range condition is satisfied. Option C is wrong because the lookup file must contain a time field for time-based lookups to work; the question states the lookup file contains rows with a valid time range, so a time field is present.

Full explanation →

117

MCQmedium

An analyst needs to calculate the average response time for each web server, but only for requests that returned status code 200. The field 'response_time' is numeric. Which search correctly achieves this?

A.index=web | transaction server maxspan=1m | stats avg(response_time) by server

B.index=web status=200 | eventstats avg(response_time) as avg_time by server

C.index=web | eval avg_time = avg(response_time) by server | search status=200

D.index=web status=200 | stats avg(response_time) by server

AnswerD

Correct: filter first, then aggregate.

Why this answer

Option D is correct because it first filters the data to only include events with status=200 using a search-time field filter, then uses the `stats` command with `avg(response_time) by server` to compute the average response time per server. This ensures that only successful requests are included in the aggregation, and the `by server` clause correctly groups the results by each web server.

Exam trap

The trap here is that candidates often confuse `eventstats` with `stats`, thinking `eventstats` can produce a summary table, or they incorrectly use `eval` with aggregation functions, not realizing that `eval` operates on individual events and cannot perform group-by calculations.

How to eliminate wrong answers

Option A is wrong because it uses the `transaction` command with a `maxspan=1m`, which groups events into transactions based on time proximity rather than filtering by status=200, and the `avg(response_time)` would be computed over all events in the transaction, not just successful ones. Option B is wrong because `eventstats` adds the average response time as a new field to each event but does not aggregate the results into a single row per server; it retains all raw events, which does not produce the desired summary table. Option C is wrong because `eval` cannot perform an aggregation like `avg()` with a `by` clause; `eval` works on a per-event basis, and the `search status=200` after the `eval` would filter events before any aggregation, making the syntax invalid and the logic incorrect.

Full explanation →

118

Drag & Dropmedium

Order the steps to configure a field extraction using the Field Extractor (FX) in Splunk.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps

Order

Why this order

The Field Extractor guides you through selecting sample data, defining extraction rules, testing, and saving.

Full explanation →

119

MCQhard

A search uses a subsearch to filter events, but the subsearch returns more than 50,000 results, causing the search to fail. Which approach can avoid this limit while still achieving the goal?

A.Increase the limit in limits.conf.

B.Use a join command instead.

C.Use a nested subsearch with stats to aggregate results first.

D.Use the format command with AND.

AnswerC

Aggregating reduces the number of results returned by the subsearch.

Why this answer

Option C is correct because using a nested subsearch with `stats` to aggregate results first reduces the number of events returned by the subsearch, allowing it to stay under the default 50,000-result limit. This approach pre-processes the subsearch output (e.g., by counting or grouping) so that the outer search receives a manageable set of values, effectively bypassing the limit without altering system configuration.

Exam trap

Splunk often tests the misconception that increasing configuration limits or using commands like `join` or `format` can solve subsearch limit issues, when the correct approach is to reduce the subsearch output size through aggregation.

How to eliminate wrong answers

Option A is wrong because increasing the limit in `limits.conf` is a global configuration change that can degrade search performance and is not a best practice for handling subsearch result limits; it also requires a restart and affects all searches. Option B is wrong because the `join` command itself has a subsearch limit (default 50,000 results) and is generally inefficient, often leading to the same failure or poor performance; it does not inherently avoid the limit. Option D is wrong because the `format` command is used to format subsearch results into a boolean expression for the outer search, but it does not reduce the number of results returned by the subsearch; the subsearch still hits the 50,000-result limit before `format` is applied.

Full explanation →

120

MCQhard

Consider the following search: 'index=web | transaction sessionid maxspan=30m | where eventcount > 5 | stats avg(duration)'. An analyst notices that the search takes a long time and uses excessive memory. Which change would most likely improve performance?

A.Change maxspan to 1h to allow more events.

B.Use the stats command with values(sessionid) instead of transaction.

C.Remove the where clause and use stats after transaction.

D.Add a filter before transaction to reduce events.

AnswerB

Stats with values can group events without the overhead of transaction, but it does not maintain event order or duration. For calculating avg duration, transaction is needed. So this is incorrect.

Why this answer

Filtering events before the transaction command reduces the number of events processed, lowering memory and time requirements.

Full explanation →

121

MCQeasy

A team needs to create a saved search that runs automatically every Monday at 8 AM and emails a CSV file of the results. Besides configuring the search string, which steps are required?

A.Create a dashboard panel that runs the search on schedule and sends email.

B.Set the search to 'Summary Index' and run a separate alert on the summary.

C.Set a schedule (Cron: 0 8 * * 1) and add an email alert action with attachment format CSV.

D.Define the schedule and set the time range to 'Last 7 days'.

AnswerC

Scheduling and alert action are required for automated email delivery.

Why this answer

Option B is correct because a saved search must have both a schedule and at least one alert action (like email with CSV attachment). Option A is incorrect because the schedule time can be set directly. Option C is incorrect because CSV output is configured in the alert action.

Option D is incorrect because summary indexing is optional.

Full explanation →

122

Multi-Selecthard

Which THREE statements about the `transaction` command are true?

Select 3 answers

A.Transaction can correlate events based on more than one field.

B.Transaction events can contain multivalue fields from the constituent events.

C.The maxpause option sets the maximum time span of the transaction.

D.When using startswith and endswith, the transaction event includes a duration field.

E.Transaction is the most efficient way to group events from large datasets.

AnswersA, B, D

Fields can be concatenated or multiple fields specified.

Why this answer

Transaction can group events by multiple fields, calculates duration for start/end events, and can include field values from all constituent events. However, it does not support use of `where` before transaction in the same search pipeline? Actually, it can be used after. Options: A: true, B: false (maxpause limits pause, not span), C: true, D: true, E: false (transaction is memory-intensive, not recommended for large datasets).

Full explanation →

123

Drag & Dropmedium

Order the steps to create a dashboard panel using the XML source editor in Splunk.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps

Order

Why this order

Dashboard panels are defined in XML by adding a panel with a search query and visualization.

Full explanation →

124

Multi-Selectmedium

Which three considerations are important when creating a visualization for a dashboard that will be displayed on a large monitor? (Choose 3)

Select 3 answers

A.Use small font sizes to fit more information

B.Ensure the chart type is appropriate for the data

C.Use high contrast colors

D.Use drill-down actions for detailed views

E.Minimize the number of data points

AnswersB, C, D

Appropriate chart type is always essential.

Why this answer

High contrast improves readability, appropriate chart type is fundamental, and drill-down actions provide interactive details. Small fonts and minimizing data points are detrimental.

Full explanation →

125

MCQhard

An admin created a macro `myfilter(host)` with definition: `host=$host$ | stats count`. When calling `myfilter(webserver)`, the search returns no results. What is the most likely cause?

A.The host field is case-sensitive.

B.The macro argument is not being treated as a literal string.

C.The host field is not indexed.

D.The macro is evaluated before the rest of the search.

AnswerB

Without quotes, the value is interpreted as a field value literal, but the correct syntax is `host="$host$"`.

Why this answer

The macro definition uses `host=$host$`, but when called with `myfilter(webserver)`, the argument `webserver` is passed as a literal string. However, the macro expands to `host=webserver | stats count`, which Splunk interprets as a field-value comparison where `webserver` is treated as a literal string value for the `host` field. The issue is that the macro argument is not being treated as a literal string in the context of the search; instead, it's being substituted directly, which is correct.

The real problem is that the macro definition uses `$host$` without quotes, so the argument is not being treated as a literal string value—it's being interpreted as a field name or search term. The correct syntax should be `host="$host$"` to ensure the argument is treated as a literal string. Option B is correct because the macro argument is not being treated as a literal string, causing the search to fail to match events.

Exam trap

Splunk often tests the misconception that macro arguments are automatically treated as literal strings, when in fact they are substituted as raw text and must be explicitly quoted to be treated as literal values.

How to eliminate wrong answers

Option A is wrong because while the host field is case-sensitive by default in Splunk, this is not the cause of the issue—the macro substitution does not change case, and the search would still return results if the case matched. Option C is wrong because the host field is a default indexed field in Splunk (part of the metadata), so it is always indexed; the problem is not about indexing. Option D is wrong because macro evaluation occurs before the rest of the search, but this is normal behavior and does not cause the issue; the problem is the lack of quotes around the argument, not the evaluation order.

Full explanation →

126

MCQeasy

A search returns many events, and the analyst wants to see a summary table of the top 5 values of the field `src_ip` along with the count of events for each. Which command should be used?

A.eventstats

B.top

C.sort

D.rare

AnswerB

top returns the most frequent values with count and percent.

Why this answer

The `top` command in Splunk is specifically designed to find the most common values of a field and display them in a summary table with counts and percentages. By default, `top` returns the top 10 values, but you can use the `limit=5` parameter to restrict the output to the top 5 values of `src_ip` along with their event counts. This directly meets the analyst's requirement.

Exam trap

The trap here is that candidates may confuse `top` with `sort` or `eventstats`, thinking that sorting or adding statistics to events is sufficient, when in fact only `top` provides the aggregated summary table of the most frequent values with counts.

How to eliminate wrong answers

Option A is wrong because `eventstats` computes aggregate statistics (like count, sum, avg) and adds them as new fields to each event, but it does not produce a summary table of top values. Option C is wrong because `sort` reorders events based on a field but does not aggregate counts or produce a summary table of top values. Option D is wrong because `rare` finds the least common values of a field, which is the opposite of what the analyst needs (top 5).

Full explanation →

127

MCQmedium

A search includes `... | transaction 1,2,3` but returns unexpected results. What does the `1,2,3` represent in this context?

A.Field names to use as transaction keys

B.Three index names to correlate across indexes

C.Status codes for transactions (1=open, 2=pending, 3=closed)

D.maxspan=1s, maxpause=2s, maxevents=3

AnswerD

Positional arguments: maxspan, maxpause, maxevents in seconds.

Why this answer

Option B is correct because `transaction` without named fields uses `maxspan` as the first argument, `maxpause` as second, and `maxevents` as third, all in seconds. Option A is wrong because it's not index. Option C is wrong because it's not status.

Option D is wrong because it's not field list.

Full explanation →

128

MCQeasy

What is the type of visualization produced?

A.A bar chart.

B.A pie chart.

C.A scatter chart.

D.A line chart.

AnswerB

The `pie` command creates a pie chart.

Why this answer

Option D is correct. The `pie` command generates a pie chart from the statistical results. Option A (bar chart) is produced by `chart`, not `pie`.

Option B (scatter) and C (line) are not applicable.

Full explanation →

129

MCQmedium

A user creates a lookup definition for a CSV file containing user roles. The lookup is used in a search: `| lookup user_roles username OUTPUT role`. The search returns no additional field. The lookup file has columns: 'username', 'role', 'department'. What is the MOST likely issue?

A.The username field in the search is not extracted.

B.The lookup file is not in the lookups directory.

C.The lookup definition uses a different filename than the CSV.

D.The 'role' field is misspelled in the search.

AnswerC

If the definition points to a different file, no matches occur.

Why this answer

Option C is correct because the lookup definition must reference the exact filename of the CSV file. If the definition points to a different filename than the actual CSV, the lookup will fail to load the file, and no fields will be returned. The search syntax is otherwise correct, so the issue lies in the mismatch between the definition and the file.

Exam trap

Splunk often tests the distinction between a lookup definition that exists but points to the wrong file versus a missing file or a field extraction issue, so candidates mistakenly choose Option B (missing file) or Option A (field not extracted) instead of recognizing the filename mismatch.

How to eliminate wrong answers

Option A is wrong because the search uses the username field directly in the lookup command, and if it were not extracted, the lookup would not match any rows but would still attempt to run; the issue is that no additional field is returned at all, not that the lookup fails to match. Option B is wrong because if the file were not in the lookups directory, Splunk would log an error and the lookup would not run, but the question states the lookup is used and returns no additional field, implying the lookup definition exists but the file is not loaded correctly. Option D is wrong because the 'role' field is correctly spelled in the search and matches the CSV column; a misspelling would cause the OUTPUT clause to fail silently or return an empty field, but the core issue is that the lookup definition does not point to the correct file.

Full explanation →

130

MCQeasy

Which Common Information Model (CIM) data model is appropriate for standardizing authentication events?

A.Change Analysis

B.Network Traffic

C.Authentication

D.Endpoint

AnswerC

The Authentication data model covers login/logout, failed authentication, etc.

Why this answer

The CIM includes the 'Authentication' data model for authentication events. Option A is correct. Option B (Change Analysis) is for configuration changes.

Option C (Endpoint) is for operating system events. Option D (Network Traffic) is for network sessions.

Full explanation →

131

Multi-Selectmedium

Which THREE are valid uses of macros in Splunk? (Choose three.)

Select 3 answers

A.Dynamically switch between different indexes at search time based on user input.

B.Enforce security by hiding sensitive parts of the search from users.

C.Modify the results of a search after the search completes (post-processing).

D.Reduce duplicated SPL in the search language by reusing common sub-searches.

E.Parameterize searches by passing arguments such as time ranges or threshold values.

AnswersB, D, E

Macros can be used to grant execute-only access without exposing the underlying SPL.

Why this answer

Options B, C, and D are correct. Macros can encapsulate complex SPL, accept arguments, and be used to enforce security policies by restricting access to parts of searches. Option A is wrong because macros cannot dynamically change the index at search time (they can, but not as a primary purpose; it's discouraged due to performance).

Option E is wrong because macros are not designed to modify search results; they generate SPL that runs against events.

Full explanation →

132

MCQmedium

Refer to the exhibit. The search is intended to count the number of clients who made more than 3 HTTP requests within any 30-minute window. However, the results are unexpectedly high. What is the most likely reason?

A.The sourcetype does not contain enough methods to satisfy the condition.

B.The stats command should use `dc(clientip)` instead of `count by clientip`.

C.The mvcount function counts the number of unique methods, not events.

D.The same clientip can appear in multiple transactions, causing overcounting.

AnswerD

Each 30-minute window creates a separate transaction; stats count counts each transaction, not unique clients.

Why this answer

The transaction command groups events into transactions based on fields like clientip. If the same clientip appears in multiple transactions (e.g., because the 30-minute window resets or overlaps), that clientip will be counted multiple times in the final stats count. This overcounting inflates the result, making it unexpectedly high.

Exam trap

Splunk often tests the misconception that transaction groups all events for a given field into a single transaction, when in reality it can create multiple transactions per field value if events exceed the maxspan or maxpause limits.

How to eliminate wrong answers

Option A is wrong because the sourcetype's methods are irrelevant to the overcounting issue; the problem is with how transactions group events, not the content of the sourcetype. Option B is wrong because using dc(clientip) would count distinct clientips, which would still be overcounted if the same clientip appears in multiple transactions; the issue is transaction grouping, not the aggregation function. Option C is wrong because mvcount counts the number of values in a multivalue field (like methods), not events; the search uses count by clientip, not mvcount, so this is a misdirection.

Full explanation →

133

MCQmedium

A Splunk administrator notices that a scheduled saved search titled 'Nightly_Threat_Report' is not completing on time. The search runs at 2:00 AM daily and typically takes 15 minutes, but recently it has been timing out after 30 minutes. The search query is complex, joining data from multiple indexes. The administrator checks the 'savedsearch.log' and sees entries like 'Search job terminated due to dispatch time limit' and 'Search job exceeded max time'. The administrator wants to resolve the issue without changing the search logic or increasing system resource limits. Which action should the administrator take first?

A.Change the scheduled time to 1:00 AM to avoid other concurrent heavy searches.

B.Increase the 'dispatch.max_time' parameter for the saved search in savedsearches.conf.

C.Increase the 'dispatch.earliest_time' and 'dispatch.latest_time' settings for the saved search to allow a longer execution window.

D.Reduce the search time range from 'last 24 hours' to 'last 1 hour' to speed up the query.

AnswerB

This directly increases the dispatch time limit, allowing the search to complete.

Why this answer

Option B is correct because the error messages 'Search job terminated due to dispatch time limit' and 'Search job exceeded max time' directly indicate that the search is hitting the 'dispatch.max_time' limit, which defaults to 600 seconds (10 minutes) for scheduled searches. Increasing this parameter in savedsearches.conf extends the maximum execution time allowed for the search job, allowing the complex query to complete without changing the search logic or system resource limits.

Exam trap

The trap here is that candidates confuse 'dispatch.max_time' (execution timeout) with 'dispatch.earliest_time'/'dispatch.latest_time' (time range), leading them to incorrectly adjust the search window instead of the timeout limit.

How to eliminate wrong answers

Option A is wrong because changing the scheduled time does not address the root cause of the timeout; the search is failing due to its own execution time limit, not due to resource contention from concurrent searches. Option C is wrong because 'dispatch.earliest_time' and 'dispatch.latest_time' control the time range of data searched, not the execution timeout; increasing these would actually expand the data volume and worsen the timeout issue. Option D is wrong because reducing the search time range changes the search logic and data scope, which violates the constraint of not changing the search logic; it also may not guarantee completion if the query itself is inefficient.

Full explanation →

134

MCQmedium

A Splunk administrator is asked to create a dashboard that shows the top 10 source IPs by count of failed logins over the past week. The data is already CIM-compliant and uses the Authentication data model. Which search is most appropriate?

A.| tstats count from datamodel=Authentication where Authentication.action=failure by Authentication.src limit 10

B.| tstats summaryonly=true count from datamodel=Authentication where action=failure by src

C.| datamodel Authentication search | search action=failure | top src

D.| search sourcetype=* authentication action=failure | stats count by src | sort - count | head 10

AnswerA

Uses tstats on the accelerated data model with proper field and limit.

Why this answer

Option A is correct because it uses the CIM data model and the 'tag=authentication' to filter relevant events, then extracts the 'src' field (which is the CIM field for source IP). Option B uses '| datamodel Authentication' but does not use '| tstats' which is more efficient for accelerated data models. Option C uses '| tstats summaryonly=true' which loses detail.

Option D uses the Authentication data model correctly but does not sum by source IP.

Full explanation →

135

MCQhard

A large e-commerce company uses Splunk to monitor transactions. They have a CSV lookup file named 'customer_lookup.csv' containing 5 million rows of customer data (customer_id, name, address, tier). The lookup is used in a search that runs every hour to generate a report of sales by customer tier: index=transactions sourcetype=transaction_logs | lookup customer_lookup customer_id OUTPUT name, address, tier | timechart count by tier The search often times out or takes too long to complete, impacting operational dashboards. The team is considering optimization strategies. The lookup file is updated daily and stored in a custom app directory. The Splunk environment is distributed with a single search head and multiple indexers. Which of the following recommendations would most effectively improve the search performance?

A.Use a subsearch to first find matching customer IDs and then join with the lookup.

B.Convert the CSV lookup to a KV Store lookup by defining a collection and populating it from the CSV.

C.Use the `timechart` command with `partial=true` to allow the search to return intermediate results.

D.Remove the OUTPUT fields name and address from the lookup, keeping only the tier field.

AnswerB

KV Store lookups provide faster, indexed lookups for large datasets, reducing search time significantly.

Why this answer

While all options could offer some improvement, using a KV Store lookup is the most effective for large lookups. KV Store lookups are designed for high-performance key-value lookups and scale better than CSV lookups for millions of rows. This reduces the time spent matching events against the lookup.

Option A is correct.

Full explanation →

136

MCQhard

A performance analyst notices that a saved search running a macro with multiple `| eval` statements takes significantly longer than expected. The macro includes conditions like `| eval status=if(success=="true", "OK", "Fail")`. Which change would most likely improve performance?

A.Reduce the number of arguments passed to the macro.

B.Increase the summary index range to reduce the number of events processed.

C.Replace the `| eval` with a lookup table that maps the conditions.

D.Add more `| fields` commands to limit output fields.

AnswerC

Lookups are faster than per-event eval evaluations.

Why this answer

Option A is correct: using a lookup to precompute the status field will offload processing from search time. Option B: reducing arguments does not affect eval complexity. Option C: adding more fields increases overhead.

Option D: increasing summary range does not help the search itself.

Full explanation →

137

MCQmedium

The admin calls the macro as shown. What will be the expanded search string?

A.search index=main earliest=-1h latest=now | stats count by sourcetype | rename count as total

B.search index='main' earliest='-1h' latest=now | stats count by sourcetype | rename count as total

C.search index=-1h earliest=main latest=now | stats count by sourcetype | rename count as total

D.search index=main|earliest=-1h latest=now | stats count by sourcetype | rename count as total

AnswerA

Correct substitution: $index$ -> main, $time_range$ -> -1h.

Why this answer

Option A is correct because macro arguments are substituted: $index$ becomes 'main', $time_range$ becomes '-1h'. The brackets define the search inside. Option B incorrectly concatenates.

Option C uses single quotes. Option D swaps order.

Full explanation →

138

Multi-Selectmedium

Which TWO of the following are valid methods to join two sets of search results?

Select 2 answers

A.join

B.append

C.union

D.stats values(*)

E.addcoltotals

AnswersA, B

`join` merges results from two searches based on a field.

Why this answer

Option A is correct because the `join` command in Splunk merges two result sets based on a common field, similar to a SQL JOIN. It requires both datasets to have a matching field and supports inner, left, outer, and other join types. Option B is correct because the `append` command simply adds the results of a subsearch to the main search results, effectively concatenating the two sets without requiring a common field.

Exam trap

Splunk often tests the distinction between `join` and `append` versus non-existent or unrelated commands like `union` or `addcoltotals`, and candidates may confuse `stats values(*)` as a join method because it can combine values, but it does not join separate result sets.

Full explanation →

139

MCQeasy

An analyst wants to group events that start with a 'login' event and end with a 'logout' event, using the username field. Which transaction syntax is correct?

A.transaction username startswith=login endswith=logout

B.transaction username startswith="login" endswith="logout" maxspan=2h

C.transaction startswith="login" endswith="logout" by username

D.transaction username startswith="login" endswith="logout"

AnswerD

Correct: Proper syntax.

Why this answer

Option C is correct. The proper syntax is 'transaction username startswith="login" endswith="logout"'. Option A lacks quotes, Option B adds an unnecessary maxspan, and Option D incorrectly places the field.

Full explanation →

140

Multi-Selectmedium

Which three conditions can cause a transaction to close prematurely? (Choose three.)

Select 3 answers

A.The maxevents value is reached.

B.The startswith event is encountered again.

C.The maxpause value is exceeded.

D.The maxspan value is reached.

E.The endswith event is detected.

AnswersA, C, D

Correct: maxevents closes the transaction when event count reaches limit.

Why this answer

Options A, B, and C are correct. maxspan, maxpause, and maxevents all cause transactions to close when their limits are reached, potentially before all related events are grouped. Option D (endswith) closes the transaction by design, not prematurely. Option E (startswith again) can start a new transaction but does not necessarily close the current one prematurely.

Full explanation →

141

Multi-Selecteasy

Which TWO commands can be used to perform statistical aggregations on streaming events without creating a separate search results set?

Select 2 answers

A.timechart

B.stats

C.streamstats

D.eventstats

E.chart

AnswersC, D

streamstats adds stats to each event as the results stream, preserving all events.

Why this answer

C is correct because streamstats performs statistical aggregations on streaming events in real-time as they arrive, without creating a separate search results set. It computes windowed statistics (e.g., running sum, moving average) on the event stream itself, appending the result to each event. D is correct because eventstats also operates on the current event set, computing aggregations and adding the results as new fields to each event without generating a separate output set.

Exam trap

Splunk often tests the distinction between commands that produce a new results set (stats, chart, timechart) versus those that augment the existing event stream (streamstats, eventstats), and the trap here is that candidates confuse eventstats with stats, thinking both create separate outputs.

Full explanation →

142

Multi-Selectmedium

Which TWO of the following searches are syntactically valid uses of the eventstats command? (Assume all referenced fields exist.)

Select 2 answers

A.| eventstats perc95(response_time) by host | eval p95 = perc95

B.| eventstats sum(bytes) as total_bytes by src_ip | eval pct = bytes/total_bytes*100

C.| eventstats avg(response_time) as avg | eval diff = response_time - avg

D.| eventstats values(user) as users by session_id | eval num_users = len(users)

E.| eventstats max(_time)last_time by user | table last_time

AnswersB, C

Valid: eventstats adds the sum per src_ip, then eval calculates percentage.

Why this answer

Option B is correct because the `eventstats` command calculates aggregate statistics (here, `sum(bytes)`) over the entire result set or by group (here, `by src_ip`), and it adds the result as a new field (`total_bytes`) to every event. This allows the subsequent `eval` to compute a per-event percentage using that new field. Option C is correct because `eventstats` without a `by` clause computes the aggregate over all events and adds the result (here, `avg`) to each event, enabling the `eval` to calculate the difference from the global average.

Exam trap

Splunk often tests the distinction between `eventstats` and `stats`, and the trap here is that candidates confuse the syntax for aliasing (missing `as`) or use invalid eval functions like `len()` instead of `mvcount()`, leading them to select options that look plausible but are syntactically incorrect.

Full explanation →

143

MCQeasy

A user wants to create a timechart showing the number of distinct users per hour over the past week. Which search is correct?

A.index=web | bucket _time span=1h | stats dc(user) by _time

B.index=web | timechart dc(user) span=1h

C.index=web | timechart dc(user) by _time span=1h

D.index=web | timechart count by user span=1h

AnswerB

Correctly uses timechart with distinct count function.

Why this answer

Option B is correct because the `timechart` command automatically creates a time-based chart with a default or specified `span`, and `dc(user)` calculates the distinct count of users per time bucket. The `span=1h` argument sets the bucket size to one hour, and the `by` clause is not needed because `timechart` implicitly groups by `_time`. This produces the desired output of distinct users per hour over the past week.

Exam trap

Splunk often tests the distinction between `stats` and `timechart` — the trap here is that candidates mistakenly use `stats ... by _time` (Option A) thinking it produces a timechart, or misuse the `by` clause in `timechart` (Option C) by including `_time` as a split-by field, which breaks the time-based grouping.

How to eliminate wrong answers

Option A is wrong because `bucket _time span=1h` creates time buckets but does not automatically sort or chart the results, and `stats dc(user) by _time` would produce a table, not a timechart, and may not handle missing time buckets correctly. Option C is wrong because `timechart dc(user) by _time span=1h` incorrectly uses `by _time` as a split-by field, which would attempt to create a separate series for each unique timestamp value, not group by hour. Option D is wrong because `count by user` counts all events per user instead of distinct users, and `span=1h` is applied to `timechart` but the `by user` clause splits the chart by user, not by time, so it does not show distinct users per hour.

Full explanation →

144

MCQmedium

An analyst wants to create a timechart of the count of events per hour, but only for events where the field `status` contains the word "fail" (case-insensitive). Which search is correct?

A.index=main | timechart count | search status=*fail*

B.index=main | regex status="fail" | timechart count

C.index=main | where status="*fail*" | timechart count

D.index=main | timechart count by eval(case(match(status,"(?i)fail"),1))

AnswerD

match with (?i) does case-insensitive regex and eval creates a field for timechart.

Why this answer

Option D is correct because it uses `eval` with `case` and `match` to create a field that is 1 when `status` contains 'fail' (case-insensitive via `(?i)`), then uses `timechart count by` that field to count only matching events per hour. This approach correctly filters within the timechart aggregation, ensuring only events where status matches the pattern are counted.

Exam trap

Splunk often tests the distinction between `search` (which supports wildcards) and `where` (which does not), and the requirement to use `match` with regex flags for case-insensitive substring matching in aggregation commands.

How to eliminate wrong answers

Option A is wrong because it runs `timechart count` on all events first, then applies `search status=*fail*` after the timechart, which would filter the timechart results (which no longer have individual event fields) rather than filtering events before counting. Option B is wrong because `regex status="fail"` is a case-sensitive exact match, not a case-insensitive substring search, and it would fail to match 'Failed' or 'FAIL'. Option C is wrong because `where status="*fail*"` treats the asterisks as literal characters, not wildcards; `where` does not support wildcard patterns like `search` does.

Full explanation →

145

Multi-Selecthard

Which THREE conditions must be met for events to be grouped into the same transaction when using the 'transaction' command without any 'startswith' or 'endswith' options? (Choose three.)

Select 3 answers

A.Events must have the same value in the field specified by the 'by' clause.

B.The time difference between the first and last event must not exceed the maxspan value.

C.Events must be from the same sourcetype.

D.The time gap between consecutive events must not exceed the maxpause value.

E.Events must appear in chronological order with no missing timestamps.

AnswersA, B, D

The 'by' clause defines the grouping field.

Why this answer

Option A is correct because the 'by' clause in the 'transaction' command defines a field whose value must be identical across all events in a transaction. Without 'startswith' or 'endswith', the transaction command groups events solely based on the 'by' field, the 'maxspan' time window, and the 'maxpause' gap between consecutive events. This ensures that only events sharing the same field value are considered part of the same logical transaction.

Exam trap

The trap here is that candidates often assume events must share the same sourcetype (Option C) because they confuse the 'transaction' command with the 'stats' or 'eventstats' commands, which do not inherently require sourcetype matching, or they mistakenly think chronological order is enforced (Option E) when in fact the command handles ordering internally.

Full explanation →

146

MCQmedium

A Splunk administrator is troubleshooting a slow dashboard that uses a timechart with a large dataset. Which of the following is a best practice to improve performance?

A.Increase the number of concurrent searches

B.Use the eval command to add fields

C.Use data model acceleration

D.Increase the search bucket count

AnswerC

Data model acceleration speeds up searches on data models significantly.

Why this answer

Data model acceleration pre-computes data for faster searches. Increasing bucket count, using eval, or increasing concurrent searches do not directly improve performance and may worsen it.

Full explanation →

147

MCQmedium

A Splunk admin created a macro named `filter_by_region` that takes one argument: the region code. The macro definition is: `index=main sourcetype=web region=$region$`. When a user runs the search `| `filter_by_region US`` they get no results, but when they replace the macro with the actual string `index=main sourcetype=web region=US`, they get results. What is the problem?

A.The macro definition does not specify an argument list.

B.The macro argument is not passed correctly because of quotation marks.

C.The user does not have execute permissions for the macro.

D.The macro uses double dollar signs incorrectly; it should be `$region$`.

AnswerA

Correct: Without an argument list, the macro does not recognize `$region$` as a variable.

Why this answer

Option A is correct: The macro definition does not include an argument list, so `$region$` is treated as literal text. The correct definition should be `filter_by_region(region)` in the definition name. Double dollar signs are correct for variable expansion.

Quotation marks are not an issue here. Permissions would cause an error message.

Full explanation →

148

MCQhard

A Splunk admin is accelerating a CIM data model for the "Network_Traffic" dataset. After acceleration, some searches that use the data model are slower than expected. What is the most likely reason?

A.The acceleration uses too many fields

B.The data model acceleration is not compatible with the CIM

C.Searches are not using the `| datamodel` command correctly

D.The acceleration summary range is set too low

AnswerD

Correct: A low summary range excludes older data from acceleration.

Why this answer

Option A is correct: If the summary range (e.g., 60 days) is set too low, older data may not be accelerated, causing slower searches across older time ranges. Option B is false; CIM data models are designed for acceleration. Option C could be a reason but less specific.

Option D is not a typical issue; acceleration indexes specific fields.

Full explanation →

149

Multi-Selectmedium

Which TWO of the following commands are useful for reducing the number of events before a `stats` command to improve performance? (Choose 2)

Select 2 answers

A.head

B.transaction

C.sort

D.eval

E.fields

AnswersA, E

Limiting events with head reduces the number of events processed.

Why this answer

The `head` command limits the number of events processed by returning only the first N events from the search results. By reducing the event volume early in the pipeline, it significantly decreases the workload on the subsequent `stats` command, improving performance when only a sample or the most recent events are needed.

Exam trap

Splunk often tests the misconception that `sort` or `transaction` can reduce event volume, but candidates must remember that these commands either require full event sets or increase complexity, making `head` and `fields` the correct choices for performance optimization before aggregation.

Full explanation →

150

Multi-Selectmedium

Which of the following are characteristics of the Splunk Common Information Model (CIM)? (Choose three.)

Select 3 answers

A.The CIM automatically renames all fields in incoming data to match its standard

B.It requires the installation of the Splunk Common Information Model Add-on

C.Data models defined in the CIM can be accelerated to improve search performance

D.It provides a set of standard field names and tags for different data sources

E.The CIM includes predefined dashboards and reports

AnswersB, C, D

Correct: The add-on must be installed.

Why this answer

Options A, B, and C are correct. A: The CIM provides standard field names and tags. B: The CIM requires the Splunk Common Information Model Add-on to be installed.

C: Data models defined in the CIM can be accelerated to improve search performance. D is false: The CIM does not automatically rename fields; it requires field aliases or other transformations. E is false: The CIM provides data models, not dashboards (though some apps may provide dashboards).

Full explanation →

Page 2 of 7

All pages

Practice SPLK-1003 by domain

Target a specific domain to shore up weak areas.

Advanced Searching and Statistics Macros, Saved Searches and CIM Advanced Visualization and Lookups Transactions and Event Correlation

See all domains with question counts →