Knowledge + Practice

Splunk Core Certified Power User SPLK-1003 (SPLK-1003) — Questions 301–375

500 questions total · 7pages · All types, answers revealed

Take a mock exam Exam hub

Page 5 of 7

301

MCQeasy

A security analyst wants to map IP addresses to hostnames using a CSV lookup file. Which command is correct to define a lookup that maps the IP field to hostname field, with the file named 'ip_host.csv'?

A.lookup ip_host.csv hostname OUTPUT ip

B.lookup ip_host.csv hostname as hostname ip as ip_output

C.inputlookup ip_host.csv

D.lookup ip_host.csv ip as ip_output hostname as hostname_output

AnswerD

Correct syntax for lookup with field mapping.

Why this answer

Option A is correct because the lookup command syntax is 'lookup <lookup-table> <lookup-field> [OUTPUT <output-field>]' and here 'ip as ip_output hostname as hostname_output' correctly maps input ip to output ip_output and hostname to hostname_output. Option B is wrong because inputlookup returns all rows, not a join. Option C is wrong because the order is reversed.

Option D is wrong because 'OUTPUT' without aliases is incorrect.

Full explanation →

302

MCQmedium

An analyst writes `transaction client_ip` to group events from a firewall. The resulting transactions show many events with duration=0. What is the most likely cause?

A.The client_ip field contains duplicates

B.The transaction option maxspan is set too high

C.The events are not time-stamped properly

D.There is only one event per client_ip in the time range

AnswerD

If only one event exists, the transaction will have duration 0. To avoid this, use startswith/endswith or adjust maxspan.

Why this answer

A duration of 0 often occurs when there is only one event in the transaction. This can happen if the events do not meet the criteria for starting or ending a transaction, or if the maxpause is too short.

Full explanation →

303

MCQhard

A Splunk analyst runs the above search. The results show that some transactions have a duration of 0 seconds. What is the most likely cause?

A.The transaction command failed to group events properly and returned only the login event.

B.The transaction command is processing events out of order, causing login and logout timestamps to be the same.

C.The maxevents=5 limitation causes the transaction to close early, but the logged duration is still calculated correctly from the first event timestamp.

D.Some user sessions are missing a logout event, resulting in a transaction that consists of only a login event, so _time_delta is undefined or zero.

AnswerD

Without a logout event, the transaction may contain only one event, and duration is not calculated, defaulting to 0.

Why this answer

Option D is correct because when a transaction lacks an end event (like a logout), the transaction command closes based on other limits (e.g., maxspan, maxpause, or maxevents) and contains only the start event. In such cases, the duration (_time_delta) is calculated from the first event's timestamp to the last event's timestamp; with only one event, the difference is zero or undefined, resulting in a 0-second duration.

Exam trap

Splunk often tests the misconception that a 0-second duration is caused by a grouping or ordering error, when in fact it is a direct result of incomplete transactions (missing end events) within the transaction command's logic.

How to eliminate wrong answers

Option A is wrong because the transaction command groups events correctly based on the fields specified (e.g., user or session ID); a 0-second duration is not caused by a failure to group but by incomplete transactions. Option B is wrong because the transaction command processes events in time order (as indexed) and does not arbitrarily reorder timestamps; if login and logout timestamps were the same, it would indicate simultaneous events, not a processing order issue. Option C is wrong because maxevents=5 limits the number of events in a transaction but does not cause early closure that results in a 0-second duration; the duration is calculated from the first to the last event in the transaction, so if multiple events exist, the duration would be non-zero.

Full explanation →

304

MCQeasy

A network engineer wants to add geographic location (city, country) to firewall logs based on source IP. Which lookup type is most appropriate?

A.KV Store lookup

B.Scripted lookup

C.Automatic lookup configured in props.conf

D.File-based lookup (CSV)

AnswerD

A CSV file with IP ranges and locations is straightforward and performs well for static reference data.

Why this answer

Option C is correct because a CSV lookup file containing IP-to-location mappings is simple and efficient for static geographic data. Option A is incorrect because automatic lookups are for search-time enrichment but require prior lookup definition. Option B is incorrect because KV Store is overkill for static data.

Option D is incorrect because scripted lookups are for complex external data sources.

Full explanation →

305

Multi-Selecthard

Which THREE of the following are valid ways to correlate events in Splunk? (Select exactly 3 correct answers.)

Select 3 answers

A.Using the subsearch command.

B.Using the join command with a common field.

C.Using the append command.

D.Using the stats command with values().

E.Using the transaction command with a common field.

AnswersB, D, E

Correct: join correlates events from two datasets.

Why this answer

Transaction groups events based on common fields. Join can correlate events from two searches. Stats with values can also correlate by grouping events into a single result.

Append and subsearch do not perform correlation.

Full explanation →

306

MCQhard

A large enterprise uses multiple Splunk search heads. An admin wants to create a saved search that automatically runs on all search heads and sends a single alert email per triggered result, not per search head. Which saved search setting should be configured?

A.Set the time range to 'Real-time' to capture events as they happen.

B.Enable 'Alert Suppression' to suppress duplicate alerts.

C.Set Alert Type to 'Per Result' to trigger an alert for each matching event.

D.Set the Schedule to 'Continuous' to avoid duplicates.

AnswerC

Per Result triggers an alert action for each search result; combined with throttling, you can limit emails.

Why this answer

Option B is correct because 'Per Result' alerting ensures each result triggers an alert action, but the challenge is to have one email per result across search heads. Actually, in a multi-search head environment, saved searches run independently. To deduplicate, you'd need to use summary indexing or a central index.

However, the question asks for the setting to achieve single email per result: the best is to set alert type to 'Per Result' and then use 'Throttle' to limit to one email per result. But among options, 'Per Result' is key. Option A is for scheduling.

Option C suppresses consecutive identical alerts. Option D is for time range.

Full explanation →

307

MCQmedium

A security analyst runs `index=network sourcetype=firewall | stats count by src_ip | sort - count | head 10` to find the top 10 source IPs by event count. The search returns only 5 results. Which of the following is the most likely reason?

A.The search time range is too short, so only a few events are counted.

B.The sort command should be `sort - count` without space.

C.The stats command should include a by clause with count in the field list.

D.There are fewer than 10 unique source IPs in the results.

AnswerD

If the number of distinct src_ip values is less than 10, head 10 returns all of them, resulting in fewer than 10 rows.

Why this answer

Option D is correct because the `stats count by src_ip` command groups events by each unique source IP address and counts them. If the search returns only 5 results, it means there are only 5 unique source IPs in the dataset matching the time range and filters. The `head 10` command then limits output to 10 rows, but since only 5 groups exist, only 5 rows are returned.

Exam trap

The trap here is that candidates assume `head 10` always returns 10 results, forgetting that `head` limits the number of output rows from the preceding command, which may already have fewer rows than the limit.

How to eliminate wrong answers

Option A is wrong because a short time range would reduce the total event count, but the `stats count by src_ip` command still groups by unique IPs; if there are more than 10 unique IPs, the search would return 10 results regardless of total event count. Option B is wrong because `sort - count` with a space is valid syntax in SPL; the space between the dash and the field name is optional and does not cause the command to fail. Option C is wrong because the `stats count by src_ip` command already includes `count` as the aggregation function and `src_ip` as the grouping field; there is no requirement to list `count` in the `by` clause.

Full explanation →

308

MCQeasy

Refer to the exhibit. The macro `count_by_host` is defined as shown. The macro is invoked as `| `count_by_host`. What will the expanded search look like?

A.`| stats count by host, sourcetype`

B.`| `count_by_host`

C.`stats count by host, sourcetype`

D.`| | stats count by host, sourcetype`

AnswerD

Correct: Double pipe due to leading pipe in macro definition.

Why this answer

Option A is correct: Since the macro definition includes a leading pipe, invoking it with `| `count_by_host` results in two pipes – one from the invocation and one from the definition. So the expanded search becomes `| | stats count by host, sourcetype`. Option B would be missing one pipe, option C would be missing both, and option D shows the macro invocation unexpanded.

Full explanation →

309

MCQmedium

A financial company wants to group all events related to a single trading session. The session ID appears in all events. Which is the most efficient way to correlate these events without using transaction?

A.Use sort to order events by timestamp.

B.Use stats with values() on the event fields.

C.Use join to combine events on sessionId.

D.Use append with a subsearch.

AnswerB

stats by sessionId with values(*) groups all events by session.

Why this answer

Option B is correct because using stats with values() can list all events per session ID efficiently. Option A (append) merges results but doesn't group. Option C (join) is for lookup.

Option D (sort) reorders but doesn't group.

Full explanation →

310

MCQhard

A Splunk admin is troubleshooting a transaction that groups firewall allow and deny events by session ID. The transaction should end when a deny event occurs for that session. Which transaction option should be used to define the end condition?

A.endswith="action=allow"

B.startswith="action=deny"

C.endswith="action=deny"

D.maxevents=2

AnswerC

Correct: This ends the transaction when a deny event is encountered.

Why this answer

Option C is correct because 'endswith='action=deny'' specifies the event that terminates the transaction. Option A starts with deny, Option B ends with allow, and Option D limits the number of events but does not define an end condition.

Full explanation →

311

MCQhard

GlobalTech runs Splunk Enterprise Security with CIM compliance. Their security operations center uses a scheduled saved search named 'Brute Force Detection' that runs every 30 minutes. The search definition is: `| tstats count from datamodel=Authentication where Authentication.action=failure by Authentication.user, Authentication.src | where count > 5 | join type=outer user [search index=* sourcetype=linux_secure | stats count by user | where count > 5]`. This search has been working for months. Recently, after an upgrade to the Splunk environment, the saved search started returning no results. The administrator checks the search log and sees that the tstats portion runs fine but the secondary search (the subsearch) returns no events even though there are matching events in the index. The subsearch uses a macro named 'get_failed_users' defined as `search index=* sourcetype=linux_secure "Failed password" | stats count by user | where count>5` with no arguments. The administrator confirms that the macro's search works when run manually in the same time range. What is the most likely reason the subsearch returns no results?

A.The subsearch is not part of the data model acceleration and is limited by the time range of the main search.

B.The macro 'get_failed_users' is not defined in the same app context as the saved search.

C.The subsearch uses a macro, and macros cannot be used in subsearches.

D.The macro definition has a typo in the search command.

AnswerB

Correct. After an upgrade, the app context might have changed, causing the macro to be unavailable.

Why this answer

Option B is correct because macros are resolved in the context of the app where the saved search is defined. If the macro 'get_failed_users' is not defined in the same app context as the 'Brute Force Detection' saved search, the subsearch will fail to resolve the macro and return no results, even though the macro works when run manually in a different app context. Splunk's macro resolution depends on the app context of the search, not the user's current app.

Exam trap

The trap here is that candidates assume macros are globally available or that the subsearch's manual success implies it will work in the saved search, overlooking the critical role of app context in macro resolution.

How to eliminate wrong answers

Option A is wrong because the subsearch is not limited by the time range of the main search; subsearches default to the same time range as the main search unless explicitly overridden, and the tstats portion runs fine, indicating time range is not the issue. Option C is wrong because macros can be used in subsearches; there is no restriction preventing macros from being used within subsearches. Option D is wrong because the administrator confirmed that the macro's search works when run manually in the same time range, ruling out a typo in the macro definition.

Full explanation →

312

Matchingmedium

Match each Splunk license violation type to its consequence.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Indicates usage is near the limit

Usage exceeds license quota, search may be limited

License has expired, functionality is restricted

License key is incorrect or corrupted

Usage is within license limits

Why these pairings

License management ensures proper usage of Splunk.

Full explanation →

313

Multi-Selecteasy

Which TWO options can be used with the `transaction` command to define the beginning and end of a transaction?

Select 2 answers

A.closed_txn

B.maxpause

C.endswith

D.startswith

E.maxspan

AnswersC, D

Defines the end event.

Why this answer

startswith and endswith define boundary events. maxspan and maxpause are constraints.

Full explanation →

314

Multi-Selecthard

Which TWO components must be configured to enable an automatic lookup that populates fields at index time?

Select 2 answers

A.transforms.conf to define the lookup table and the field mapping.

B.The lookup table file must be placed in the lookup directory of the app.

C.indexes.conf to enable lookup acceleration.

D.props.conf to specify the automatic lookup stanza for the sourcetype.

E.The lookup must be defined via the Lookups menu in Splunk Web only.

AnswersA, D

The lookup definition (filename, fields, match type) is in transforms.conf.

Why this answer

Options A and B are correct. transforms.conf defines the lookup table and field mapping, and props.conf specifies which sourcetype should use the lookup automatically. Options C, D, and E are not required configuration files for the automatic lookup to work.

Full explanation →

315

MCQeasy

An analyst wants to correlate events from different sourcetypes (e.g., authentication logs and VPN logs) that share a common user field. The goal is to create a single event per user session containing all fields from both sourcetypes. Which command is best suited for this?

A.append

B.union

C.transaction

D.join

AnswerC

Correct: Groups events by common field across sourcetypes.

Why this answer

Option D is correct. The transaction command groups events based on a common field (user) and can include events from multiple sourcetypes. Options A (append) adds rows, B (join) requires a field and may not preserve sessions, C (union) combines schemas but does not correlate.

Full explanation →

316

MCQeasy

An analyst wants to identify the top 5 user agents that generated the most 404 errors in the last 24 hours. Which search accomplishes this correctly and efficiently?

A.index=web status=404 | top limit=5 user_agent

B.index=web | top limit=5 user_agent status=404

C.index=web | top limit=5 user_agent

D.index=web | stats count by user_agent | where status=404 | top 5 user_agent

AnswerA

Correctly filters for 404 errors and efficiently returns top 5 user agents using the top command.

Why this answer

Option A is correct because it first filters events to only those with status=404, then uses the `top` command with `limit=5` to efficiently count and rank user_agent values. This ensures the search only processes relevant events, minimizing resource usage and returning the correct top 5 user agents for 404 errors.

Exam trap

Splunk often tests the order of operations in Splunk SPL, specifically that filtering commands like `status=404` must precede statistical commands like `top` or `stats` to ensure the aggregation is performed only on the subset of interest, not on the entire dataset.

How to eliminate wrong answers

Option B is wrong because the `top` command processes fields in the order they are listed; placing `user_agent` before `status=404` means it will count user_agent values across all events, then apply the status=404 filter as a secondary field, which does not restrict the count to only 404 errors. Option C is wrong because it omits the status=404 filter entirely, returning the top 5 user agents across all HTTP status codes, not just 404 errors. Option D is wrong because the `where status=404` clause is placed after the `stats count by user_agent` command, which already aggregated data without the status filter; at that point, the `status` field is no longer available in the results, causing the search to fail or return no results.

Full explanation →

317

Matchingmedium

Match each Splunk search mode to its behavior.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Optimizes for speed, may skip event data

Balances speed and completeness (default)

Returns all available fields for each event

Searches data as it is indexed

Searches data already indexed

Why these pairings

Search modes control how Splunk processes and returns results.

Full explanation →

318

MCQeasy

A Splunk search uses 'transaction clientip maxpause=5m'. What does the maxpause setting control?

A.The maximum number of transactions allowed.

B.The maximum number of events in the transaction.

C.The maximum total time span of the transaction.

D.The maximum time gap between events in the transaction.

AnswerD

Correct: maxpause defines the allowed gap between consecutive events.

Why this answer

maxpause sets the maximum inactivity timeout: if no new event for the same clientip arrives within 5 minutes, the transaction is closed.

Full explanation →

319

Multi-Selecteasy

Which TWO statements about lookup tables are true?

Select 2 answers

A.Lookups can only be defined by administrators.

B.A lookup definition can include time-based expiration.

C.Lookups are case-sensitive by default.

D.Lookups can be defined from CSV files or KV store collections.

E.Lookups can only be used in the 'lookup' command.

AnswersB, D

Time-based lookups allow automatic refresh.

Why this answer

Options B and C are correct. B: Lookups can be defined from CSV files or KV store. C: Time-based expiration is a feature.

A is false because lookups can be used in multiple commands. D is false because they are case-insensitive by default. E is false because they can be defined by power users.

Full explanation →

320

MCQhard

A Splunk admin notices that a scheduled search using inputlookup is returning inconsistent results. The lookup file is stored on the search head and is updated via a script every 15 minutes. What is the most likely cause of the inconsistency?

A.The search head is not configured as a lookup server

B.The lookup file contains duplicate entries with different timestamps

C.The lookup file is cached and not automatically refreshed

D.The lookup file exceeds the maximum file size

AnswerC

inputlookup caches the file; changes require a reload or restart.

Why this answer

The most likely cause is that the lookup file is cached by Splunk after the first read, and subsequent updates via the script do not automatically refresh the in-memory cache. By default, Splunk caches lookup files on the search head to improve performance, and changes to the file are not reflected until the cache expires or is manually cleared. This leads to inconsistent results when the scheduled search runs against a stale cached version.

Exam trap

Splunk often tests the misconception that file updates are immediately reflected in search results, when in reality Splunk's caching mechanism introduces a delay that can cause inconsistency unless the cache is explicitly refreshed.

How to eliminate wrong answers

Option A is wrong because the concept of a 'lookup server' applies to distributed environments where lookups are shared across search heads, but the issue here is local caching on a single search head, not server configuration. Option B is wrong because duplicate entries with different timestamps would cause consistent behavior (e.g., returning the first match) rather than inconsistency across runs; the problem is about stale data, not duplicate resolution. Option D is wrong because exceeding the maximum file size would cause the lookup to fail entirely or be truncated, not produce inconsistent results; the search would either error out or return partial data consistently.

Full explanation →

321

Multi-Selectmedium

Which TWO of the following are valid considerations when defining macros in Splunk?

Select 2 answers

A.Macros can be imported from other apps if they are shared globally.

B.Macros cannot contain transforming commands.

C.Macros can only accept a single argument.

D.Macro names must be unique within the app where they are defined.

E.Macro definitions can be created directly in the search bar.

AnswersA, D

Sharing to global makes macros available across apps.

Why this answer

Correct answers: A and D. Macros must be defined with a unique name within an app (A), and macros can be imported from other apps if shared globally (D). Option B is false because macros can accept multiple arguments.

Option C is false because macro definition is done in configuration files, not in the search bar. Option E is not a requirement; macros can contain any commands.

Full explanation →

322

Matchingmedium

Match each Splunk search operator to its behavior.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Pipes output of one command to the next

Excludes events that match the following term

Matches events that contain either term

Matches events that contain both terms (default)

Groups terms to control evaluation order

Why these pairings

Operators control how search terms are combined and piped.

Full explanation →

323

MCQmedium

A transaction is created using the command: 'index=web status=200 OR status=404 | transaction sessionid'. The user wants to include transactions only if they contain both a 200 and a 404 status. Which additional step achieves this?

A.| transaction sessionid keepevicted=true | where mvcount(status)>=2

B.| where mvcount(mvdedup(status))>=2

C.| search status="200" OR status="404"

D.| where mvcount(status)==2

AnswerB

Correct: Counts distinct status values.

Why this answer

Option A is correct because 'mvcount(mvdedup(status))>=2' counts distinct status values; if both are present, it returns >=2. Option B (OR) includes sessions with either status, Option C (keepevicted) is not needed, and Option D counts total occurrences, not distinct.

Full explanation →

324

MCQmedium

Refer to the exhibit. A security analyst runs the above search. Which of the following best describes the result?

A.Transactions for all source IPs, but only showing src_ip 10.0.0.1 in the table

B.Transactions of all firewall events for src_ip 10.0.0.1, each lasting up to 5 minutes

C.Transactions of src_ip 10.0.0.1 that start with deny and end with allow

D.Transactions beginning with 'allow' and ending with 'deny' for src_ip 10.0.0.1, with a maximum duration of 5 minutes

AnswerD

Correct interpretation of the transaction parameters.

Why this answer

Option C is correct because the transaction groups events by src_ip, starts with 'allow' and ends with 'deny', and limits to 5-minute windows. Events within that window that fit the pattern will form transactions. Option A is incorrect because it says 'all events' but the start/end conditions filter.

Option B is incorrect because it reverses start/end. Option D is incorrect because maxspan is 5 minutes, not 1 hour.

Full explanation →

325

MCQhard

A search using the transaction command is producing many partial transactions that are closed due to maxpause, but these transactions are often relevant and should not be discarded. Which option should be added to the transaction command to keep these partial results?

A.keepopen=true

B.keepevicted=true

C.closed=true

D.partial=true

AnswerC

Correct: closed=true is not a valid option; the correct option is keepevicted=true. (Distractor)

Why this answer

The keepevicted=true option retains transactions that are closed because of maxpause or maxspan, ensuring they are included in results.

Full explanation →

326

MCQeasy

Which transaction option should be used to ensure that a transaction does not exceed a total duration of 10 minutes?

A.endswith="end"

B.maxpause=10m

C.startswith="start"

D.maxspan=10m

AnswerD

maxspan limits the total time span from first to last event.

Why this answer

Option A is correct because maxspan sets the maximum total duration of a transaction. Option B (maxpause) limits gaps between events. Option C (startswith) sets the start event.

Option D (endswith) sets the end event.

Full explanation →

327

Multi-Selecthard

A saved search that runs every hour is showing 'No results' in its history, but the same search when run manually returns results. Which two of the following are likely causes? (Choose TWO.)

Select 2 answers

A.The saved search's acceleration is outdated.

B.The user who created the saved search has been deleted.

C.The saved search uses a macro that has a typo in its definition.

D.The index being searched is not available at the scheduled time.

E.The saved search uses a different time range than the manual search.

AnswersD, E

Correct: Temporary index unavailability at the scheduled time results in no data.

Why this answer

Options A and D are correct. Time range differences: scheduled searches use a 'last hour' window, while manual may use 'All time'. Index availability at schedule time can cause empty results.

A macro typo would cause an error, not empty results. Outdated acceleration would show old data, not empty. Creator deletion might cause a permissions error.

Full explanation →

328

MCQmedium

A security analyst needs to find the top 10 users with the most failed login attempts from the linux_secure sourcetype. Which SPL command is most efficient for this task?

A.index=main sourcetype=linux_secure "Failed password" | top limit=10 user

B.index=main sourcetype=linux_secure "Failed password" | stats count by user | sort 10 -count

C.index=main sourcetype=linux_secure "Failed password" | stats count by user | sort -count | head 10

D.index=main sourcetype=linux_secure | regex _raw="Failed password" | stats count by user | top limit=10

AnswerA

The `top` command is optimized for finding top values and is efficient for this scenario.

Why this answer

Option A is correct because the `top` command in SPL is specifically designed to return the most frequent values of a field, and the `limit=10` parameter directly restricts the output to the top 10 results. This approach is more efficient than using `stats count` followed by `sort` and `head` because `top` performs the aggregation and ranking in a single operation, reducing processing overhead. The search also correctly filters for 'Failed password' events within the `linux_secure` sourcetype, ensuring only failed login attempts are considered.

Exam trap

Splunk often tests the misconception that `stats count by user | sort -count | head 10` is functionally equivalent to `top limit=10 user`, but the trap is that `top` is more efficient and is the idiomatic Splunk command for this task, while the multi-command approach is less optimal and may be penalized in performance-sensitive scenarios.

How to eliminate wrong answers

Option B is wrong because `sort 10 -count` is invalid syntax; the `sort` command requires the field name and direction (e.g., `sort -count`), and the limit must be applied via `head` or the `limit` parameter in `top`. Option C is wrong because while it produces the correct result, it is less efficient than option A; it requires two separate commands (`stats` then `sort` then `head`) instead of the single `top` command, and the `head 10` is redundant if `top limit=10` is used. Option D is wrong because it uses `regex _raw="Failed password"` instead of a simple search term, which is less efficient; Splunk's indexed search for a literal string is faster than applying a regex to the raw event data, and the `top limit=10` at the end is redundant since `stats count by user` already aggregated the data, making the `top` command unnecessary.

Full explanation →

329

MCQhard

A Splunk admin wants to track the number of unique users who accessed a system each hour over the past 24 hours. Which search provides the correct result?

A.index=main earliest=-24h | timechart span=1h dc(user) as unique_users

B.index=main earliest=-24h | timechart span=1h values(user)

C.index=main earliest=-24h | stats dc(user) by _time | timechart span=1h dc(user)

D.index=main earliest=-24h | timechart span=1h count by user

AnswerA

dc(user) gives distinct count of users per hour with timechart.

Why this answer

Option A is correct because it uses `timechart span=1h dc(user)` to count distinct users per hour over the last 24 hours. The `dc()` function calculates distinct counts, and `span=1h` sets the time bucket to one hour, exactly matching the requirement.

Exam trap

The trap here is confusing `count` (total events) with `dc()` (distinct values), and assuming `values()` or `count by user` can produce a unique user count per time period.

How to eliminate wrong answers

Option B is wrong because `values(user)` returns a multivalue list of users per hour, not a count of unique users. Option C is wrong because `stats dc(user) by _time` groups by raw event timestamps, not hourly buckets, and then `timechart` cannot properly aggregate pre-grouped data, leading to incorrect results. Option D is wrong because `count by user` counts events per user per hour, not the number of unique users; it produces a separate series for each user rather than a single count of distinct users.

Full explanation →

330

MCQeasy

A transaction search is processing too many fields. Which command should be used immediately before the transaction command to reduce memory usage?

A.fields - _raw, _time

B.fields + user_id, _time

C.fields - * except user_id

D.fields user_id, _time

AnswerD

Correct: this keeps only the necessary fields.

Why this answer

Option C is correct because 'fields user_id, _time' keeps only the essential fields. Option A is incorrect because it removes _raw and _time, but keeps other fields. Option B is incorrect because 'fields - * except user_id' removes _time.

Option D is incorrect because 'fields +' appends, not replaces.

Full explanation →

331

Multi-Selecthard

Which THREE of the following are valid uses of the stats command? (Select three.)

Select 3 answers

A.Calculating the average of a field across all events.

B.Finding the earliest timestamp for each category.

C.Grouping events by a categorical field and counting them.

D.Creating a time-based chart with multiple series.

E.Enriching events with fields from an external lookup.

AnswersA, B, C

Stats avg() computes average

Why this answer

The `stats` command in Splunk is used to perform statistical aggregations on search results. Option A is correct because `stats avg(field)` calculates the arithmetic mean of a specified field across all events in the result set. Option B is correct because `stats earliest(_time) by category` returns the minimum timestamp for each distinct value of the category field, which is a standard use of the `earliest()` function.

Option C is correct because `stats count by category` groups events by the categorical field and returns the number of events in each group, a fundamental aggregation pattern.

Exam trap

Splunk often tests the distinction between `stats` and `timechart`; the trap here is that candidates see 'time-based chart' and incorrectly assume `stats` can produce it, but `timechart` is the only command that automatically bins events into time buckets and supports multiple series via the `by` clause.

Full explanation →

332

MCQhard

An organization uses Splunk CIM to normalize data from multiple sources. They have a custom data source that logs firewall events with a field 'action' containing values 'accept', 'deny', 'drop'. They want to map this to the CIM field 'action'. Which configuration is required?

A.Define a field alias in props.conf: `FIELDALIAS-action = action as action`

B.Use the 'calculatedfields' field in props.conf: `CALCULATED_action = if(action=="accept","allowed",if(action=="deny","blocked","dropped"))`

C.Create a custom data model that includes the field 'action' with the vendor values.

D.Add a tag 'action=accept' to events with action=accept, and similarly for deny and drop.

AnswerA

This maps the vendor field 'action' to the CIM field 'action'.

Why this answer

Option A is correct because the CIM field 'action' already exists in the CIM data model with the same name as the vendor field. A field alias in props.conf using `FIELDALIAS-action = action as action` simply creates an alias so that the vendor's 'action' field is recognized as the CIM 'action' field, allowing the CIM to normalize the data without any transformation. This is the simplest and most efficient method when the vendor field name and values already match the CIM field name and expected values.

Exam trap

The trap here is that candidates often overcomplicate the solution by choosing a calculated field or custom data model, not realizing that when the vendor field name and values already align with the CIM field, a simple field alias is the correct and efficient approach.

How to eliminate wrong answers

Option B is wrong because it uses a calculated field to transform the vendor values into different strings ('allowed', 'blocked', 'dropped'), which would break CIM normalization since the CIM 'action' field expects values like 'accept', 'deny', 'drop' (or 'allowed', 'blocked', 'dropped' depending on the CIM version, but the question states the vendor values are already correct). Option C is wrong because creating a custom data model is unnecessary and overly complex; the CIM already defines the 'action' field, and the goal is to map the vendor data into the existing CIM model, not create a new one. Option D is wrong because tagging is used for event type classification and search-time filtering, not for field value normalization; tags do not map field values to CIM fields.

Full explanation →

333

Multi-Selecthard

Which TWO of the following statements about the `transaction` command are true? (Choose two.)

Select 2 answers

A.The transaction command can only be used on events that have a timestamp.

B.The transaction command uses a sliding time window to detect transaction boundaries.

C.The transaction command can group events that share a common field value, such as a session ID.

D.The transaction command adds fields such as `duration` and `eventcount` to each transaction.

E.The transaction command removes all fields except those specified in the `fields` argument.

AnswersC, D

Transaction can group by shared field values.

Why this answer

Option C is correct because the `transaction` command is designed to group events that share a common field value, such as a session ID, allowing you to correlate related events into a single transaction. This is a core use case for tracking user sessions or multi-step processes where events are linked by a shared identifier.

Exam trap

Splunk often tests the misconception that the `transaction` command uses a sliding time window, but in reality it uses a fixed or pause-based window, and candidates confuse this with the sliding window behavior of commands like `streamstats` or `timechart`.

Full explanation →

334

MCQmedium

An analyst wants to create a visualization showing the average response time by hour over the past day, with each server in a separate line. Which command should they use?

A.`... | timechart span=1h avg(response_time) by server`

B.`... | timechart avg(response_time) by server`

C.`... | chart avg(response_time) by hour, server`

D.`... | stats avg(response_time) by hour, server`

AnswerA

Correctly uses timechart with 1-hour intervals and splits by server.

Why this answer

Option C is correct. It uses `timechart` with an explicit span of 1 hour and splits by server. Option A uses `chart` which is not time-based and would produce a non-time chart.

Option B omits span, which might default to auto but is less explicit. Option D uses `stats` which does not produce a time-based chart automatically.

Full explanation →

335

MCQhard

A search uses a subsearch to retrieve a list of user IDs, and then the main search uses IN operator to filter events. The subsearch is expected to return up to 10,000 values. What is a potential limitation and how can it be addressed?

A.The subsearch returns only 10,000 results by default; use | head 50000 in subsearch.

B.The subsearch default limit is 50,000; no change needed.

C.The subsearch default limit is 10,000; to include more, use the | fields values command in the subsearch to return all values.

D.The subsearch default limit is 100,000; no change needed.

AnswerC

Fields values collapses duplicates and can exceed row limit

Why this answer

Option C is correct because the default limit for results returned by a subsearch in Splunk is 10,000. When using the `IN` operator in the main search, the subsearch must provide all necessary values; if more than 10,000 values are expected, the `| fields values` command can be used in the subsearch to override this limit and return all distinct values, as it bypasses the default result count restriction.

Exam trap

The trap here is that candidates often confuse the default subsearch result limit (10,000) with the main search result limit (50,000) or assume that increasing the limit with `| head` is the correct solution, when in fact the `| fields values` command is the proper method to return all values from a subsearch without hitting the row limit.

How to eliminate wrong answers

Option A is wrong because the default subsearch limit is 10,000, not 10,000 results by default that can be increased with `| head 50000`; using `| head` would only limit results further, not expand them, and the correct approach is to use `| fields values` to return all values. Option B is wrong because the default subsearch limit is 10,000, not 50,000; stating no change is needed is incorrect when the subsearch is expected to return up to 10,000 values, as this is exactly the default limit and may still be insufficient if the subsearch returns exactly 10,000 values (the limit is applied before the subsearch completes). Option D is wrong because the default subsearch limit is 10,000, not 100,000; no change is needed is also incorrect for the same reason as option B.

Full explanation →

336

MCQmedium

A team develops multiple dashboards that share common search logic. What is the best practice for managing these searches?

A.Create a saved search for each dashboard.

B.Use a single saved search that all dashboards reference.

C.Embed the search strings directly in each dashboard.

D.Use macros to define reusable search fragments.

AnswerD

Correct: Macros centralize common logic, improving maintainability and consistency.

Why this answer

Option C is correct: macros allow reusable code, reducing duplication and maintenance. Embedding search strings causes duplication. Saved searches per dashboard still duplicate logic.

A single saved search may not fit all dashboards.

Full explanation →

337

MCQhard

A security team has a saved search that runs every 5 minutes and looks for 'FAILED' events in Windows Security logs. The search uses a macro 'failed_logins' defined as: `define failed_logins() [search index=windows sourcetype=WinEventLog:Security EventCode=4625]`. Recently, the team noticed that the search is returning no results even though there are failed login events. What is the most likely issue?

A.The macro definition includes empty parentheses 'failed_logins()' but is being called without parentheses, causing Splunk to treat it as a different macro.

B.The macro does not have read permissions for the security team.

C.The macro must be called with backticks like `failed_logins` instead of pipe.

D.The sourcetype field is using a wildcard, which is deprecated.

AnswerA

The macro is defined with parentheses, so it expects to be called with parentheses even if no arguments. Alternatively, define without parentheses.

Why this answer

Option D is correct because the macro is defined with a parameter list (parentheses) but no arguments are passed, so Splunk treats it as a macro with one empty argument, which can cause the search to not run correctly if the macro is called without parentheses. Option A is incorrect because wildcards are fine. Option B is incorrect because macros do not require backticks.

Option C is incorrect because permissions are set per macro, but if it worked before, permissions are likely not the issue.

Full explanation →

338

MCQeasy

A team wants to create a dashboard that displays daily user activity over the past 30 days. The underlying data is voluminous (hundreds of millions of events per day). They need the dashboard to load quickly. The admin considers two options: using a summary index with a scheduled search to pre-compute the daily counts, or using data model acceleration on a CIM data model. Which approach is most appropriate for this specific requirement?

A.Use data model acceleration because it automatically updates and is easier to set up.

B.Use neither; instead, use report acceleration on the dashboard search.

C.Use both to ensure data availability.

D.Use a summary index because it allows custom summarization and reduces license usage.

AnswerD

Correct: Summary indexes pre-compute results, significantly reducing query time and resource consumption.

Why this answer

Option A is correct: A summary index pre-computes exactly the needed daily counts, reducing search time and license usage. Data model acceleration still queries the full dataset and may be less efficient for custom aggregation. Using both adds complexity.

Report acceleration on the dashboard search still queries the full data.

Full explanation →

339

MCQeasy

A Splunk administrator wants to create a reusable search component that accepts a sourcetype and a time range. What is the correct method to define this in Splunk?

A.Create a saved search that uses tokens to parameterize the query.

B.Use an eval statement to define a variable that holds the query.

C.Define a macro with arguments using backticks and $arg$ syntax.

D.Use a lookup definition with parameters to filter results.

AnswerC

Macros with arguments allow a reusable search fragment with parameter substitution.

Why this answer

Macros are the correct way to define reusable search components with arguments. They use backtick syntax and $arg$ placeholders. Option B is correct because macros are designed for this purpose.

Option A (saved search with tokens) is for dashboards, not reusable search fragments. Option C (eval statement) does not create reusable search components. Option D (lookup definition) is for lookup tables, not search logic.

Full explanation →

340

Matchingmedium

Match each Splunk index time field to its meaning.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

The hostname or IP of the data source

The file, script, or input that generated the event

The type of data, determines parsing behavior

The name of the index where the event is stored

The timestamp of the event

Why these pairings

These fields are automatically added during indexing.

Full explanation →

341

MCQhard

Refer to the exhibit. The search returns only transactions that ended with successful login. The administrator wants to see all failed login attempts that did not lead to a success. What is the most efficient approach?

A.Replace the search with | where closed_txn=0.

B.Increase maxpause to 30m.

C.Remove the final search command and instead filter on closed_txn=0.

D.Remove the keepevicted=true option.

AnswerC

With keepevicted=true, evicted (unclosed) transactions have closed_txn=0; filtering on that shows all failed login attempts.

Why this answer

Option D is correct because the search filters out evicted transactions with duration>0; removing that search and using duration<0 or adding a filter for evicted events would show failures. Option A is false because removing keepevicted would drop evicted transactions. Option B is false because adding maxspan may still evict.

Option C is false because it would include all transactions but still filter out evicted ones.

Full explanation →

342

MCQhard

A search uses `transaction session_id maxspan=30m` to group events. The search returns 5000 transaction events. The analyst needs to filter out any transaction that does not contain an event with status=failure. Which post-transaction command should be used?

A.`| transaction session_id maxspan=30m | stats count(eval(status="failure")) by session_id`

B.`| transaction session_id maxspan=30m | search status=failure`

C.`| transaction session_id maxspan=30m | where status=failure`

D.`| transaction session_id maxspan=30m | eval has_failure=if(match(_raw, "failure"),1,0) | where has_failure=1`

AnswerB

Yes, because after transaction, the resulting events have fields from all constituent events; if any constituent had status=failure, the transaction event will have that field. The search filters for transactions that contain at least one such event.

Why this answer

After transaction, you can use `where` with a subsearch or use `search` to filter based on fields within the transaction. Specifically, `search` can be used after transaction to filter events that contain a certain field-value pair.

Full explanation →

343

MCQmedium

A large e-commerce company uses Splunk to monitor their web application. They have a query that uses the transaction command to group related events into transactions based on session ID and a 30-minute max pause. The query runs slowly and often times out. The environment has 10 indexers with 4 CPU cores each. The search is run over the last 7 days. Which of the following is the best course of action to improve performance?

A.Use the eval command to create a transaction ID field and then use stats to group events.

B.Reduce the max pause to 15 minutes to limit the number of events in each transaction.

C.Replace the transaction command with a combination of stats and streamstats commands.

D.Increase the number of indexers to 20 to distribute the load.

AnswerC

Using stats and streamstats is more efficient than transaction and can achieve similar grouping results.

Why this answer

The `transaction` command is resource-intensive because it groups events by a field (session ID) and a max pause, requiring significant memory and processing to correlate events across the entire search time range. Replacing it with `stats` and `streamstats` is more efficient because `stats` can aggregate events by session ID without the overhead of transaction boundaries, and `streamstats` can compute running totals or windows within each session, leveraging distributed processing across indexers. This approach reduces memory pressure and avoids the timeout issue by using streaming operations that scale better with large datasets.

Exam trap

Splunk often tests the misconception that reducing the max pause or adding hardware (more indexers) is the best fix, when the real issue is replacing the inefficient `transaction` command with more scalable streaming commands like `stats` and `streamstats`.

How to eliminate wrong answers

Option A is wrong because using `eval` to create a transaction ID field and then `stats` to group events does not inherently improve performance; it still requires a similar grouping operation and does not address the core inefficiency of the `transaction` command's memory overhead. Option B is wrong because reducing the max pause to 15 minutes may limit transaction size but does not fundamentally reduce the computational cost of the `transaction` command, which still must evaluate event boundaries and maintain state for each session across the entire search window. Option D is wrong because increasing the number of indexers to 20 distributes the search load but does not optimize the query itself; the `transaction` command's performance bottleneck is often in the search head's memory and processing, not just indexing capacity, and adding indexers may not resolve timeouts if the command is inherently inefficient.

Full explanation →

344

Multi-Selecthard

Which TWO features are available for customizing dashboards in Splunk's Simple XML?

Select 2 answers

A.Using HTML in a panel to embed custom JavaScript.

B.Using CSS to change the color of all panels.

C.Automatically refreshing panels using token 'refresh'.

D.Adding drilldown actions to tables and charts.

E.Creating multiple dashboard versions for different user roles.

AnswersC, D

The 'refresh' token triggers auto-refresh.

Why this answer

Options A and E are correct. A: Drilldown actions are available. E: Token 'refresh' can auto-refresh panels.

B is false because HTML cannot embed JavaScript. C is false because CSS changes are limited. D is false because versioning is not a built-in feature.

Full explanation →

345

MCQmedium

A security analyst needs to find all events where the field `status` has a value of either "error" or "critical" and the field `bytes` is greater than 1000. Which search correctly accomplishes this?

A.(status=error OR status=critical) bytes>1000

B.status=error OR status=critical AND bytes>1000

C.status IN (error, critical) AND bytes>1000

D.status="error" OR status="critical" bytes>1000

AnswerA

Parentheses ensure the OR is evaluated first, and then the AND with bytes>1000.

Why this answer

Option A is correct because in Splunk's Search Processing Language (SPL), parentheses group the OR conditions to ensure they are evaluated together, and the space between the grouped condition and `bytes>1000` acts as an implicit AND. This correctly retrieves events where `status` is either "error" or "critical" AND `bytes` is greater than 1000.

Exam trap

The trap here is that Splunk's implicit AND (space) combined with operator precedence causes candidates to forget that OR conditions must be grouped with parentheses to avoid unintended logic, leading them to choose Option B or D.

How to eliminate wrong answers

Option B is wrong because without parentheses, AND has higher precedence than OR, so it is parsed as `status=error OR (status=critical AND bytes>1000)`, which returns events with status=error regardless of bytes, plus events matching the AND condition. Option C is wrong because the `IN` operator in Splunk requires the field name to be on the left and a parenthesized list of values, but the syntax `status IN (error, critical)` is invalid; the correct syntax is `status IN ("error", "critical")` with quoted strings. Option D is wrong because it omits parentheses around the OR conditions, causing the implicit AND to bind more tightly to the second condition, resulting in the same precedence issue as Option B.

Full explanation →

346

Multi-Selecthard

Which two techniques should be used to optimize a transaction search that is slow due to a high volume of events? (Choose two.)

Select 2 answers

A.Use the 'fields' command to limit fields before transaction.

B.Use the 'keepevicted' option to free memory.

C.Use the 'stats' command with values() and range() instead of transaction if possible.

D.Use the 'local' parameter to process on a single indexer.

E.Increase the maxspan value to reduce the number of transactions.

AnswersA, C

Correct: reduces memory per event.

Why this answer

Options A and D are correct. Using fields before transaction reduces memory and processing. Using stats instead of transaction can be more efficient for some correlations.

Option B (local) reduces parallelism. Option C (increasing maxspan) typically increases resource usage. Option E (keepevicted) does not optimize performance.

Full explanation →

347

MCQeasy

A Splunk user wants to create a macro named `nunique` that takes a field name as an argument and returns the count of distinct values for that field. Which macro definition should be used?

A.`nunique($field$)` defined as `stats dc($field$) as distinct_count`

B.`nunique($1$)` defined as `stats dc($1$) as distinct_count`

C.`nunique($field$)` defined as `| stats dc($field$) as distinct_count`

D.`nunique($1$)` defined as `| stats dc($1$) as distinct_count`

AnswerB

Correct: Uses positional argument and no leading pipe.

Why this answer

Option A is correct because macro definitions should not include a leading pipe when the macro is invoked with a pipe. Positional arguments ($1$) are standard. Option B incorrectly includes a pipe.

Options C and D use a named argument without defining the argument name in the macro properties, which is less common and not standard practice.

Full explanation →

348

MCQhard

Refer to the exhibit. What does the final result represent?

A.Users who log on more than twice on average.

B.Hours where the total logon count is more than double the average.

C.Hours where any user's logon count is more than double the average for that hour.

D.Users who have a logon count greater than twice their personal average.

AnswerC

Correct: per hour, per user comparison to hour average

Why this answer

The `eventstats` command calculates a per-hour average logon count across all users. The `where` clause then filters for events where a specific user's logon count for that hour is more than double that hourly average. This directly matches option C: hours where any user's logon count exceeds twice the average for that hour.

Exam trap

The trap here is that candidates confuse `eventstats ... by hour` (which computes a global average per hour) with a per-user average, leading them to incorrectly select option D or A.

How to eliminate wrong answers

Option A is wrong because the query does not compute a per-user average across all hours; it compares each user's hourly count to the hourly average, not a user's average. Option B is wrong because the comparison is against the average logon count for that specific hour, not the total logon count for the hour; the `where` clause checks `logon_count > 2 * avg_logons`, which is a per-user value, not a total. Option D is wrong because the average used is the hourly average across all users, not the user's own personal average; `eventstats` with `by hour` computes a global average per hour, not per user.

Full explanation →

349

MCQhard

The search returns zero results, but the lookup file contains users with names like 'admin1', 'admin2'. What is the most likely reason?

A.The lookup file is not in CSV format.

B.The 'like' function requires a wildcard pattern with '%' but the field value may have leading/trailing spaces or the pattern is case-sensitive.

C.The stats command only counts events where role=admin, but the role field is already filtered.

D.The search command runs before the eval command.

AnswerB

like() is case-sensitive; also if user has spaces, pattern may not match.

Why this answer

The 'like' function in Splunk uses SQL-style pattern matching where '%' matches any sequence of characters. If the lookup file contains 'admin1' and 'admin2', but the search uses 'like(role, "admin%")', leading/trailing spaces in the field values or case sensitivity (e.g., 'Admin1' vs 'admin1') would cause the pattern to fail, returning zero results. Option B correctly identifies this as the most likely reason because Splunk's 'like' is case-sensitive by default and does not trim spaces.

Exam trap

Splunk often tests the misconception that 'like' is case-insensitive or automatically handles spaces, leading candidates to overlook the need for explicit trimming or case normalization.

How to eliminate wrong answers

Option A is wrong because Splunk lookups can be in CSV format or other formats like KV store; a non-CSV format would cause a different error (e.g., 'Error opening lookup file'), not silently return zero results. Option C is wrong because the stats command counts events based on the filtered results; if the role field is already filtered to only admin values, stats would still count those events, not return zero. Option D is wrong because the search command runs before the eval command in the pipeline order, but that does not cause zero results; the eval command would still process the filtered events correctly.

Full explanation →

350

MCQhard

A transaction that groups events by field 'session_id' sometimes produces transactions that contain events from multiple distinct sessions due to session_id reuse over time. What is the best way to ensure transactions are correctly separated?

A.Use 'transaction session_id maxevents=1' to stop after one event.

B.Use 'transaction session_id mvlist=_raw' to include raw data.

C.Use 'transaction session_id maxspan=30m' to limit the time window.

D.Use 'transaction session_id startswith="new_session" endswith="end_session"'.

AnswerC

Correct: Time window separates reused IDs.

Why this answer

Option A is correct. Adding a maxspan limits the time window, preventing events from reused session IDs that are widely separated in time from merging. Options B, C, and D do not address the reuse issue effectively.

Full explanation →

351

MCQhard

A team wants to create a custom visualization that requires JavaScript and CSS modifications. Which Splunk feature should be used?

A.Splunk Web Framework

B.Simple XML dashboard

C.Dashboard studio

D.Custom visualization framework

AnswerD

This framework supports custom JS/CSS visualizations.

Why this answer

The Custom Visualization Framework (option D) is the correct choice because it is the only Splunk feature that allows developers to create entirely new visualizations using JavaScript and CSS. This framework provides the necessary APIs and hooks to register custom visualization types that can then be used in dashboards, whereas the other options either restrict customization or do not support custom JavaScript/CSS modifications.

Exam trap

The trap here is that candidates often confuse the Custom Visualization Framework with Dashboard Studio or Simple XML, assuming that those tools support arbitrary JavaScript/CSS customization, when in fact they only allow configuration of existing components.

How to eliminate wrong answers

Option A is wrong because the Splunk Web Framework is a broader platform for building custom web applications and does not specifically provide a structured way to create custom visualizations with JavaScript and CSS for use within Splunk dashboards. Option B is wrong because Simple XML dashboards are declarative and do not support embedding custom JavaScript or CSS directly; they rely on predefined visualization types. Option C is wrong because Dashboard Studio is a modern, drag-and-drop interface that uses predefined visualization components and does not allow custom JavaScript/CSS modifications for creating new visualizations.

Full explanation →

352

MCQmedium

A security team needs to correlate failed login attempts across multiple web servers to identify brute force attacks. Each server logs authentication failures with timestamps and source IPs. The team wants to create a transaction that groups failed attempts within 5 minutes from the same IP, but only if there are at least 3 failures. Which approach correctly implements this requirement?

A.index=web sourcetype=access_combined status=401 | search clientip=* | head 3

B.index=web sourcetype=access_combined status=401 | transaction clientip maxspan=5m maxevents=3

C.index=web sourcetype=access_combined status=401 | stats count by clientip, _time

D.index=web sourcetype=access_combined status=401 | transaction clientip maxspan=5m

AnswerB

This groups by clientip, within 5 minutes, and requires at least 3 events (maxevents=3 means at least 3).

Why this answer

Option B is correct because the `transaction` command groups events by `clientip` with a `maxspan=5m` window, and `maxevents=3` ensures only transactions with at least 3 events are retained. This directly meets the requirement to correlate failed login attempts (status=401) from the same source IP within 5 minutes, identifying brute force attacks.

Exam trap

The trap here is that candidates often forget to include `maxevents=3` to enforce the minimum event threshold, assuming `maxspan=5m` alone is sufficient, or they mistakenly use `head` or `stats` which do not perform time-based grouping.

How to eliminate wrong answers

Option A is wrong because `head 3` limits the output to the first 3 events found, not grouping events by IP or time; it also uses an unnecessary `search clientip=*` which is redundant. Option C is wrong because `stats count by clientip, _time` aggregates events by exact timestamp, not within a 5-minute window, and does not enforce a minimum count of 3 per IP. Option D is wrong because it lacks `maxevents=3`, so it would include transactions with fewer than 3 failures, failing the 'at least 3 failures' requirement.

Full explanation →

353

MCQhard

A search includes 'transaction userid maxspan=1h maxopentxn=1000'. What is the purpose of maxopentxn?

A.It limits the total number of transactions in the search results.

B.It limits the number of transactions that can be open simultaneously in memory.

C.It limits the number of events per transaction.

D.It limits the time span of open transactions.

AnswerA

Correct: maxopentxn limits the number of concurrently open transactions, indirectly controlling memory use.

Why this answer

maxopentxn limits the number of transactions kept in memory at a time. When the limit is reached, the oldest open transaction is closed and removed from memory.

Full explanation →

354

MCQhard

Refer to the exhibit. An administrator is configuring a CIDR match lookup for geo-IP. The lookup is not working. What is most likely the issue?

A.The max_matches setting should be 0

B.The filename should include the full path

C.The stanza name should be 'geo_ip' without brackets

D.The match_type should be 'match_type = cidr' without brackets

AnswerD

It should be a setting, not a stanza header.

Why this answer

In Splunk's transforms.conf, match_type is a setting, not a separate stanza. The bracket indicates a new stanza, which is incorrect. The correct syntax is 'match_type = cidr' under the [geo_ip] stanza.

Full explanation →

355

MCQhard

An administrator runs a transaction command that groups events by a customer ID but notices that some transactions are missing expected events. The log shows that the events are present and within the maxpause. What could be the reason?

A.Events are from different hosts or sources.

B.The startswith and endswith are conflicting.

C.The fields option is missing.

D.The maxpause value is too short.

AnswerA

By default, transaction groups by host, source, and sourcetype; events from different hosts are not grouped.

Why this answer

Option C is correct because by default transaction groups events by host and source as well. If events for the same customer ID come from different hosts, they are not grouped. Option A is false because maxpause is fine.

Option B is irrelevant. Option D is a best practice not a cause.

Full explanation →

356

Drag & Dropmedium

Order the steps to create a data model in Splunk in the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps

Order

Why this order

Data models are created by defining hierarchical objects with constraints and fields.

Full explanation →

357

MCQhard

A search `index=main | eval weekday=strftime(_time,"%A") | stats count by weekday | sort - count` shows that Monday has the highest count. However, the user suspects that Monday data is double-counted due to timezone offset. What should be done to investigate?

A.Use `date_wday` field which is based on the local time by default if configured.

B.Use `strftime(_time,"%w")` instead of %A to avoid string comparison issues.

C.Apply `| convert timeformat="%A" tz=US/Mountain _time as weekday` to adjust timezone.

D.Use `eval weekday=strftime(_time + timezone_offset, "%A")` with a fixed offset.

AnswerA

`date_wday` is automatically generated by Splunk based on the configured timezone in the source type.

Why this answer

Option A is correct because `_time` is in UTC; if the events are from timezones where Monday starts earlier or later, using `date_wday` from the local time conversion is more accurate. Option B is wrong because `date_hour` is not needed. Option C is wrong because using `strftime` with timezone is possible but not the most direct.

Option D is wrong because converting to epoch does not help.

Full explanation →

358

MCQmedium

You are a Splunk power user working for a healthcare organization. You have created a visualization that shows patient wait times by department over the last 30 days. The chart uses a timechart command with a 'stacked' option. Recently, the chart started showing negative values for some departments, which is impossible because wait times cannot be negative. You have verified that the raw data is correct and contains only positive wait times. The search is: index=healthcare sourcetype=patient_wait | timechart span=1d avg(wait_time) by department. The chart is displayed as a stacked area chart. The negative values appear only for a few departments sporadically. You suspect the issue is related to how null values are handled. What could be causing the negative values?

A.The timechart command is using 'limit=0' which causes overcounting of series.

B.There is a counting error in the search due to overlapping time ranges from data indexing delays.

C.The use of 'other' category in stacked charts can cause negative values when there are many series.

D.The 'stacked' option is misinterpreting null values as negative.

AnswerD

Stacked charts can interpret nulls as negative for series with gaps.

Why this answer

Option C is correct because stacked area charts can misrepresent null values as negative when there are gaps in data. Option A is wrong because timechart limit=0 does not cause negatives. Option B is wrong because overlapping time ranges would affect counts, not produce negatives.

Option D is wrong because 'other' category does not cause negatives.

Full explanation →

359

MCQeasy

A Splunk Power User needs to find the average duration of user sessions. The sessions are defined by a 'user_id' field and have a max inactivity of 15 minutes. Which search correctly calculates this?

A.index=main | transaction user_id maxpause=15m | stats avg(duration)

B.index=main | transaction user_id maxpause=15m | eval avg=avg(duration)

C.index=main | transaction user_id maxpause=15 | stats avg(_time)

D.index=main | transaction user_id maxspan=15m | stats avg(duration)

AnswerA

Correct: transaction adds duration, stats averages it.

Why this answer

The transaction command with maxpause=15m groups events by user_id and adds a duration field. The stats command then calculates the average duration.

Full explanation →

360

MCQeasy

A security analyst wants to group all authentication events (e.g., login, logout, failure) that occur within a 10-minute window for each user. The events are from multiple sources and share a common 'user' field. Which transaction command is most appropriate?

A.... | transaction user maxspan=600 maxevents=100

B.... | transaction user maxpause=120

C.... | transaction user maxspan=600 startswith="login" endswith="logout"

D.... | transaction user maxspan=600

AnswerD

Correct: maxspan sets a 10-minute window.

Why this answer

Option A is correct because maxspan=600 seconds (10 minutes) bounds the transaction time window. Option B is incorrect because maxevents=100 may truncate transactions with more events. Option C is incorrect because startswith and endswith are not required and may exclude valid transactions.

Option D is incorrect because using only maxpause could allow transactions to exceed 10 minutes if events continue.

Full explanation →

361

Multi-Selecteasy

Which THREE of the following are valid Splunk search commands?

Select 3 answers

A.regex

B.dedup

C.sort

D.filter

E.parse

AnswersA, B, C

`regex` is a valid command to filter events using a regular expression.

Why this answer

The `regex` command is a valid Splunk search command that filters search results by applying a Perl-compatible regular expression (PCRE) to raw events or specific fields. It is commonly used to extract or match patterns within event data, such as IP addresses or error codes, and is distinct from the `rex` command which extracts fields.

Exam trap

Splunk often tests the distinction between real Splunk commands and plausible-sounding but non-existent commands like `filter` or `parse`, which candidates might confuse with similar functions in other tools or programming languages.

Full explanation →

362

MCQeasy

A security analyst needs to find all events where the field 'status' is either 'error' or 'critical', and then count the number of events per source IP. Which search is correct?

A.index=security (status=error OR status=critical) | stats count by src_ip

B.index=security status=error AND status=critical | stats count by src_ip

C.index=security | where status=error OR status=critical | stats count by src_ip

D.index=security status=error OR status=critical | stats count by src_ip

AnswerA

Correct syntax: parentheses group OR conditions, then stats count.

Why this answer

Option A is correct because it uses the proper syntax to filter events where the 'status' field is either 'error' or 'critical' within the index, and then pipes the results into the stats command to count events by 'src_ip'. The parentheses around the OR condition ensure correct evaluation order, and the stats count by src_ip accurately aggregates the count per source IP.

Exam trap

Splunk often tests the importance of parentheses in OR conditions within Splunk searches, as candidates commonly assume that 'status=error OR status=critical' without parentheses works the same as with parentheses, but it can lead to unintended search behavior due to operator precedence.

How to eliminate wrong answers

Option B is wrong because it uses 'AND' between the two status conditions, which would require an event to have both 'error' AND 'critical' simultaneously in the same field, which is impossible and returns zero results. Option C is wrong because it uses the 'where' command after the initial index filter, which is less efficient and not necessary; the 'where' command is typically used for more complex expressions, but here the OR condition can be handled directly in the search string. Option D is wrong because it lacks parentheses around the OR condition, which can lead to incorrect evaluation order; without parentheses, the search might be interpreted as 'index=security status=error' OR 'status=critical', potentially returning events from other indexes if 'status=critical' matches elsewhere.

Full explanation →

363

MCQeasy

Which SPL command can be used to create a new field based on a conditional evaluation, such as setting a status field to 'critical' if a numeric threshold is exceeded?

A.| makemv

B.| rex field=_raw

C.| eval status=if(value>100,"critical","normal")

D.| convert status=if(value>100,"critical","normal")

AnswerC

Eval with if performs conditional assignment

Why this answer

The `eval` command in SPL is used to create new fields or modify existing ones by evaluating expressions. The `if()` function within `eval` allows conditional logic, making `| eval status=if(value>100,"critical","normal")` the correct syntax to create a new field 'status' that is set to 'critical' when the numeric field 'value' exceeds 100, and 'normal' otherwise.

Exam trap

Splunk often tests the distinction between `eval` (for field creation and computation) and `convert` (for data type conversion), leading candidates to mistakenly choose `convert` for conditional logic due to its similar syntax.

How to eliminate wrong answers

Option A is wrong because `makemv` is used to split a single multivalue field into separate values, not to create a field based on conditional evaluation. Option B is wrong because `rex field=_raw` is used for extracting fields using regular expressions from the `_raw` event data, not for conditional field creation. Option D is wrong because `convert` is used for type conversion (e.g., converting strings to numbers or timestamps), not for conditional logic; the syntax `convert status=if(...)` is invalid and would produce an error.

Full explanation →

364

MCQmedium

A systems engineer creates a summary index using a saved search that runs every 30 minutes. The summary index aggregates data from multiple sourcetypes. After a week, the engineer notices that the summary index contains duplicate events for certain time ranges. What is the most likely cause?

A.The macro used in the saved search includes a time zone conversion that shifts events.

B.The saved search schedule is set to run at the wrong time.

C.The summary index acceleration is enabled, causing automatic re-summarization.

D.The summary index time range extends beyond the schedule interval, causing overlapping windows.

AnswerD

For example, if the search runs every 30 minutes but covers a 1-hour window, each event is summarized twice.

Why this answer

Option B is correct: if the summary index time range overlaps with previous runs, duplicates occur. Option A: would cause missing data. Option C: acceleration does not cause duplicates.

Option D: time zone would shift but not create duplicates within same time range.

Full explanation →

365

Matchingmedium

Match each Splunk role to its typical permission level.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts

Matches

Full access to system configuration and all data

Can create and share knowledge objects, run searches

Can run searches and create personal knowledge objects

Allows deletion of search results and events

Allows access to Splunk REST endpoints

Why these pairings

Roles define what actions users can perform in Splunk.

Full explanation →

366

MCQmedium

An analyst needs to correlate events from two different data sources (web logs and database logs) based on a common session ID. The events occur within a short time window of 5 seconds. Which command is most appropriate?

A.transaction sessionid

B.transaction sessionid maxspan=5s

C.join type=inner sessionid [search index=db]

D.append [search index=db]

AnswerA

Transaction groups events by sessionid; adding maxspan=5s would limit total time, but default works if events are close.

Why this answer

The transaction command is designed to group events that share a common field, regardless of source, and can be constrained with maxspan. Option A is the most direct and efficient method.

Full explanation →

367

MCQeasy

What is the MOST likely reason the search returns no results?

A.The user does not have permission to read the lookup.

B.The lookup definition is not configured.

C.The CSV file has no header row.

D.The `inputlookup` command expects the definition name, not the filename.

AnswerD

Use `| inputlookup usertable`.

Why this answer

The `inputlookup` command in Splunk expects the lookup definition name as its argument, not the filename of the CSV file. If a user specifies the filename (e.g., `| inputlookup myfile.csv`) instead of the lookup definition name (e.g., `| inputlookup my_lookup`), Splunk will not find the lookup and returns no results. This is a common mistake because the command syntax requires the logical name defined in the lookup table configuration, not the physical file path.

Exam trap

Splunk often tests the distinction between the `inputlookup` command (which requires the definition name) and the `lookup` command (which can accept either a definition name or a filename in certain contexts), leading candidates to incorrectly assume both commands accept filenames.

How to eliminate wrong answers

Option A is wrong because if the user lacked read permission, Splunk would typically return an error message about permissions, not silently return no results. Option B is wrong because if the lookup definition were not configured, the `inputlookup` command would fail with an error indicating the definition does not exist, rather than returning zero results. Option C is wrong because a missing header row in the CSV file would cause the lookup to load data with default field names (e.g., field1, field2) or produce a warning, but it would still return rows of data, not zero results.

Full explanation →

368

Multi-Selecteasy

Which two statements about the transaction command are correct? (Choose two.)

Select 2 answers

A.The transaction command automatically adds an 'eventcount' field.

B.The transaction command requires a startswith or endswith parameter.

C.The transaction command can only correlate events within the same sourcetype.

D.The transaction command automatically adds a 'duration' field.

E.The transaction command can be used with events from different indexes.

AnswersA, D

Correct: eventcount is automatically added.

Why this answer

Options A and D are correct. The transaction command automatically adds duration and eventcount fields. Option B is false because startswith/endswith are optional.

Option C is false because transaction can correlate events from different sourcetypes. Option E is false because transaction can work across indexes.

Full explanation →

369

Drag & Dropmedium

Arrange the steps to create a new index in Splunk in the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps

Order

Why this order

Creating an index involves navigating to the indexes page, adding a new index with appropriate settings, and saving.

Full explanation →

370

MCQeasy

A search produces a table with many rows. Which visualization type is best suited to show the distribution of a single field's values?

A.Area chart

B.Pie chart

C.Scatter chart

D.Line chart

AnswerB

Pie charts show parts of a whole, ideal for distribution of a single field.

Why this answer

Option C is correct because a pie chart effectively shows proportional distribution of categorical data. Options A, B, and D are more suited for trends or relationships.

Full explanation →

371

MCQhard

Which statement best describes the search result?

A.It returns an error because price is not a field before the lookup.

B.It returns the count of distinct product_ids that have a price > 100.

C.It returns the count of events where the price is greater than 100, grouped by product_name.

D.It returns the count of successful web events, but only for products with price > 100.

AnswerC

Correct: after lookup and where filter, stats count by product_name groups events.

Why this answer

Option A is correct. The search enriches events with product_name and price, filters for price > 100, then counts events by product_name. Option B is wrong because it counts events, not distinct product_ids.

Option C is wrong because price is added by the lookup. Option D is too broad; it doesn't specify the grouping.

Full explanation →

372

MCQhard

A Splunk administrator runs the following search to identify the top 5 users by total bytes transferred: index=proxy sourcetype=webproxy | stats sum(bytes) as total_bytes by user | sort - total_bytes | head 5 The search returns results, but the numbers seem inflated. On closer inspection, the 'bytes' field is a string type. What must be done to correct the search?

A.Use 'convert num(bytes)' before stats.

B.Use 'eval bytes_numeric = tonumber(bytes)' then 'stats sum(bytes_numeric) as total_bytes by user'.

C.Use 'where isnum(bytes)' to filter out non-numeric values before stats.

D.Use 'eval bytes = string(bytes)' before stats.

AnswerB

This explicitly converts the string to numeric, ensuring correct summation.

Why this answer

Option B is correct because the `bytes` field is stored as a string, and `stats sum()` cannot perform arithmetic on string values — it would silently treat them as zero or concatenate them, leading to inflated results. The `tonumber()` function explicitly converts the string to a numeric type, enabling accurate summation. Using `eval` to create a new numeric field before `stats` is the standard approach in Splunk for this scenario.

Exam trap

The trap here is that candidates assume `stats sum()` automatically converts strings to numbers, or they reach for `convert` (a non-existent command) instead of the correct `eval tonumber()` pattern, which Splunk explicitly tests in the Advanced Searching domain.

How to eliminate wrong answers

Option A is wrong because `convert num(bytes)` attempts to convert the field in place, but `convert` is not a valid Splunk command for this purpose; the correct command is `eval` with `tonumber()`. Option C is wrong because `where isnum(bytes)` filters out non-numeric values but does not convert the string to a number, so `stats sum()` would still fail to sum correctly (strings would be ignored or cause errors). Option D is wrong because `eval bytes = string(bytes)` explicitly converts the field to a string, which is the opposite of what is needed and would make the inflation worse.

Full explanation →

373

MCQhard

A large e-commerce company is using Splunk to monitor user sessions across multiple microservices. Each service logs events with a common 'session_id' field. The security team wants to identify sessions where a user performed a 'password_change' action followed by a 'login' from a different IP address within 5 minutes, indicating possible account takeover. The current search uses `transaction session_id startswith=action=login endswith=action=password_change maxspan=10m`. However, the search returns very few results, and the team suspects it is missing many attacks. The logs show that sometimes 'password_change' occurs before 'login' (e.g., password reset then login) and the IP changes are observed across multiple events. The team needs to capture both orderings. Which approach should they take?

A.Use `transaction session_id maxspan=5m` and then filter for sessions that contain both actions

B.Use `transaction session_id startswith=action=password_change endswith=action=login maxspan=5m` in a separate search and append results

C.Keep the current search but increase maxspan to 30m

D.Add both startswith and endswith with OR conditions: `startswith=(action=login OR action=password_change) endswith=(action=login OR action=password_change)`

AnswerA

This captures any order within 5 minutes, then filter for both actions.

Why this answer

The current search only captures one order (login then password_change). To capture both orders, they should either use `transaction session_id maxspan=5m` without startswith/endswith and then filter, or use two separate transactions and combine. The best option is to use `transaction session_id maxspan=5m` and then search for events where both actions occur, because it avoids order dependency and is simpler.

Full explanation →

374

Multi-Selecthard

Which TWO of the following are valid reasons to use the Common Information Model (CIM) in a Splunk environment?

Select 2 answers

A.It improves the ability to correlate events from different technologies.

B.It enables searching across different data sources with common field names.

C.It improves search performance by pre-aggregating data.

D.It provides built-in security monitoring use cases.

E.It eliminates the need for custom field extractions.

AnswersA, B

By standardizing field names, CIM makes it easier to correlate events across different data sources.

Why this answer

Option A is correct because the Common Information Model (CIM) provides a standardized set of field names and event tags across different data sources, enabling correlation of events from disparate technologies (e.g., firewalls, IDS, endpoints) using common fields like 'dest_ip', 'src_ip', 'user', and 'action'. This normalization allows Splunk to join or relate events that share the same CIM-compliant fields, making it possible to build coherent security or operational stories across heterogeneous data.

Exam trap

The trap here is that candidates confuse the CIM's normalization role with performance optimization or built-in security content, leading them to select options about pre-aggregation or pre-built use cases, which are actually features of other Splunk components like data model acceleration or Splunk Security Essentials.

Full explanation →

375

MCQeasy

A company's security team uses Splunk to monitor firewall logs. They have a lookup file named 'threat_intel.csv' containing 10,000 IP addresses classified by threat level. The lookup is used in a dashboard that shows the number of blocked connections from high-threat IPs over the past 24 hours. Recently, the dashboard has become slow, taking over 30 seconds to load. The lookup file is updated every 15 minutes via a script that replaces the entire file. The search currently uses: `index=firewall | lookup threat_intel.csv src_ip OUTPUT threat_level | where threat_level="high" | stats count`. Which of the following is the MOST efficient way to improve dashboard performance?

A.Restrict the search to a smaller time range, such as the last hour.

B.Use the lookup with local=t to force it to run on the search head only.

C.Convert the lookup to a KV store collection with an index on src_ip.

D.Increase the lookup cache size in limits.conf.

AnswerC

KV store handles concurrent reads and writes efficiently, ideal for frequently updated lookups.

Why this answer

Option C is correct because converting the lookup to a KV Store collection with an index on `src_ip` allows Splunk to perform efficient key-value lookups without loading the entire 10,000-row CSV into memory on every search. The KV Store uses an indexed data structure, which dramatically reduces lookup time compared to a file-based lookup that must be fully scanned each time, especially when the file is replaced every 15 minutes and the search runs over a 24-hour window.

Exam trap

The trap here is that candidates often assume reducing the time range or caching will fix performance, but the real bottleneck is the file-based lookup's linear scan of 10,000 rows, which is only resolved by switching to an indexed KV Store collection.

How to eliminate wrong answers

Option A is wrong because reducing the time range to the last hour does not address the root cause of slow lookups; the performance bottleneck is the file-based lookup scanning 10,000 IPs, not the volume of events. Option B is wrong because using `local=t` forces the lookup to execute only on the search head, which does not improve performance—it may even worsen it by bypassing distributed lookup execution across indexers. Option D is wrong because increasing the lookup cache size in `limits.conf` only caches previously looked-up values within a single search; it does not help when the lookup file is replaced every 15 minutes, as the cache is invalidated on each file change, and the initial load still requires scanning the entire CSV.

Full explanation →

Page 5 of 7

All pages

Practice SPLK-1003 by domain

Target a specific domain to shore up weak areas.

Advanced Searching and Statistics Macros, Saved Searches and CIM Advanced Visualization and Lookups Transactions and Event Correlation

See all domains with question counts →