Automatically splitting large files with sed based on file size

Andrew Fletcher published: 19 January 2026 2 minutes read

commands
sed

When working with large prompt or configuration files (for example data/prompts-master.py), it’s common to copy or inspect the file in chunks using sed. Early on, this is easy to manage manually:

sed -n '1,324p' data/prompts-master.py
sed -n '325,647p' data/prompts-master.py

However, this approach breaks down as the file grows. If the file increases from 647 lines to 632, 976, or beyond, hard-coding line numbers becomes brittle and error-prone.

Let's walk through how to automatically calculate split points based on the current file length and how to adapt the strategy as files grow larger (e.g. splitting into 50%, 33%, or 25% chunks).

Step 1: Detect the number of lines dynamically

The foundation is wc -l, which returns the number of lines in a file:

wc -l data/prompts-master.py

To capture just the number (without the filename):

wc -l < data/prompts-master.py

This allows us to calculate split points programmatically.

Step 2: Split the file into two equal halves (≈50%)

The following one-liner:

Detects the total number of lines
Calculates the midpoint
Prints the file in two halves using sed

f="data/prompts-master.py"
n=$(wc -l < "$f")
mid=$(( (n + 1) / 2 ))
sed -n "1,${mid}p" "$f"
sed -n "$((mid + 1)),$n p" "$f"

Why (n + 1) / 2?

This ensures the first half gets the extra line when the file has an odd number of lines.

Examples:

632 lines → 316 / 316
976 lines → 488 / 488
975 lines → 488 / 487

No manual changes required.

Step 3: When files get too big (500+ lines per chunk)

If you hit limits where ~500 lines per chunk is too large (for example when pasting into another tool), you can change the split strategy.

Split into thirds (≈33%)

f="data/prompts-master.py"
n=$(wc -l < "$f")
chunk=$(( (n + 2) / 3 ))
sed -n "1,${chunk}p" "$f"
sed -n "$((chunk + 1)),$((chunk * 2))p" "$f"
sed -n "$((chunk * 2 + 1)),$n p" "$f"

Split into quarters (≈25%)

f="data/prompts-master.py"
n=$(wc -l < "$f")
chunk=$(( (n + 3) / 4 ))
sed -n "1,${chunk}p" "$f"
sed -n "$((chunk + 1)),$((chunk * 2))p" "$f"
sed -n "$((chunk * 2 + 1)),$((chunk * 3))p" "$f"
sed -n "$((chunk * 3 + 1)),$n p" "$f"

This automatically adapts whether the file has:

632 lines
976 lines
1,200+ lines

Step 4: A simple rule of thumb

You can standardise this logic:

< 500 lines → no split
500–900 lines → split into halves (50%)
900–1,500 lines → split into thirds (33%)
> 1,500 lines → split into quarters (25%)

This keeps each chunk manageable without revisiting line numbers every time the file grows.