Sequence Cleaner & Validator (DNA/Protein, FASTA)
Input & Options
0 bases
Cleaning
Case conversion
Validation
Cleaned & Validated
Your cleaned and validated sequence will appear here.
Tips: Ctrl/Cmd + K focuses the input. Validation updates as you type.
How It Works:
This tool processes sequence data entirely within your web browser. When you type or paste content into the input field, it applies the selected cleaning, case conversion, and validation options instantly. Your sequence data is never sent to a server, ensuring complete privacy and security.
Cleaning:
- Remove FASTA Headers & Line Numbers: This option will strip any lines starting with `>` (common in FASTA format) and any numerical line prefixes.
- Remove All Whitespace: This will remove all spaces, tabs, and newlines, resulting in a continuous string of characters.
Validation:
The tool uses regular expressions to check if your sequence contains only valid characters for the chosen type:
- DNA (ACGT): Checks for A, C, G, T (case-insensitive). Any other character will result in a validation error.
- Protein (20 AAs): Checks for the 20 standard amino acid single-letter codes (A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y, case-insensitive). Other characters will trigger an error.