Approximate string matching for high-throughput sequencing