Array 1 9-647 **** Predicted by CRISPRDetect 2.4 ***
>NZ_WSKN01000292.1 Escherichia coli strain 8374wG3 NODE_292_length_680_cov_0.891501_ID_10744, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
         9      29   100.0      32  .............................  ACCGCAGAAAAAACCAGCTGGCCTCGAAAGAA
        70      29   100.0      32  .............................  ATCTGGCTCTTCAAAAAACAGCAGCCTGAACC
       131      29   100.0      32  .............................  CCGCCATCACAGAACCTTGCCAGACCGTATCC
       192      29   100.0      32  .............................  CGCCCAGGATCAGCTTACTGCGCACCGTTTCC
       253      29   100.0      32  .............................  CGCCCAGGATCAGCTTACTGCGCACCGTTTCC
       314      29   100.0      32  .............................  GCGGGTTTTTTACGCGTGGGGTTGCAACAGGT
       375      29   100.0      32  .............................  AAAAAACAGCAAAAGATGAACAAAATATAAAC
       436      29   100.0      32  .............................  TTCATGCGCCGCCCCACTTCACTGATAGCGAA
       497      29   100.0      32  .............................  CTCAAAATCGGCGGCGGTTAAGTGGCTGACAT
       558      29   100.0      32  .............................  GATCTAAAAAGGATCTGCGGAACTGAAAAAAT
       619      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  ================================  ==================
        11      29   100.0      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank : TGTTTAGAG
# Right flank : GGTATTTATTGTGCAATTAGCTTTGCATTAAAG

# Questionable array : NO Score: 6.26
#   Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG
#   Alternate repeat : NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
#   A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37
#   Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41
#   AT richness analysis in flanks prediction: R [10.0-38.3]%AT Score: 0.27/0.27
#   Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
#   Final direction: F [5.24,0.27 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 22-415 **** Predicted by CRISPRDetect 2.4 ***
>NZ_WSKN01000396.1 Escherichia coli strain 8374wG3 NODE_396_length_506_cov_1.19261_ID_10952, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
        22      29   100.0      32  .............................  GAAAAGCTACTTTTGTGTTCAACTGATGCATT
        83      29   100.0      31  .............................  CCGCGCAAATCCAGCGAGCCGCCGACGCTCA
       143      29   100.0      32  .............................  TTGCAAACCGTGGCAAACGCAATTAACAAAAA
       204      29   100.0      32  .............................  ATTGTTATAATTATTTATTGAAATATCATTCC
       265      29   100.0      32  .............................  AATCTATTGTGAATTTGAAATGGTCCAGCACT
       326      29   100.0      32  .............................  GGTAAAAACACGGTCTGAACCGACATTCATGT
       387      29    93.1       0  ...........AT................  |
==========  ======  ======  ======  =============================  ================================  ==================
         7      29    99.0      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank : GTATTGCCGGTGTCAGCAAAAG
# Right flank : GGCGCACTGGATGCGATGATGGATATCACTTAGAATTCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCAAACCCATCAAGACGCC

# Questionable array : NO Score: 6.21
#   Score Detail : 1:0, 2:3, 3:0, 4:0.95, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG
#   Alternate repeat : NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
#   A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37
#   Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: R [18.3-43.3]%AT Score: 0.27/0.27
#   Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
#   Final direction: F [5.65,0.27 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 201529-202716 **** Predicted by CRISPRDetect 2.4 ***
>NZ_WSKN01000002.1 Escherichia coli strain 8374wG3 NODE_2_length_318642_cov_49.4872_ID_10164, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
    201529      29   100.0      32  .............................  GCATTGACGCTTTAAACGACGACGGACGCCAC
    201590      29   100.0      32  .............................  AAAACAGCCTTTAGATTAGTACCTGACGACCG
    201651      29   100.0      32  .............................  TAAACGCACCTGGCGCGCCACTTTATCAACAA
    201712      29   100.0      32  .............................  CGGCTTGTTTAATTGCGTGGAACGTCTCAATT
    201773      29   100.0      32  .............................  ACGGCGTGGATTGAGGGACGGGTATTTGGTCC
    201834      29   100.0      32  .............................  CTATTGCTTTCGTACAGATTTTCAGTGGTGCT
    201895      29   100.0      32  .............................  AAAATTCTGTGTTTCGACCATTACTTCGGTAA
    201956      29   100.0      32  .............................  ATTCTTGATCACGCTTTTACCGAAGTAATGGT
    202017      29   100.0      32  .............................  GGGGTAGAATTATTCTTCGTGAGCGATTTATC
    202078      29   100.0      32  .............................  CGGCGTTCCGTGCGGCAATTGGAATCACACCA
    202139      29   100.0      32  .............................  CGTTCTGAATCCGATATTCTTCAGCACCTTCA
    202200      29   100.0      32  .............................  AGCGTCAATCAGCGCGTCTATCGCGTCACTTT
    202261      29   100.0      32  .............................  ATTTGGGGGTATGAGAGCGCCGAGCCGTTCGG
    202322      29   100.0      32  .............................  GCTCCCTGTCAGTTGTAATCGATAACGTTGAT
    202383      29   100.0      32  .............................  ATGTAGGGGCAATCGAACGATTCTCTGCCGAC
    202444      29   100.0      32  .............................  CCGAGCCCGATTATCGGCATGAGCGATGCGGA
    202505      29   100.0      32  .............................  TCGAAGAAGAAAGGGAAATAATGCGAGGAACG
    202566      29   100.0      32  .............................  TATTACGCGCCAGCAATGCTGACAGCGGCAAA
    202627      29   100.0      32  .............................  CGCGAGAGCCAGCAAAACGCCAGGGCACAAAA
    202688      29    93.1       0  .A..........................A  |
==========  ======  ======  ======  =============================  ================================  ==================
        20      29    99.7      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank : TGGATGAACTATTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGGAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTACTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA
# Right flank : ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACCCCTCATGTTCAAAATAGTTCTCCATGCCAGAGAAGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATGTTGAATTAATATCTATTAATTTTTTCTTTAGGTTAATAGTTTGTTTTTTAAGCTTGTTATTCATTGATTAAGTAATAAATCTGAAAATTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATTACCGTTTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTT

# Questionable array : NO Score: 6.25
#   Score Detail : 1:0, 2:3, 3:0, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG
#   Alternate repeat : NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
#   A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37
#   Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: NA [75.0-68.3]%AT Score: 0/0.27
#   Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
#   Final direction: F [5.65,0 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 2 218218-218795 **** Predicted by CRISPRDetect 2.4 ***
>NZ_WSKN01000002.1 Escherichia coli strain 8374wG3 NODE_2_length_318642_cov_49.4872_ID_10164, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
    218218      29   100.0      32  .............................  CGACAAAATTCTCAAAACTCGATCAGGAAAAT
    218279      29   100.0      32  .............................  CCACCGTTTTCGCCCACCAGGGCGCACAACCC
    218340      29   100.0      32  .............................  GAAAAAGAGAAGGTAGAGAAAGCGGAATCTGG
    218401      29   100.0      32  .............................  CAGGTCTATCGGGCGATCAATAAAATCGGTCA
    218462      29   100.0      32  .............................  GCGCACCGTTGCGTCGAAAAGGCGCTGGAGAT
    218523      29   100.0      32  .............................  TACGCTTACACAACGGGCGAATATTTTAACGG
    218584      29   100.0      32  .............................  GAACCCAATAGTGAAATACAGCATCATTTTTT
    218645      29   100.0      32  .............................  GGGCAAATATAAATTCCAGCGTGCTTCATGAA
    218706      29   100.0      32  .............................  CTGCGTAGCGACCTTTGCTCTCAATTTCGTTG
    218767      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  ================================  ==================
        10      29   100.0      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank : GTCCTTGCTGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGTGATAGTGGTCACCGGGGGCGCGGCGGATGAGCATGGTCGTGGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGCTCGCAATCTGGCTACTGGAAGTGCGTGCCGGTGTGTATGTTGGTGATACATCAAAACGTATTCGGGAGATGATCTGGCAACAAATTACCCAACTGGCTGGTTGCGGAAATGTGGTGATGGCCTGGGCGACCAATACCGAATCGGGTTTTGAATTTCAGACCTGGGGAGAAAACAGACGTATTCCGGTGGATTTGGATGGGTTACGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAGGTTATGTGTTCTTTAAAAATAAGGAAATGTTTGAATTTAGTTGGTAGATTGTTGATGTGGAATAAATTTGTTTAAAAACAGATATGTATGCTTAGT
# Right flank : GGGCGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCCAACCCATCAAGACGCCTTCGCCAACTCTTTCACCAGAGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGCTGTTGATTATCAAACCTGACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCGGTGCCTTTCGGATAGTTTTTATTCAAACCAGGATTGGTCGTCGCGGCTATTCCCTGACTGCGCGCAATTGCCAGTGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAAC

# Questionable array : NO Score: 6.26
#   Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG
#   Alternate repeat : NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
#   A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37
#   Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41
#   AT richness analysis in flanks prediction: F [73.3-40.0]%AT Score: 0.27/0.27
#   Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
#   Final direction: F [5.51,0 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//