Array 1 293-20          **** Predicted by CRISPRDetect 2.4 ***
>NZ_RYBC01000854.1 Escherichia coli strain D008p NODE_856_length_344_cov_0.317972, whole genome shotgun sequence          Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
       292      29   100.0      32  .............................  GTCATCACGATGAATCAAAATTTCGCCCGGCT
       231      29   100.0      32  .............................  CTTCTCCACCGTTTGGCGAATCGGTGTGAGGG
       170      29   100.0      32  .............................  ATTGTTATAATTATTTATTGAAATATCATTCC
       109      29   100.0      32  .............................  AATCTATTGTGAATTTGAAATGGTCCAGCACT
        48      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  ================================  ==================
         5      29   100.0      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   AATACCACAACCCTTAAGTTAGCGCTTATGGGGTTTTTCCCCACCGTTCAG
# Right flank :  GGGTAAAAACACGGTCTGAA

# Questionable array : NO    Score: 6.06
#     Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:0.8, 9:1,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
# Primary repeat :     GTGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     R [4,5] Score: 0.37/0.37
#     Reference repeat match prediction:         R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      NA [0-0] Score: 0/0.41
#     AT richness analysis in flanks prediction: R [18.3-45.0]%AT Score: 0.27/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         R [0,5.51   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 50-261          **** Predicted by CRISPRDetect 2.4 ***
>NZ_RYBC01001052.1 Escherichia coli strain D008p NODE_1054_length_316_cov_0.608466, whole genome shotgun sequence          Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
        50      29   100.0      32  .............................  GCGAGTAAGAGTACTGAGTATTTTAATCTCAT
       111      29   100.0      32  .............................  GCCATAGACTTTATTAACTTCATTGGGGGTTA
       172      29   100.0      32  .............................  CGTATCACAACCGGCCTCAGTACGTATCAGAA
       233      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  ================================  ==================
         4      29   100.0      32  GTGTTCCCCGCGCCAGCGGGGATAAATCG

# Left flank :   CCAGCGGGGATAAATCGAGCCAGACACATTGCTTTATCGCGCCACGCTGG
# Right flank :  AGGGGCAGGGAGTGACGCTGGAGCAGGCGATCGTGTTCCCCGCGCCAGCGGGGAT

# Questionable array : NO    Score: 5.86
#     Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
# Primary repeat :     GTGTTCCCCGCGCCAGCGGGGATAAATCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     NA [5,5] Score: 0.37/0.37
#     Reference repeat match prediction:         F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  F [-13.50,-12.00] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      NA [0-0] Score: 0/0.41
#     AT richness analysis in flanks prediction: NA [35.0-26.7]%AT Score: 0/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         F [4.87,0   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 467-30          **** Predicted by CRISPRDetect 2.4 ***
>NZ_RYBC01000318.1 Escherichia coli strain D008p NODE_320_length_515_cov_1.06701, whole genome shotgun sequence          Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
       466      29   100.0      32  .............................  ACCTGAACGAAACGCAAAATAGTTTCTTTATG
       405      29   100.0      32  .............................  TTCGTTTTTCGCAGCATTAACTTTTTGCCGCG
       344      29    96.6      32  ............................T  TGATCCGCGACCGCCAGATGGCGCAGCCTGTC
       283      29   100.0      32  .............................  GTGCGCCAGCTATAAAAAACTCACCATCAACA
       222      29    86.2      12  ...............C.A.C....G....  CAAACAGAGCCG                      G [196] Deletion [182]
       180      29   100.0      32  .............................  ATCCCCTGCGTCCTCTTTTGAATAATCGCGGC
       119      29   100.0      32  .............................  CGAGCGCCGTTGGCTACTGGCGTAGCCGTGGA
        58      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  ================================  ==================
         8      29    97.8      29  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   CCAGCGGGGATAAACCGACAATCAGGGAACGATTGTTGACACTGTAAA
# Right flank :  GCAAAAAAATAATATCCGGCAGTCTGTACG

# Questionable array : NO    Score: 5.88
#     Score Detail : 1:0, 2:3, 3:0, 4:0.89, 5:0, 6:0.25, 7:-0.26, 8:1, 9:1,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
# Primary repeat :     GTGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     R [4,5] Score: 0.37/0.37
#     Reference repeat match prediction:         R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      F [0-1] Score: 0.41/0.41
#     AT richness analysis in flanks prediction: R [30.0-43.3]%AT Score: 0.27/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         R [0.41,5.51   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 95264-94198          **** Predicted by CRISPRDetect 2.4 ***
>NZ_RYBC01000004.1 Escherichia coli strain D008p NODE_4_length_210817_cov_33.6235, whole genome shotgun sequence          Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
     95263      29   100.0      32  .............................  ACAATCCCACGCCGATAATCTCTATACAGCAA
     95202      29   100.0      32  .............................  GGCACGGAATTGTTATGCTGTTCCCCTGACCG
     95141      29   100.0      32  .............................  ATCCGCCGCCGGTTAACGCTGGACCAGTTCCG
     95080      29   100.0      32  .............................  GGCGAGTCCGTCAGCGGTGCGCCGCTGCAACA
     95019      29   100.0      32  .............................  GGACAATGTGAAAAGCTTAATATTCATTACAT
     94958      29   100.0      32  .............................  CGACGTTTTCTAATATCACCCAGCAATCAATT
     94897      29   100.0      32  .............................  ATGACCATTGGTGAACGCATCCGCTTTCGCCG
     94836      29   100.0      32  .............................  TACAGTTTCCATAAATTCACCTCGTTTATATA
     94775      29   100.0      32  .............................  ATCGGACGATGGCGATCGCAATCGCGCGGGAA
     94714      29   100.0      32  .............................  AGGACGAAACGACCGGAAAACTGGCGACGGGC
     94653      29   100.0      32  .............................  AACCTTGTCGGGTCGCCCGTGCGTCATGATGA
     94592      29   100.0      32  .............................  AGTTCCCACAAACCTGGGTGCATCTCGCGTTC
     94531      29   100.0      32  .............................  ACTGCAAAGTTCTTCACGCTGGTTTTTATGCA
     94470      29   100.0      32  .............................  CCAGCCGAAACAACGCCAGCAAAATCGACCGC
     94409      29   100.0      32  .............................  GGATATAGAGCGGGTACTCGAGCGAAGCGGGG
     94348      29   100.0      32  .............................  CCAGGACAGGCCGTGACGGTTGCCATTGAGTC
     94287      29   100.0      32  .............................  TTTTTGTTCTCTTCAAAACGCCGAACAACCAA
     94226      29    93.1       0  ............T.....A..........  |
==========  ======  ======  ======  =============================  ================================  ==================
        18      29    99.6      32  GAGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   CGTGCTTGCTGCTGGAGAAATACAACCGCCGGCCCCACCTGAAGATGCACAGCCTGTTGCCATTCCGCTTCCCGTTTCTCTGGGAGATGCCGGACATCGGAGTAGCTGAGATGAGTATGTTGGTCGTGGTCACTGAAAATGTACCTCCGCGCTTACGAGGCAGATTAGCCATCTGGTTGTTGGAGGTACGTGCAGGGGTATATGTAGGTGATGTATCCGCAAAAATTCGTGAAATGATCTGGGAACAAATAGCTGGACTGGCGGAAGAAGGCAATGTAGTGATGGCATGGGCAACGAATACGGAATCGGGATTTGAGTTCCAGACATTTGGGGTAAACAGGCGTACCCCGGTAGATTTGGATGGTTTAAGGTTGGTATCTTTTTTACCTGTTTGAAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACTTTTAGA
# Right flank :  GACGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCCAACTCCCATTTTCATACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCAGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAATTTGTTGCTTCTACCGAAAGTACGGCAATACCGGCTTTGTCGAAAACTTCGGCGTCATTACAACAGCCAGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGTGCTCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCACTGTTGAAATACAATTTATCGCCAACAA

# Questionable array : NO    Score: 6.24
#     Score Detail : 1:0, 2:3, 3:0, 4:0.98, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
# Primary repeat :     GAGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     R [3,6] Score: 0.37/0.37
#     Reference repeat match prediction:         R [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      R [2-0] Score: 0.41/0.41
#     AT richness analysis in flanks prediction: R [41.7-70.0]%AT Score: 0.27/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         R [0,5.92   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 2 121267-120811          **** Predicted by CRISPRDetect 2.4 ***
>NZ_RYBC01000004.1 Escherichia coli strain D008p NODE_4_length_210817_cov_33.6235, whole genome shotgun sequence          Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
    121266      29   100.0      32  .............................  CATATCAAGAATTTATGGACTCGGGTGGCAAA
    121205      29   100.0      32  .............................  GCAAAAACCGGGCAATCGCAAAAAGGCGTAAT
    121144      29    96.6      32  ............................T  GTGTTTGCGGCATTAACGCTCACCAGCATTTC
    121083      29   100.0      32  .............................  ACGTGGTCATGGGTGCTGCTGTTGCAGAGCCA
    121022      29   100.0      32  .............................  AGCAGATACACGGCTTTGTATTCCGTGCGCCC
    120961      29   100.0      32  .............................  AATAGCAATAGTCCATAGATTTGCGAAAACAG
    120900      29   100.0      32  .............................  GAGCCTGACGAGACTACTGAGGCCGTTCTGTC
    120839      29    93.1       0  .A..........................A  |
==========  ======  ======  ======  =============================  ================================  ==================
         8      29    98.7      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   TGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA
# Right flank :  CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATATTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTT

# Questionable array : NO    Score: 6.20
#     Score Detail : 1:0, 2:3, 3:0, 4:0.94, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
# Primary repeat :     GTGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     R [4,5] Score: 0.37/0.37
#     Reference repeat match prediction:         R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      R [2-1] Score: 0.41/0.41
#     AT richness analysis in flanks prediction: NA [68.3-75.0]%AT Score: 0/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         R [0,5.65   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//