Array 1 201226-203084 **** Predicted by CRISPRDetect 2.4 ***
>NZ_SHKG01000004.1 Escherichia coli strain EC_02 NODE_4_length_209830_cov_16.356972, whole genome shotgun sequence Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
    201226      29   100.0      32  .............................  GCATTGACGCTTTAAACGACGACGGACGCCAC
    201287      29   100.0      32  .............................  AAAACAGCCTTTAGATTAGTACCTGACGACCG
    201348      29   100.0      32  .............................  TAAACGCACCTGGCGCGCCACTTTATCAACAA
    201409      29   100.0      32  .............................  CGGCTTGTTTAATTGCGTGGAACGTCTCAATT
    201470      29   100.0      32  .............................  ACGGCGTGGATTGAGGGACGGGTATTTGGTCC
    201531      29    96.6      32  ............................T  AGATCGCGCCACGAGGAAACGAATATGAACGG
    201592      29   100.0      32  .............................  TAAACGCACCTGGCGCGCCACTTTATCAACAA
    201653      29   100.0      32  .............................  CGGCTTGTTTAATTGCGTGGAACGTCTCAATT
    201714      29   100.0      32  .............................  ACGGCGTGGATTGAGGGACGGGTATTTGGTCC
    201775      29    96.6      32  ............................T  AGATCGCGCCACGAGGAAACGAATATGAACGG
    201836      29   100.0      32  .............................  TGCCGCCAGGCCAGCGACACATCAGACAACTG
    201897      29   100.0      32  .............................  GTCTGTGATGGCCTGCTCGTGAGTCCGCGGCG
    201958      29   100.0      32  .............................  TTTTGATTTCATTAACGGCGCTCCCCATATTT
    202019      29   100.0      32  .............................  TGCGCCGTAGCGTGTCCACCTATTGTAGTAAA
    202080      29   100.0      32  .............................  ATACAAACGCGGTGTTTATCAATATGAATTTT
    202141      29   100.0      32  .............................  GCACCACGCGTACCCCGATGTTTGTTTTGCCA
    202202      29   100.0      32  .............................  ATCCGGCTATATCTTTGAGCATTACAGAAATA
    202263      29   100.0      32  .............................  AAAATTCTGTGTTTCGACCATTACTTCGGTAA
    202324      29   100.0      32  .............................  ATTCTTGATCACGCTTTTACCGAAGTAATGGT
    202385      29   100.0      32  .............................  GGGGTAGAATTATTCTTCGTGAGCGATTTATC
    202446      29   100.0      32  .............................  CGGCGTTCCGTGCGGCAATTGGAATCACACCA
    202507      29   100.0      32  .............................  CGTTCTGAATCCGATATTCTTCAGCACCTTCA
    202568      29   100.0      32  .............................  AGCGTCAATCAGCGCGTCTATCGCGTCACTTT
    202629      29   100.0      32  .............................  ATTTGGGGGTATGAGAGCGCCGAGCCGTTCGG
    202690      29   100.0      32  .............................  GCTCCCTGTCAGTTGTAATCGATAACGTTGAT
    202751      29   100.0      32  .............................  ATGTAGGGGCAATCGAACGATTCTCTGCCGAC
    202812      29   100.0      32  .............................  CCGAGCCCGATTATCGGCATGAGCGATGCGGA
    202873      29   100.0      32  .............................  TCGAAGAAGAAAGGGAAATAATGCGAGGAACG
    202934      29   100.0      32  .............................  TATTACGCGCCAGCAATGCTGACAGCGGCAAA
    202995      29   100.0      32  .............................  CGCGAGAGCCAGCAAAACGCCAGGGCACAAAA
    203056      29    93.1       0  .A..........................A  |
==========  ======  ======  ======  =============================  ================================  ==================
        31      29    99.6      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   TGGATGAACTATTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTACTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA
# Right flank :  ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACCCCTCATGTTCAAAATAGTTCTCCATGCCAGAGAAGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATGTTGAATTAATATCTATTAATTTTTTCTTTAGGTTAATAGTTTGTTTTTTAAGCTTGTTATTCATTGATTAAGTAATAAATCTGAAAATTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATTACCGTTTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTT

# Questionable array : NO  Score: 6.24
#   Score Detail : 1:0, 2:3, 3:0, 4:0.98, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :    GTGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :  NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:          NA Score: 0/4.5
#   A,T distribution in repeat prediction:      F [5,4] Score: 0.37/0.37
#   Reference repeat match prediction:          F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:   F [-13.50,-12.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction:       F [2-2] Score: 0.41/0.41
#   AT richness analysis in flanks prediction:  NA [75.0-68.3]%AT Score: 0/0.27
#   Longer leader analysis prediction:          NA
# ----------------------------------------------------------------------------
#   Final direction:  F [5.65,0 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 8916-9554 **** Predicted by CRISPRDetect 2.4 ***
>NZ_SHKG01000014.1 Escherichia coli strain EC_02 NODE_14_length_108048_cov_14.743709, whole genome shotgun sequence Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                    Insertion/Deletion
==========  ======  ======  ======  =============================  =================================  ==================
      8916      29   100.0      32  .............................  CGACAAAATTCTCAAAACTCGATCAGGAAAAT
      8977      29   100.0      31  .............................  CCACCGTTTTCGCCACCAGGGCGCACAACCC
      9037      29   100.0      32  .............................  GAAAAAGAGAAGGTAGAGAAAGCGGAATCTGG
      9098      29   100.0      32  .............................  CAGGTCTATCGGGCGATCAATAAAATCGGTCA
      9159      29   100.0      32  .............................  GCGCACCGTTGCGTCGAAAAGGCGCTGGAGAT
      9220      29   100.0      32  .............................  TACGCTTACACAACGGGCGAATATTTTAACGG
      9281      29   100.0      32  .............................  GAACCCAATAGTGAAATACAGCATCATTTTTT
      9342      29   100.0      33  .............................  GAGGCCTATATCTCTAACCGCATCGGGCTGCGC
      9404      29   100.0      32  .............................  GGGCAAATATAAATTCCAGCGTGCTTCATGAA
      9465      29   100.0      32  .............................  CTGCGTAGCGACCTTTGCTCTCAATTTCGTTG
      9526      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  =================================  ==================
        11      29   100.0      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   GTCCTTGCTGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGTGATAGTGGTCACCGGGGGCGCGGCGGATGAGCATGGTCGTGGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGCTCGCAATCTGGCTACTGGAAGTGCGTGCCGGTGTGTATGTTGGTGATACATCAAAACGTATTCGGGAGATGATCTGGCAACAAATTACCCAACTGGCTGGTTGCGGAAATGTGGTGATGGCCTGGGCGACCAATACCGAATCGGGTTTTGAATTTCAGACCTGGGGAGAAAACAGACGTATTCCGGTGGATTTGGATGGGTTACGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAGGTTATGTGTTCTTTAAAAATAAGGAAATGTTTGAATTTAGTTGGTAGATTGTTGATGTGGAATAAATTTGTTTAAAAACAGATATGTATGCTTAGT
# Right flank :  GGGCGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCCAACCCATCAAGACGCCTTCGCCAACTCTTTCACCAGAGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGCTGTTGATTATCAAACCTGACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCGGTGCCTTTCGGATAGTTTTTATTCAAACCAGGATTGGTCGTCGCGGCTATTCCCTGACTGCGCGCAATTGCCAGTGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAAC

# Questionable array : NO  Score: 6.26
#   Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :    GTGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :  NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:          NA Score: 0/4.5
#   A,T distribution in repeat prediction:      F [5,4] Score: 0.37/0.37
#   Reference repeat match prediction:          F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:   F [-13.50,-12.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction:       NA [0-0] Score: 0/0.41
#   AT richness analysis in flanks prediction:  F [73.3-40.0]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:          NA
# ----------------------------------------------------------------------------
#   Final direction:  F [5.51,0 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//