Array 1 2002885-2003401 **** Predicted by CRISPRDetect 2.4 *** >NZ_CP029164.1 Escherichia coli strain 104 chromosome, complete genome Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 2002885 29 100.0 32 ............................. ACGTAACAAAACAACAGCAAAATATTATCGAC 2002946 29 100.0 32 ............................. GCTCCGCCGGTTTGATCTCCGGTTTGCGCTGT 2003007 29 100.0 32 ............................. GCAATTAATTTAGTTCCAGATGCTGCGAAAGA 2003068 29 100.0 32 ............................. TTGGTCACTCGTCAAAAGTCGAGACGGTCGAA 2003129 29 100.0 32 ............................. AATGATTGATATAAATCTGTGTACGGTGTCCG 2003190 29 93.1 32 .........G..C................ CTAGGATAAATTAAAAGACAAAATTGCAGCAA 2003251 29 93.1 32 .........G..G................ GAGCGACCAGTATCAAGATCGACAGGTTTTGC 2003312 29 93.1 32 .........G..C................ ATCGATATGTACGTTAGCGAGGGGATCACGCA 2003373 29 86.2 0 .........G.AC...............A | ========== ====== ====== ====== ============================= ================================ ================== 9 29 96.2 32 GAGTTCCCCACGTCAGCGGGGATAAACCG # Left flank : TGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCATCCATCTTGCCCATCCTGAACTAATCATTATCATTCTCTACAAATTCTGTGGCGTTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTGTAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA # Right flank : ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCTGTACTTGTAATAAGCAACATATCCACATAACACCTCATGTTCAAAATAGTTCTCCATGCCAGATAAGTTCACAATTATCGATACAAAAAATCAAATTTAATCAAAGTGTTATTTGTATAACCCTTAAACCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTGTATTTTTTAAGCTTGTTATTCATTGATTAAGTAATAAATCTGGAAATTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATAGGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTT # Questionable array : NO Score: 6.07 # Score Detail : 1:0, 2:3, 3:0, 4:0.81, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCACGTCAGCGGGGATAAACCG # Alternate repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [7,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GAGTTCCCCACGTCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-7.00,-10.10] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-10] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [75.0-68.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.28,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 2 2029103-2030107 **** Predicted by CRISPRDetect 2.4 *** >NZ_CP029164.1 Escherichia coli strain 104 chromosome, complete genome Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 2029103 29 100.0 32 ............................. CATTGAAAACATTGCCTTTATTTTATTTTTTG 2029164 29 100.0 32 ............................. GTGCCGCCGCTGGGCACTTCCTTCCCGTGAGT 2029225 29 100.0 32 ............................. CGGACGGTGGGAATATAGAAAATCCGTCCACC 2029286 29 100.0 32 ............................. TTATACTCTTTTCATCGACTAAGGAGGGGAGG 2029347 29 100.0 32 ............................. CAATAACGCAGCATCCAGGAAGCTGTTTCCGC 2029408 29 100.0 32 ............................. CGATCGGTGAAGAGGTCCGCGAAATACTCACT 2029469 29 100.0 32 ............................. GCGATAGTTGATTCAGCCGCGCCAGCGAATGT 2029530 29 100.0 32 ............................. GGGCCGCCGCGAATTTACACACGATTCAATAC 2029591 29 100.0 32 ............................. AACTGGTGCGCGACGGCTGGCTAACAAAACGA 2029652 29 100.0 32 ............................. CGTGGCTGCGCTGGCCGTTGCAGCAGTTTGAT 2029713 29 100.0 32 ............................. CGTAAACGCCCCGTCGCCATTAATTTCGGGGT 2029774 29 100.0 32 ............................. TGGGATGAGCAAATAACGTCGTTTCCTAGAAA 2029835 29 100.0 32 ............................. CCGCCGTGCCAGTGATCCTCATACGGCCTGTT 2029896 29 100.0 32 ............................. AAATTAAGAACGGCGTAAACGACGGCAGCATG 2029957 29 96.6 32 C............................ GTGATGATTCAGAGCAGACATTAGCCCGCGCT 2030018 29 100.0 32 ............................. GGTAAAAACACGGTCTGAACCGACATTCATGT 2030079 29 96.6 0 ............T................ | ========== ====== ====== ====== ============================= ================================ ================== 17 29 99.6 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GTGCTTGCCGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGCGATAGCGGTCACCGGGGACGCGGCGGATGAGTATGGTCGTCGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGCTCGCGATCTGGCTACTTGAAGTGCGTGCCGGTGTTTATGTCGGAGATACGTCCAAACGTATTCGGGAGATGATCTGGCAGCAAATCTCTCAACTGGCAGGTTGCGGAAATGTAGTGATGGCCTGGGCGACCAATACCGAGTCAGGTTTTGAATTTCAGACCTGGGGTGAAAATAGACGTATTCCGGTGGATTTGGATGGGTTGCGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAAGTTATCCGTTCTTTAAAAATAAGGAAATGTTTTAATTTAGTTGGTAGATTGTTGATGCGGAATAAATTTGTTTAAAAACAGTTATGTATGCTTAGT # Right flank : GGACGCACTGGATGCGATGATGGATATCACTTAGAATTCCCCGCCCCTGCGGTAGAACTCCCAACTCCCATTTTCATACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCAGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAATTTGTTGCTTCTACCGAAAGTACGGCAATACCGGCTTTGTCGAAAACTTCGGCGTCATTACAACAGCCAGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACA # Questionable array : NO Score: 6.24 # Score Detail : 1:0, 2:3, 3:0, 4:0.98, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [71.7-45.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //