Array 1 1003756-1004150    **** Predicted by CRISPRDetect 2.4 ***
>NZ_UGCB01000003.1 Escherichia coli strain NCTC9079, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
   1003756      29   100.0      32  .............................  GCAAAAACCGGGCAATCGCAAAAAGGCGTAAT
   1003817      29    96.6      32  ............................T  GTGTTTGCGGCATTAACGCTCACCAGCATTTC
   1003878      29   100.0      32  .............................  ACGTGGTCATGGGTGCTGCTGTTGCAGAGCCA
   1003939      29   100.0      32  .............................  AGCAGATACACGGCTTTGTATTCCGTGCGCCC
   1004000      29   100.0      32  .............................  AATAGCAATAGTCCATAGATTTGCGAAAACAG
   1004061      29   100.0      32  .............................  GAGCCTGACGAGACTACTGAGGCCGTTCTGTC
   1004122      29    93.1       0  .A..........................A  |
==========  ======  ======  ======  =============================  ================================  ==================
         7      29    98.5      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   TGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA
# Right flank :  ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATATTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTT

# Questionable array : NO    Score: 6.19
#   Score Detail : 1:0, 2:3, 3:0, 4:0.93, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     F [5,4] Score: 0.37/0.37
#   Reference repeat match prediction:         F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  F [-13.50,-12.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      F [1-2] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: NA [75.0-68.3]%AT Score: 0/0.27
#   Longer leader analysis prediction:         NA
#   ----------------------------------------------------------------------------
#   Final direction:         F [5.65,0 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 2 1030123-1031370    **** Predicted by CRISPRDetect 2.4 ***
>NZ_UGCB01000003.1 Escherichia coli strain NCTC9079, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
   1030123      29   100.0      32  .............................  AACATTAAATTGCCGTGCTGGATGAATCGCGG
   1030184      29    96.6      32  ............................C  TGGATCAGCTGGTTCAATCGTTTACAGCACTG
   1030245      29   100.0      32  .............................  TCGAAAAACTTCGTCCTGAAAAAATTCATCAT
   1030306      29   100.0      32  .............................  GCAGTAAATTATTTGACCTCGTTGATAATCCC
   1030367      29   100.0      32  .............................  GACGATCCCGAGATTCACAGAGTTACAACTAA
   1030428      29   100.0      32  .............................  GAAATTTTCACATGGATTGTAGCCCTGTATAT
   1030489      29   100.0      32  .............................  GCCAGGCGGTCGGATTAGGTGCAAATCAAAAT
   1030550      29   100.0      32  .............................  CGCTGGCGGTGCGAGTGCTGGAGGCGCTGAAA
   1030611      29   100.0      32  .............................  CAGGAGACGGCCAGCCGGAACGGCGGCGGCGT
   1030672      29   100.0      32  .............................  TCGGGCGGCTCTGGTGTTCCTGACATGGCGGC
   1030733      29   100.0      32  .............................  AATTTGCATCACATAGGGGCGAGGCTCAGTGA
   1030794      29   100.0      32  .............................  TTCTTGCGGGTGTTGCAAATATTCTCCACGTA
   1030855      29   100.0      31  .............................  CCGCGCAAATCCAGCGAGCCGCCGACGCTCA
   1030915      29   100.0      32  .............................  TTGCAAACCGTGGCAAACGCAATTAACAAAAA
   1030976      29   100.0      32  .............................  AGTTAAGAGGGGGTTTTTCCCCACCGTTCAGG
   1031037      29   100.0      32  .............................  GTCATCACGATGAATCAAAATTTCGCCCGGCT
   1031098      29   100.0      32  .............................  CTTCTCCACCGTTTGGCGAATCGGTGTGAGGG
   1031159      29   100.0      32  .............................  ATTGTTATAATTATTTATTGAAATATCATTCC
   1031220      29   100.0      32  .............................  AATCTATTGTGAATTTGAAATGGTCCAGCACT
   1031281      29   100.0      32  .............................  GGTAAAAACACGGTCTGAACCGACATTCATGT
   1031342      29    96.6       0  ............T................  |
==========  ======  ======  ======  =============================  ================================  ==================
        21      29    99.7      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   GTGCTTGCCGCTGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACCTTAGGCGATAGCGGTCATCGAGGACGCGGCGGATGAGTATGATCGTAGTTGTAACGGAAAATGTTCCTCCACGTTTACGTGGACGGCTCGCTATCTGGCTACTGGAAGTGCGTGCCGGTGTGTATGTCGGAGATACATCCAAACGTATTCGGGAGATGATCTGGCAGCAAATCTCTCAACTGGCAGGTTGCGGAAATGTAGTGATGGCTTGGGCGACCAATACCGAGTCAGGTTTTGAATTTCAGACCTGGGGTGAAAATAGACGTATTCCGGTGGATTTGGATGGGTTGCGTTTGGTTTCTTTTCTTCCTTTTGATAATCAATAAGTTATCCGTTCTTTAAAAATAAGGAAATGTTTTAATTTAGTTGGTAGATTGTTGATGCGGAATAAATTTGTTTAAAAACAGTTATGTATGCTTAGT
# Right flank :  GGGCGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAACGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGCTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCAGTGCCTTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCTATTCCGTGACTGCGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAAC

# Questionable array : NO    Score: 6.25
#   Score Detail : 1:0, 2:3, 3:0, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     F [5,4] Score: 0.37/0.37
#   Reference repeat match prediction:         F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  F [-13.50,-12.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      F [1-1] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: F [71.7-40.0]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
#   ----------------------------------------------------------------------------
#   Final direction:         F [5.92,0 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//