Array 1 21130-20735 **** Predicted by CRISPRDetect 2.4 *** >NZ_JAJCFO010000011.1 Escherichia coli strain MSK.23.108 EKGELHAJ_11, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 21129 29 100.0 32 ............................. GCAAAAACCGGGCAATCGCAAAAAGGCGTAAT 21068 29 96.6 32 ............................T GTGTTTGCGGCATTAACGCTCACCAGCATTTC 21007 29 100.0 32 ............................. ACGTGGTCATGGGTGCTGCTGTTGCAGAGCCA 20946 29 100.0 32 ............................. AGCAGATACACGGCTTTGTATTCCGTGCGCCC 20885 29 100.0 32 ............................. AATAGCAATAGTCCATAGATTTGCGAAAACAG 20824 29 100.0 32 ............................. GAGCCTGACGAGACTACTGAGGCCGTTCTGTC 20763 29 93.1 0 .A..........................A | ========== ====== ====== ====== ============================= ================================ ================== 7 29 98.5 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : TGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA # Right flank : CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATATTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTT # Questionable array : NO Score: 6.19 # Score Detail : 1:0, 2:3, 3:0, 4:0.93, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [4,5] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.00,-13.50] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [2-1] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [68.3-75.0]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.65 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 4933-5632 **** Predicted by CRISPRDetect 2.4 *** >NZ_JAJCFO010000026.1 Escherichia coli strain MSK.23.108 EKGELHAJ_26, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 4933 29 100.0 32 ............................. ACAATCCCACGCCGATAATCTCTATACAGCAA 4994 29 100.0 32 ............................. GGCACGGAATTGTTATGCTGTTCCCCTGACCG 5055 29 100.0 32 ............................. ATCCGCCGCCGGTTAACGCTGGACCAGTTCCG 5116 29 100.0 32 ............................. GGCGAGTCCGTCAGCGGTGCGCCGCTGCAACA 5177 29 100.0 32 ............................. GGACAATGTGAAAAGCTTAATATTCATTACAT 5238 29 100.0 32 ............................. CGACGTTTTCTAATATCACCCAGCAATCAATT 5299 29 100.0 32 ............................. ATTTCATCAAAGCATTAAGGGATGGAATAAAG 5360 29 100.0 32 ............................. TCATGAATATGGGGAAAACGAACAATCTGTTT 5421 29 100.0 32 ............................. ATGACCATTGGTGAACGCATCCGCTTTCGCCG 5482 29 100.0 32 ............................. CCAGGACAGGCCGTGACGGTTGCCATTGAGTC 5543 29 100.0 32 ............................. TTTTTGTTCTCTTCAAAACGCCGAACAACCAA 5604 29 93.1 0 ............T.....A.......... | ========== ====== ====== ====== ============================= ================================ ================== 12 29 99.4 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : CGTGCTTGCTGCTGGAGAAATACAACCGCCGGCCCCACCTGAAGATGCACAGCCTGTTGCCATTCCGCTTCCCGTTTCTCTGGGAGATGCCGGACATCGGAGTAGCTGAGATGAGTATGTTGGTCGTGGTCACTGAAAATGTACCTCCGCGCTTACGAGGCAGATTAGCCATCTGGTTGTTGGAGGTACGTGCAGGGGTATATGTAGGTGATGTATCCGCAAAAATTCGTGAAATGATCTGGGAACAAATAGCTGGACTGGCGGAAGAAGGCAATGTAGTGATGGCATGGGCAACGAATACGGAATCGGGATTTGAGTTCCAGACATTTGGGGTAAACAGGCGTACCCCGGTAGATTTGGATGGTTTAAGGTTGGTATCTTTTTTACCTGTTTGAAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACTTTTAGA # Right flank : GGACGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCCAACTCCCATTTTCATACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCAGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAATTTGTTGCTTCTACCGAAAGTACGGCAATACCGGCTTTGTCGAAAACTTCGGCGTCATTACAACAGCCAGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGTGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCACTGTTGAAATACAATTTATCGCCAACA # Questionable array : NO Score: 6.23 # Score Detail : 1:0, 2:3, 3:0, 4:0.97, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [70.0-41.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //