Array 1 3753536-3752897 **** Predicted by CRISPRDetect 2.4 *** >NZ_CP023541.1 Escherichia coli O104:H21 str. CFSAN002236 strain ATCC BAA-178 chromosome, complete genome Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 3753535 29 100.0 32 ............................. CCTACGCCATGTGGATTAGTCCGGCGTTTCAT 3753474 29 100.0 32 ............................. TACCGCTGTTCATGCCGGAAATTGACCGGACC 3753413 29 100.0 32 ............................. CAGCTCGCAGCGCTCGGAACGTGGCGCTATAG 3753352 29 100.0 32 ............................. GTTTCTCATATTTCGCGTCAATCTCCACCAGG 3753291 29 100.0 32 ............................. ACTGATAACGCCGGAGCCTTCTTTTTCAGCAA 3753230 29 100.0 32 ............................. AGGTTGACGTTGATTTTGTTCGTTATGTTGCC 3753169 29 100.0 32 ............................. GCGTCTCGAGCGCGGGACGATTCAAAACCAGC 3753108 29 100.0 32 ............................. CCAAAGAAGAACAACGAGCCAACTGGTTTCAG 3753047 29 100.0 32 ............................. GCAATTTGTTGTCCGCGATCCGGTACGCGCGT 3752986 29 100.0 32 ............................. CGGCTATGGAATTTATGGAGAAGTTTGGTTTT 3752925 29 100.0 0 ............................. | ========== ====== ====== ====== ============================= ================================ ================== 11 29 100.0 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GTCCTTGCCGCAGGTGAAATTGAACCGCCACAACCTGCGCCGGATATGTTACCGCCAGCCATCCCCGAACCTGAAACGTTGGGCGATAGCGGTCATCGAGGACGCGGCTGATGAGCATGGTCGTGGTTGTTACAGAAAATGTCCCGCCGCGCTTACGTGGACGGCTCGCAATCTGGCTACTGGAAGTGCGTGCCGGTGTGTATGTTGGTGATACATCAAAACGTATTCGGGAGATGATCTGGCAGCAAATTACCCAACTGGCAGGTTGTGGAAATGTGGTCATGGCCTGGGCGACAAATACTGAGTCTGGTTTTGAGTTTCAGACCTGGGGCAAAAACAGACGTATTCCGGTGGATTTGGATGGGTTACGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAAGTTAGACGTTCTTTAAAAATAAGGAAATGTTTGAATTTAGTTGGTAGATTGTTGATGTGGAATAAATTTGTTTAAAAACAGATATGTATGCTTAGT # Right flank : GGCGCACTGGATGCGATGATGGATATCACTTAGAATTCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAACGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGCTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCAGTGCCTTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCTATTCCGTGACTGCGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACAA # Questionable array : NO Score: 6.26 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [4,5] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.00,-13.50] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: R [43.3-73.3]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.51 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 2 3780121-3779238 **** Predicted by CRISPRDetect 2.4 *** >NZ_CP023541.1 Escherichia coli O104:H21 str. CFSAN002236 strain ATCC BAA-178 chromosome, complete genome Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 3780120 29 100.0 32 ............................. GGATTTAGATAAATAGATGAATGCACCGAAGC 3780059 29 100.0 32 ............................. CCGAGTTCCTGAAATGATTTGGCGGCACCAGA 3779998 29 100.0 32 ............................. ATTACTGCGTTTAACTGCCACATCGGGGAGCT 3779937 29 100.0 32 ............................. TTCTGATACGTTCCGGTTTTCGTTTGCACCGA 3779876 29 100.0 32 ............................. CTGACACCAGCCTGAGTAGCCAGGCTGTTAAA 3779815 29 100.0 32 ............................. TGTGTTGATAACCTTTTGAACGCATTCAAGGG 3779754 29 100.0 32 ............................. GTATTTTTGCTGACGGATTCTCAGAGAGTTTC 3779693 29 100.0 32 ............................. TTTCTATCTCCCAGTGGGAGAGAGATGACAGT 3779632 29 100.0 32 ............................. GTTTTGGCGATATCACCTGATGCCTGCAATCC 3779571 29 96.6 32 ..A.......................... CCAATCGAGCAGACTTTTCCCGTCTATGCGTT 3779510 29 100.0 32 ............................. GCGATCTCGCGGAATACACCGACGAGGCGGGC 3779449 29 100.0 32 ............................. TAAGGCCGTCGCCGGATCAGCCTGGCTATGCC 3779388 29 93.1 32 .A.C......................... TTCTTGCGGGTGTTGCAAATATTCTTCACGTA 3779327 29 93.1 32 .A.C......................... GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT 3779266 29 93.1 0 .A..........................A | ========== ====== ====== ====== ============================= ================================ ================== 15 29 98.4 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : TGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA # Right flank : CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATGTTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTT # Questionable array : NO Score: 6.18 # Score Detail : 1:0, 2:3, 3:0, 4:0.92, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [4,5] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.00,-13.50] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [2-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [68.3-75.0]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.65 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //