Array 1 633-115 **** Predicted by CRISPRDetect 2.4 ***
>NZ_LQVS01000126.1 Escherichia coli strain GN03253 GCID_ECOLID_00135_NODE_70.ctg_1, whole genome shotgun sequence Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                    Insertion/Deletion
==========  ======  ======  ======  =============================  =================================  ==================
       632      29   100.0      32  .............................  ATAGAACGGGACGAGATTTTTAAACAATGGCT
       571      29   100.0      32  .............................  CAATCTGAGCCAGACGCGACGAATAAAAGCAT
       510      29   100.0      33  .............................  TTGACGTTGATTTTGTTCGTTATGTTGCCAGCC
       448      29   100.0      32  .............................  CTCTGATTCATCGGCGGCGATACTGTCATCAC
       387      29   100.0      32  .............................  GGCTGGTGGGTTCGGGTAACTGGTTTGCTGTC
       326      29   100.0      32  .............................  TACATGTTGATGACGTTTGCCAAATGCCATGG
       265      29   100.0      32  .............................  ATTATTAATTCTGGTGGCGCTGGTCGCCCTGG
       204      29   100.0      32  .............................  AGCGCGCGCGGGCTACTGCACTCGGTGATAAC
       143      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  =================================  ==================
         9      29   100.0      32  GAGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank : CTGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGTATTGCGCGTAATTGGCGTTTGTCGATGCAAACCCATAAATATTTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATATTTTGTCGCCTCTGAAAAACCTCAATTTTGCCCATCCTGGACTAATCATTATCATTCTCTACAAATTCTGTGGCGTTAATTTTTCGTTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGCGTTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTGTAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA
# Right flank : GCCAGAAAACATGAAAAAACTTTGGGAGGGGATGAGTTCCCATAAGCGCTAACTTAAGGGTTGAACCATCTGAAGAATGCGACGCCTCGGTGCCTCGTTAAGACGATGCCTCGCG

# Questionable array : NO Score: 6.26
# Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
# A,T distribution in repeat prediction: R [3,6] Score: 0.37/0.37
# Reference repeat match prediction: R [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: R [-12.00,-13.50] Score: 0.37/0.37
# Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41
# AT richness analysis in flanks prediction: R [56.7-75.0]%AT Score: 0.27/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: R [0,5.51 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 78-1021 **** Predicted by CRISPRDetect 2.4 ***
>NZ_LQVS01000121.1 Escherichia coli strain GN03253 GCID_ECOLID_00135_NODE_66.ctg_1, whole genome shotgun sequence Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
        78      29   100.0      32  .............................  CTCTTCAGCAATGAAATCGTCAAACGAGATTA
       139      29   100.0      32  .............................  CACAATTATTCGGTACGACGGGTTCAGGCATT
       200      29   100.0      32  .............................  GACCAGAGATGTCGTAGCCGTATTTCGCAGCC
       261      29   100.0      32  .............................  TCACCGGGTCAGATACTGATGTTATGGCTTAT
       322      29   100.0      32  .............................  AGGAGTTTAATTTCCAGATTGAGCGCTGGATA
       383      29   100.0      32  .............................  GGCACAAAAAAACCCGCGCACGGCGGGTTAAT
       444      29    96.6      32  ......T......................  GTGAGTCCGTCAGCGGTGCGCCGCTGCAACAC
       505      29   100.0      32  .............................  CTCGATCAGGAAAATGAATTCCTGGAAAAAAA
       566      29   100.0      32  .............................  CGTGGTCGGGATTGTTGCGCCAGTCTCCGGGG
       627      29   100.0      32  .............................  CACGGCTGGCCATTTGAAATACCTGTTGCTCT
       688      29    96.6      32  .T...........................  AACAGCGAGCCAACTGGTTTCAGATTGCTGAA
       749      29    96.6      32  .T...........................  TAAGGCCGTCGCCGGATCAGCCTGGCTATGCC
       810      29    96.6      32  ...C.........................  TTCTTGCGGGTGTTGCAAATATTCTTCACGTA
       871      29    96.6      32  ...C.........................  GAGCCTGACGAGACTACTGAGGCCGTTCTGTC
       932      29   100.0      32  .............................  GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT
       993      29    96.6       0  ............................A  |
==========  ======  ======  ======  =============================  ================================  ==================
        16      29    98.7      32  GAGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank : ATCGTGAGAGTAATTCATCGGCACGTTAAATCATATCAGGCGTAATACCACAACCCTTAAGTTAGCGCTTATGGGATG
# Right flank : ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATATTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTT

# Questionable array : NO Score: 6.20
# Score Detail : 1:0, 2:3, 3:0, 4:0.94, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
# A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37
# Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37
# Array degeneracy analysis prediction: F [0-1] Score: 0.41/0.41
# AT richness analysis in flanks prediction: R [56.7-68.3]%AT Score: 0.27/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: F [5.65,0.27 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 26080-26282 **** Predicted by CRISPRDetect 2.4 ***
>NZ_LQVS01000068.1 Escherichia coli strain GN03253 GCID_ECOLID_00135_NODE_2.ctg_1, whole genome shotgun sequence Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence               Spacer_Sequence                    Insertion/Deletion
==========  ======  ======  ======  ============================  =================================  ==================
     26080      28   100.0      33  ............................  AACCTACCGTCTTGGCTAGCGGTTGCAGCGAAC
     26141      28   100.0      32  ............................  GGAACAATCTTGCAAAGGCTGTGAAAGTTGGC
     26201      28   100.0      28  ............................  TTCACAGGTAACATACTCCACCCACCAT
     26257      26    85.7       0  ................A...-A..-...  |
==========  ======  ======  ======  ============================  =================================  ==================
         4      28    96.4      31  GTTCACTGCCGTACAGGCAGCTTAGAAA

# Left flank : GATAAATTCATCGTCGAGTTGCAGGTTCAGCTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGCTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGAGCTGACTTACGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGGTCAAGTCCGTAATCTCGAAAGAGATTGCGGACTTTTTATTTATGGGGTGGAGGTTCAGACCCTTTTTTTAATGATGATGGTAAGTTGTTGATAATTAGTGCTGCGGGAAGGTAAGGATAAAAAAGGGTGCTGCAGGAGAATGGGATGGTTTTGCTTTATTAACAACGGGCTAAACGTGTAGTATTTGA
# Right flank : ATGCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGCTGCCAAACCTCATGGGTGGCAACGGGGCGCTACTATAGGGAGTTGGAGTAAAACGGTCAAGAAGAATTTTAATGATAATTATTGTTTGCTCATACTGTAAACAACTTGTGCAGTATATCTACATCGAGACAGGTTATGGACTTATACTTCCAAAGTACTTCATACATATCACAAAATAAAAAGGCCGGTTAAACCGACCTTTTACTCGTTCTTTCTCTTCGCCCATCAGGCGGTAAAACAATCAGCGACTACGGAAGACAATGCGGCCTTTGCTCAGGTCGTACGGGGTCAGTTCAACAGTCACTTTGTCGCCCGTCAGGATGCGGATGTAGTTTTTGCGCATTTTACCGGAGA

# Questionable array : NO Score: 5.68
# Score Detail : 1:0, 2:3, 3:0, 4:0.82, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
# A,T distribution in repeat prediction: F [8,6] Score: 0.37/0.37
# Reference repeat match prediction: F [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: F [-8.00,-7.70] Score: 0.37/0.37
# Array degeneracy analysis prediction: F [0-4] Score: 0.41/0.41
# AT richness analysis in flanks prediction: NA [58.3-51.7]%AT Score: 0/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: F [5.65,0 Confidence: HIGH]

# Array family : I-F [Matched known repeat from this family],
//

Array 1 4088-5641 **** Predicted by CRISPRDetect 2.4 ***
>NZ_LQVS01000077.1 Escherichia coli strain GN03253 GCID_ECOLID_00135_NODE_26.ctg_1, whole genome shotgun sequence Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
      4088      29   100.0      32  .............................  TACTAAAAGTCTGGTTCACGAATATCAAAAGG
      4149      29   100.0      32  .............................  CCGTCGGCAGCGGCGTTAAATGGGGCGCGCTT
      4210      29   100.0      32  .............................  TGATAAATTGTCCGCCCTGGCGGAATACCTCA
      4271      29   100.0      32  .............................  ACGGCTTCATGTTCTTGGTGATGGGGTTCACC
      4332      29   100.0      32  .............................  AAAAAATGCGACGACCGCAGCCATTCCGATCT
      4393      29   100.0      32  .............................  AGATCAACACGGTAGATATTTCCGTGATTGGG
      4454      29   100.0      32  .............................  AGTCTTTAATCAAAATGGATTTTTATAATGAA
      4515      29   100.0      32  .............................  GCCGGGTTAAGAAGGTGTATGGATGGCCCGGA
      4576      29   100.0      32  .............................  GGTTACGCCTGCACAGAGTACAATGCGTGGGG
      4637      29   100.0      32  .............................  CGGTGGCAGTGATGAGGCGTTCCCAATTAATG
      4698      29   100.0      32  .............................  CGCACTCAAAATAGTAAATTAATTTATGAATT
      4759      29   100.0      32  .............................  CATCCGGCGCTGAACATCGCCACCTGCCTAAC
      4820      29   100.0      32  .............................  CGGTGATGCGCGGTATCGATCAGCATCCGGCT
      4881      29   100.0      32  .............................  GCTCATTTCAAATGGTCAGGTCCGGTGGTTTT
      4942      29    96.6      32  ............................C  TGATCACATCATGTTTATTCGCGGTCGTATTG
      5003      29   100.0      32  .............................  TACTGGAAAAAGCTGGCGACGGTGAGCGCAGC
      5064      29   100.0      32  .............................  GGCACGGAATTGTTATGCTGTTCCCCTGACCG
      5125      29   100.0      32  .............................  ATCCGCCGCCGGTTAACGCTGGACCAGTTCCG
      5186      29   100.0      32  .............................  GGCGAGTCCGTCAGCGGTGCGCCGCTGCAACA
      5247      29   100.0      32  .............................  GGACAATGTGAAAAGCTTAATATTCATTACAT
      5308      29   100.0      32  .............................  CGACGTTTTCTAATATCACCCAGCAATCAATT
      5369      29   100.0      32  .............................  ATTTCATCAAAGCATTAAGGGATGGAATAAAG
      5430      29   100.0      32  .............................  TCATGAATATGGGGAAAACGAACAATCTGTTT
      5491      29   100.0      32  .............................  TACAGTTTCCATAAATTCACCTCGTTTATATA
      5552      29   100.0      32  .............................  TTTTTGTTCTCTTCAAAACGCCGAACAACCAA
      5613      29    93.1       0  ............T.....A..........  |
==========  ======  ======  ======  =============================  ================================  ==================
        26      29    99.6      32  GAGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank : CGTGCTTGCTGCTGGAGAAATACAACCGCCGGCCCCACCTGAAGATGCACAGCCTGTTGCCATTCCGCTTCCCGTTTCTCTGGGAGATGCCGGACATCGGAGTAGCTGAGATGAGTATGTTGGTCGTGGTCACTGAAAATGTACCTCCGCGCTTACGAGGCAGATTAGCCATCTGGTTGTTGGAGGTACGTGCAGGGGTATATGTAGGTGATGTATCCGCAAAAATTCGTGAAATGATCTGGGAACAAATAGCTGGACTGGCGGAAGAAGGCAATGTAGTGATGGCATGGGCAACGAATACGGAATCGGGATTTGAGTTCCAGACATTTGGGGTAAACAGGCGTACCCCGGTAGATTTGGATGGTTTAAGGTTGGTATCTTTTTTACCTGTTTGAAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACTTTTAGA
# Right flank : GGACGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCCAACTCCCATTTTCATACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCAGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAATTTGTTGCTTCTACCGAAAGTACGGCAATACCGGCTTTGTCGAAAACTTCGGCGTCATTACAACAGCCAGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGTGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCACTGTTGAAATACAATTTATCGCCAACA

# Questionable array : NO Score: 6.24
# Score Detail : 1:0, 2:3, 3:0, 4:0.98, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
# A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37
# Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37
# Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41
# AT richness analysis in flanks prediction: F [70.0-41.7]%AT Score: 0.27/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: F [5.92,0 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//