Array 1 3-371 **** Predicted by CRISPRDetect 2.4 ***
>NZ_SQMU01000091.1 Escherichia coli strain EETUKB159 NODE_979_length_342_cov_49.739765, whole genome shotgun sequence Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence               Spacer_Sequence                                                            Insertion/Deletion
==========  ======  ======  ======  ============================  =========================================================================  ==================
         3      28   100.0      32  ............................  TGACGCCATATGCAGATCATTGAGGCGAAACC
        63      28   100.0      32  ............................  GGAAGAGACGGATGTTGACCAGCGAAATCCGA
       123      28   100.0      32  ............................  ATCGATATGCGAACAGAAAAACCGTTCTCAGC
       183      28   100.0      73  ............................  NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTCGCGTCATGGAATGGGCGAAATTGCGCAA
       284      28   100.0      32  ............................  TGGAGGAAGACGTCTGATGCCGAAATATTTAA
       344      28   100.0       0  ............................  |
==========  ======  ======  ======  ============================  =========================================================================  ==================
         6      28   100.0      40  GTTCACTGCCGTACAGGCAGCTTAGAAA

# Left flank : GCG
# Right flank : G

# Questionable array : NO Score: 5.63
# Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:-0.62, 8:1, 9:1,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
# Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
# A,T distribution in repeat prediction: F [8,6] Score: 0.37/0.37
# Reference repeat match prediction: F [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: F [-8.00,-7.70] Score: 0.37/0.37
# Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41
# AT richness analysis in flanks prediction: NA [0.0-0.0]%AT Score: 0/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: F [5.24,0 Confidence: HIGH]

# Array family : I-F [Matched known repeat from this family],
//

Array 1 195085-194937 **** Predicted by CRISPRDetect 2.4 ***
>NZ_SQMU01000053.1 Escherichia coli strain EETUKB159 NODE_284_length_689885_cov_52.922344, whole genome shotgun sequence Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence               Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  ============================  ================================  ==================
    195084      28   100.0      32  ............................  TTAAAATAAATGCAACGGACAAAGAAGCCATT
    195024      28   100.0      32  ............................  GAATATTTTGGAAAAATAGCTATCAATCCGGG
    194964      28    85.7       0  ....................TC.TC...  |
==========  ======  ======  ======  ============================  ================================  ==================
         3      28    95.2      32  GTTCACTGCCGTACAGGCAGCTTAGAAA

# Left flank : CTTTTTATGCGCGCCATGCCCACCGCATACAAAAAAAGCCCGTACTTTCGTACGAGCTCTTCTTTAAAGATGGCGGTGAGGCGGGGATTCGAACCCCGGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGCTGCCAGACCTCATAGGTGGCAACGGGGCGCTACTATAGGGAGTTGGAGTGAAACGGTCAAGAAGAATTTATATAGATTGATTTGTTTGGTTACGCAATGAACACGCTGTTCGCGGGACGGAGATTATGACCGTATGTGTTCTGGTCAATTGTTTATCAAAAGCTATGCAGAAAATATGAGATTGAAGAAATACCAAACCGACCCTTTTTCTAGGTTGTAATGTAACTCATTGATTTTCTTATTGCTATTTTGAAGTCTGGAAAAAGGGTTTGAATCTGCGATTTTGTAAGTTTTAACAGTAAATCAATCGGATAGTCTGCTATTATTCCA
# Right flank : CTCAACACTCCATCCTCTAATATTTATTCCCCATAACTCATAGACGCAAAAAAGGCCGGTTAAACCGACCTTTTACTCATTCTTTCTCTTCGCCCATCAGGCGGTAAAACAATCAGCGACTACGGAAGACAATGCGGCCTTTGCTCAGGTCGTACGGGGTCAGTTCAACAGTCACTTTGTCGCCCGTCAGGATGCGGATGTAGTTTTTGCGCATTTTACCGGAGATGTGTGCAGTAACCACGTGACCGTTTTCTAACTCTACGCGGAACATGGTATTAGGCAACGTTTCAAGAACGGTACCTTGCATTTCAATATTGTCTTCTTTGGCCATCTAATCCTCTGGGGTATCACTACCGTAATTTGAACCGGCAAGATAATGCCGAAGTTCTGTAAATAAGTAAAGATTTGCGCGCTAAATCGCAACAAACAGGTTCGGCACATTACTCCGAAAACACACGGCTAAGCCGCACCAAAAGCGCAACGTATAAGGGAGCGGTGAG

# Questionable array : NO Score: 5.12
# Score Detail : 1:0, 2:3, 3:0, 4:0.76, 5:0, 6:0.25, 7:0.02, 8:0.4, 9:0.69,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
# Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
# A,T distribution in repeat prediction: NA [4,4] Score: 0.37/0.37
# Reference repeat match prediction: R [matched GTTCACTGCCGTACAGGCAGCTTAGAAG with 100% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: NA [0.00,0.00] Score: 0/0.37
# Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41
# AT richness analysis in flanks prediction: NA [65.0-68.3]%AT Score: 0/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: R [0,4.5 Confidence: HIGH]

# Array family : I-F [Matched known repeat from this family],
//

Array 2 204722-204143 **** Predicted by CRISPRDetect 2.4 ***
>NZ_SQMU01000053.1 Escherichia coli strain EETUKB159 NODE_284_length_689885_cov_52.922344, whole genome shotgun sequence Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence               Spacer_Sequence                            Insertion/Deletion
==========  ======  ======  ======  ============================  =========================================  ==================
    204721      28   100.0      32  ............................  ATAGTTTGTGCGTCTTTTGAGTCAGTCCACGC
    204661      28   100.0      32  ............................  AGGGATTGATGGCGGAAGTCAGAGCACAAAAG
    204601      28   100.0      32  ............................  GTTCCACGCGGATATCTGTAATTTCCAGCAAA
    204541      28   100.0      32  ............................  GTGCGGACGTGATGAATCTCCGCTGGGCTTTC
    204481      28   100.0      32  ............................  ACGTTCGCACCGGTCAGGGTACTGCGCAGCGT
    204421      28   100.0      32  ............................  TCAGCCGGAGGCTCTCAATTTCAGCCGCGCGG
    204361      28   100.0      32  ............................  AGCACGGCTGCGGGGAATGGCTCAATCTCTGC
    204301      28   100.0      41  ............................  TGATGGCGCAGCAGTCCTCCCTCCTGCCGNNNNNNNNNNCA
    204232      28   100.0      32  ............................  CTGAACGTTGAAGAGTGCGACCGTCTCTCCTT
    204172      28    85.7       0  ....................T...C.CT  |                                          C,A [204146,204149]
==========  ======  ======  ======  ============================  =========================================  ==================
        10      28    98.6      33  GTTCACTGCCGTACAGGCAGCTTAGAAA

# Left flank : AAATTCATCGTCGAGTTGCAGGTTCAGTTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGCTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGAGCTGACTTACGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGGTTAAGTCCGTGATCTCGTCAGGGGTTACGGACTTTTTATTTATGGGGGGAGGAGGTTCAGACCCTTTTTTTAATGATGATGCTAAGTTATTGATAATTAGTGCTGCGGGTAGGTAAGGATAAAAAAGGGTGGCAGCAGGAGATTGAGATGGTTTTGCTTTATTAACAACGGGCTAAACGTGTAGTATTTGA
# Right flank : ACGATAGTGTTAGACGTTTGGTCGTGCAATGACACTCTCAACTTCAAACCATTAGCGTTAGCACGCAATAACAATCGTAATAATTGCGATGGAAATCAATTTTCAGCACATAAATCAATGCTGTACTAAGCCCAATACCTTCAAATATAAAATAATCACAGGATGTGTTTATGTCTTCGAATTACCTTACGCCTTCCGATCTCAAAACCATTCTCCACTCCAAACGCGCCAATATTTATTATCTGGAAAAATGCCGGGTACAGGTGAATGGTGGGCGGGTGGAGTATGTTACCAGCGAAGGTAAAGAGTCGTACTACTGGAATATCCCCATTGCGAATACCACGGCGTTGATGCTGGGAATGGGAACTTCCGTTACCCAGGCGGCGATGCGTGAATTTGCTCATGCCGGGGTGATGGTAGGCTTTTGTGGTACGGATGGCACGCCGCTGTATTCAGCAAATGAAGTGGATGTTGATGTCTCCTGGCTCAGCCCACAAAGT

# Questionable array : NO Score: 6.19
# Score Detail : 1:0, 2:3, 3:0, 4:0.93, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
# Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
# A,T distribution in repeat prediction: R [6,8] Score: 0.37/0.37
# Reference repeat match prediction: R [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: R [-7.70,-8.00] Score: 0.37/0.37
# Array degeneracy analysis prediction: R [6-0] Score: 0.41/0.41
# AT richness analysis in flanks prediction: NA [58.3-60.0]%AT Score: 0/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: R [0,5.65 Confidence: HIGH]

# Array family : I-F [Matched known repeat from this family],
//
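
Note on the Score lines: for each of the three arrays in this report, the value after "Score:" equals the sum of the nine "Score Detail" components (for example 0 + 3 + 0 + 1.00 + 0 + 0.25 - 0.62 + 1 + 1 = 5.63 for the first array). The sketch below checks that arithmetic. It is a small illustration, not part of CRISPRDetect: the function name `check_scores` and the regular expressions are assumptions made here, and only the score lines are copied from the report.

```python
import re

# Score lines copied from the three arrays in this report.
REPORT = """\
# Questionable array : NO Score: 5.63
# Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:-0.62, 8:1, 9:1,
# Questionable array : NO Score: 5.12
# Score Detail : 1:0, 2:3, 3:0, 4:0.76, 5:0, 6:0.25, 7:0.02, 8:0.4, 9:0.69,
# Questionable array : NO Score: 6.19
# Score Detail : 1:0, 2:3, 3:0, 4:0.93, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
"""


def check_scores(text):
    """Return (reported_score, summed_components) pairs, in file order.

    Hypothetical helper: assumes each 'Questionable array' line is
    followed by exactly one 'Score Detail' line, as in this report.
    """
    reported = [
        float(value)
        for value in re.findall(
            r"Questionable array\s*:\s*\w+\s+Score:\s*(-?[\d.]+)", text
        )
    ]
    summed = []
    for detail in re.findall(r"Score Detail\s*:\s*(.*)", text):
        # Each component looks like "<index>:<value>", e.g. "7:-0.62".
        values = re.findall(r"\d+:(-?[\d.]+)", detail)
        summed.append(round(sum(float(v) for v in values), 2))
    return list(zip(reported, summed))


if __name__ == "__main__":
    for reported, total in check_scores(REPORT):
        status = "ok" if reported == total else "MISMATCH"
        print(f"reported={reported:.2f} summed={total:.2f} {status}")
```

The sums are rounded to two decimals before comparison because the report prints scores at that precision.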