Array 1 2-335 **** Predicted by CRISPRDetect 2.4 *** >NZ_SQFM01000121.1 Escherichia coli strain NOR4_71 NODE_833_length_306_cov_47.993465, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 2 29 100.0 32 ............................. GCAGCAGATTTACAAACAGATCGGAGATTACT 63 29 100.0 32 ............................. CGCGGCCTGCGCTCGCGTAAAATCAGTTGCAG 124 29 100.0 32 ............................. TCCGCGCAAGCCTATATAAACCAGATTGATCA 185 29 100.0 32 ............................. TACAATCCCGCATCGAAACTGAATACCCCGAT 246 29 100.0 32 ............................. TCTGGGCCGGATTCGATCCCGCGCGTACCGGA 307 29 100.0 0 ............................. | ========== ====== ====== ====== ============================= ================================ ================== 6 29 100.0 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GG # Right flank : T # Questionable array : NO Score: 6.26 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: NA [0.0-1.7]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.24,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 66331-66125 **** Predicted by CRISPRDetect 2.4 *** >NZ_SQFM01000075.1 Escherichia coli strain NOR4_71 NODE_290_length_118913_cov_64.736481, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================ ================================ ================== 66330 28 100.0 32 ............................ TCGACGGGGTGCGGTAAAACCTTTGCGAACGC 66270 28 100.0 32 ............................ GCTGTATTCGGCAAATGAAGTGGATGTTGATG 66210 28 96.4 32 ..................T......... TTCGTACTTAACGAACTGTTGCTCCTCTGCTG 66150 26 82.1 0 ........T..A........A.-.-... | ========== ====== ====== ====== ============================ ================================ ================== 4 28 94.6 32 GTTCACTGCCGTACAGGCAGCTTAGAAA # Left flank : AAATTCATCGTCGAGTTGCAGGTTCAGTTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGTTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTGACGGTTGCGCTGGATAAAGAGAAAAATGAGCTGACTTATGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGGTGAAGTCCGTAATCTCGAAAGAGGTTACGGACTTTTTGTTTGTAGGCTGGGGCGGTGAAAACCCTATTTTTGGAGGTGAAGGTAAGTTGTTGATAATTAATGGTGCTGGAAGGTAAGAATAAAAAAGGGTGCCAGCAGGAAAATGAGATGATTTTGCTTTATTAACAACGAGCTAAACGTGTAGTATTTGA # Right flank : TGCGAAAAAAGCTCGTACTTTCGTACGCGCTTTTCTTTAAATATGACGGTGAGGGGGGGATTGACTCGCTTCGCTCGCCCTGCGGGCAGCCCACTCACTGTGTTCGTGGTCTGTCCAACTGGCTGCGCCAGTTGTCGAACCCCGGTCGGGGCTTCTCATCCCCCCTGGAGTGCAATATGCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTGAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGCTGCCAAACCTCATGGGTGGCAACGGGGCGCTACTATAGGGAGTTGGAGTAAAACGGTCAAGAAGAATTTTAATGATAATTATTGTTTGCTCATACTGTAAACAAGTTGTGCAGTATATCTACATCGAGACAAGTTATGGATTTATACTTCCAAAGTACTTCATACATATCACAAAATA # Questionable array : NO Score: 5.59 # Score Detail : 1:0, 2:3, 3:0, 4:0.73, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [6,8] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-7.70,-8.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [5-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: R [53.3-65.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.92 Confidence: HIGH] # Array family : I-F [Matched known repeat from this family], // Array 1 110808-111880 **** Predicted by CRISPRDetect 2.4 *** >NZ_SQFM01000136.1 Escherichia coli strain NOR4_71 NODE_1159_length_230229_cov_67.557579, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ======================================= ================== 110808 29 100.0 32 ............................. AACGACGCACAGGATATAATGCTTGGCCTGGG 110869 29 100.0 32 ............................. TGCGTGTCGGGCTGATTCGTCAGCTTCTTGGT 110930 29 100.0 32 ............................. ATGATCCAGCGCGGCGGCTCATATGAAACTGG 110991 29 100.0 32 ............................. TGACCGCCGATACGTTTGCTGGCGCGGCAGAA 111052 29 100.0 32 ............................. GTTACCGCGACTCATGCGACGATAAAAAATAC 111113 29 96.6 39 ............................N NNNNNNNNNGCCGACGGTCGCAGTGCTGGACCATTTCAA 111181 29 100.0 32 ............................. GTGCCGCGACTCACCAGATAGAAATAACGCAA 111242 29 100.0 32 ............................. GCTACCCCATTTGCACGCTGAGTTTGATTTCT 111303 29 100.0 32 ............................. GAGTAACCACGGTGGCAAAAATATCAGGGGTG 111364 29 100.0 32 ............................. CCACATCCAGCATTTGCTGAGGTGAAATCCAG 111425 29 100.0 32 ............................. CGATTCAAAAATAAAATGACGACGGAGGAGGC 111486 29 96.6 32 ....................T........ GCGGGATTGTTCCGTTTGCCCGCGCCACCAGC 111547 29 100.0 32 ............................. AAGGGGACGGCTACGGGACGCCGCCTATTGAC 111608 29 93.1 32 ............T......A......... GAAATAGCTTTTGCTGATCATCACGGTTTAAC 111669 29 100.0 32 ............................. AAGGGGACGGCTACGGGACGCCGCCTATTGAC 111730 29 100.0 32 ............................. TTTCGCGGTTGCGCAGAGCCGCCGCCGAGGCA 111791 29 100.0 32 ............................. GGTAAAAACACGGTCTGAACCGACATTCATGT 111852 29 96.6 0 ............T................ | ========== ====== ====== ====== ============================= ======================================= ================== 18 29 99.0 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GTGCTTGCCGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGCGATAGCGGTCACCGGGGACGCGGCGGATGAGTATGGTCGTCGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGCTCGCGATCTGGCTCCTGGAAGTGCGTGCCGGTGTTTATGTCGGAGATACGTCCAAACGTATTCGGGAGATGATCTGGCAGCAAATCTCTCAACTGGCAGGTTGCGGAAATGTGGTAATGGCCTGGGCGACCAATACCGAGTCGGGTTTTGAATTTCAGACCTGGGGTGAAAATAGACGTATTCCGGTGGATTTGGATGGGGTGCGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAAGTTATCTGTTCTTTAAAAATAAGGAAATGTTTTAATTTAGTTGGTAGATTGTTGATGCGGAATAAATTTGTTTAAAAACAGTTATGTATGCTTAGT # Right flank : GGACGCACTGGATGCGATGATGGACATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCCAGCTCCTATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCTTTATTCCCAAGATTCCAGTTTGTTGCTTCTACCGAAAGTACGGCAATACCGGCTTTGTCGAAAACTTCGGCGTCATTACAACAGCCAGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACA # Questionable array : NO Score: 6.21 # Score Detail : 1:0, 2:3, 3:0, 4:0.95, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [1-4] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [71.7-40.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //