Array 1 28579-28784 **** Predicted by CRISPRDetect 2.4 *** >NZ_LVRP01000354.1 Escherichia coli strain GN04729 GCID_ECOLID_00199_NODE_64.ctg_1, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================ ================================ ================== 28579 28 100.0 32 ............................ TCGACGGGGTGCGGTAAAACCTTTGCGAACGC 28639 28 100.0 32 ............................ GCTGTATTCGGCAAATGAAGTGGATGTTGATG 28699 28 96.4 32 ..................T......... TTCGTACTTAACGAACTGTTGCTCCTCTGCTG 28759 26 82.1 0 ........T..A........-A..-... | ========== ====== ====== ====== ============================ ================================ ================== 4 28 94.6 32 GTTCACTGCCGTACAGGCAGCTTAGAAA # Left flank : AAATTCATCGTCGAGTTGCAGGTTCAGTTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGTTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTGACGGTTGCGCTGGATAAAGAGAAAAATGAGCTGACTTATGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGGTGAAGTCCGTAATCTCGAAAGAGGTTACGGACTTTTTGTTTGTAGGCTGGGGCGGTGAAAACCCTATTTTTGGAGGTGAAGGTAAGTTGTTGATAATTAATGGTGCTGGAAGGTAAGAATAAAAAAGGGTGCCAGCAGGAAAATGAGATGATTTTGCTTTATTAACAACGAGCTAAACGTGTAGTATTTGA # Right flank : TGCGAAAAAAGCTCGTACTTTCGTACGCGCTTTTCTTTAAATATGACGGTGAGGGGGGGATTGACTCGCTTCGCTCGCCCTGCGGGCAGCCCACTCACTGTGTTCGTGGTCTGTCCAACTGGCTGCGCCAGTTGTCGAACCCCGGTCGGGGCTTCTCATCCCCCTTGGAGTGCAATATGCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTGAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCAC # Questionable array : NO Score: 5.59 # Score Detail : 1:0, 2:3, 3:0, 4:0.73, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [8,6] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-8.00,-7.70] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-6] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [65.0-53.3]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-F [Matched known repeat from this family], // Array 1 20898-19832 **** Predicted by CRISPRDetect 2.4 *** >NZ_LVRP01000124.1 Escherichia coli strain GN04729 GCID_ECOLID_00199_NODE_21.ctg_1, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 20897 29 100.0 32 ............................. ATGATCCAGCGCGGCGGCTCATATGAAACTGG 20836 29 100.0 32 ............................. TGCGTGTCGGGCTGATTCGTCAGCTTCTTGGT 20775 29 100.0 32 ............................. AACGACGCACAGGATATAATGCTTGGCCTGGG 20714 29 100.0 32 ............................. TGACCGCCGATACGTTTGCTGGCGCGGCAGAA 20653 29 100.0 32 ............................. GTTACCGCGACTCATGCGACGATAAAAAATAC 20592 29 100.0 32 ............................. GTGCCGCGACTCACCAGATAGAAATAACGCAA 20531 29 100.0 32 ............................. GCTACCCCATTTGCACGCTGAGTTTGATTTCT 20470 29 100.0 32 ............................. GAGTAACCACGGTGGCAAAAATATCAGGGGTG 20409 29 100.0 32 ............................. CCACATCCAGCATTTGCTGAGGTGAAATCCAG 20348 29 100.0 32 ............................. TTAACTTCCTGCGTCTGCTTGGGGGAATGGCC 20287 29 100.0 32 ............................. TTAGCGTGTGATTTTCCGTGTTATAGGTTAGC 20226 29 100.0 32 ............................. GCGGGATTGTTCCGTTTGCCCGCGCCACCAGC 20165 29 100.0 32 ............................. GGGCCGACGGTCGCAGTGCTGGACCATTTCAA 20104 29 100.0 32 ............................. AAGGGGACGGCTACGGGACGCCGCCTATTGAC 20043 29 93.1 32 ............T......A......... GAAATAGCTTTTGCTGATCATCACGGTTTAAC 19982 29 100.0 32 ............................. TTTCGCGGTTGCGCAGAGCCGCCGCCGAGGCA 19921 29 100.0 32 ............................. GGTAAAAACACGGTCTGAACCGACATTCATGT 19860 29 96.6 0 ............T................ | ========== ====== ====== ====== ============================= ================================ ================== 18 29 99.4 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GTGCTTGCCGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGCGATAGCGGTCACCGGGGACGCGGCGGATGAGTATGGTCGTCGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGCTCGCGATCTGGCTCCTGGAAGTGCGTGCCGGTGTTTATGTCGGAGATACGTCCAAACGTATTCGGGAGATGATCTGGCAGCAAATCTCTCAACTGGCAGGTTGCGGAAATGTGGTAATGGCCTGGGCGACCAATACCGAGTCGGGTTTTGAATTTCAGACCTGGGGTGAAAATAGACGTATTCCGGTGGATTTGGATGGGGTGCGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAAGTTATCTGTTCTTTAAAAATAAGGAAATGTTTTAATTTAGTTGGTAGATTGTTGATGCGGAATAAATTTGTTTAAAAACAGTTATGTATGCTTAGT # Right flank : GACGCACTGGATGCGATGATGGACATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCCAGCTCCTATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCAGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAATTTGTTGCTTCTACCGAAAGTACGGCAATACCGGCTTTGTCGAAAACTTCGGCGTCATTACAACAGCCAGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACGCCGCTGTTGAAATACAATTTATCGCCAACAA # Questionable array : NO Score: 6.23 # Score Detail : 1:0, 2:3, 3:0, 4:0.97, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [4,5] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.00,-13.50] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [3-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: R [40.0-71.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.92 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 2 47178-46600 **** Predicted by CRISPRDetect 2.4 *** >NZ_LVRP01000124.1 Escherichia coli strain GN04729 GCID_ECOLID_00199_NODE_21.ctg_1, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 47177 29 100.0 32 ............................. TCTGGGCCGGATTCGATCCCGCGCGTACCGGA 47116 29 100.0 32 ............................. TACAATCCCGCATCGAAACTGAATACCCCGAT 47055 29 100.0 32 ............................. TTGCCCAGGCTTTTGCGAAAATTTGTGATTTG 46994 29 100.0 32 ............................. ACACGGGGCAGATTGAGCAGGACTGCGACCTC 46933 29 100.0 32 ............................. GCAGCAGATTTACAAACAGATCGGAGATTACT 46872 29 100.0 32 ............................. CGCGGCCTGCGCTCGCGTAAAATCAGTTGCAG 46811 29 100.0 32 ............................. TCCGCGCAAGCCTATATAAACCAGATTGATCA 46750 29 100.0 32 ............................. CGAAATCGCACGCGCTTCCCGCATGGGGGAAA 46689 29 100.0 32 ............................. TGTAAAAGTCTTGGTTTTGGCGCATCATTTGT 46628 29 96.6 0 ............................A | ========== ====== ====== ====== ============================= ================================ ================== 10 29 99.7 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : TGGATGAACTTTTGGCGACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGATTGTGCATTGAAACCTGCATTGCTCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGATTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGCGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTCGGTATAGGATTACTTTAAATATTTAGCTTTTCAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGACTTAAAAAATCAATAATTAATAATAGGTTATGTTTAGT # Right flank : CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTCGTATTAAGCAATATATCCACGTAACACCTCATGTTCAAAATAGCTCTCCATATATGAGAAGTTCACAATTATCGATACAAAAAATCAAATTTAATTAAAGTGTTAGTTGTATGATACTTAAATCATTAAGAAATTATCATATATTATTTTTTTAATATTGAATTGATGTTTGTTAATTTTTTCTTTAGGATAGTAGTTTGTTTTTTAAGCTTATTATTCATTGATTAAGTAATAAATCTGGAAATTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAATCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTT # Questionable array : NO Score: 6.25 # Score Detail : 1:0, 2:3, 3:0, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [3,6] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.00,-13.50] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [1-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [68.3-76.7]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.65 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //