Array 1 201453-202213 **** Predicted by CRISPRDetect 2.4 *** >NZ_PJGD01000002.1 Escherichia coli strain 3731 NODE_2_length_316543_cov_107.448, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 201453 29 100.0 32 ............................. GCATTGACGCTTTAAACGACGACGGACGCCAC 201514 29 100.0 32 ............................. AAAACAGCCTTTAGATTAGTACCTGACGACCG 201575 29 100.0 32 ............................. TAAACGCACCTGGCGCGCCACTTTATCAACAA 201636 29 100.0 32 ............................. CGGCTTGTTTAATTGCGTGGAACGTCTCAATT 201697 29 100.0 32 ............................. ACGGCGTGGATTGAGGGACGGGTATTTGGTCC 201758 29 96.6 32 ............................T AGATCGCGCCACGAGGAAACGAATATGAACGG 201819 29 100.0 32 ............................. TGCCGCCAGGCCAGCGACACATCAGACAACTG 201880 29 100.0 32 ............................. AGCGTCAATCAGCGCGTCTATCGCGTCACTTT 201941 29 100.0 32 ............................. ATTTGGGGGTATGAGAGCGCCGAGCCGTTCGG 202002 29 100.0 32 ............................. GCTCCCTGTCAGTTATAATCGATAACGTTGAT 202063 29 100.0 32 ............................. ATGTAGGGGCAATCGAACGATTCTCTGCCGAC 202124 29 100.0 32 ............................. CGCGAGAGCCAGCAAAACGCCAGGGCACAAAA 202185 29 93.1 0 .A..........................A | ========== ====== ====== ====== ============================= ================================ ================== 13 29 99.2 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : TGGATGAACTATTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTACTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA # Right flank : ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACCCCTCATGTTCAAAATAGTTCTCCATGCCAGAGAAGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATGTTGAATTAATATCTATTAATTTTTTCTTTAGGTTAATAGTTTGTTTTTTAAGCTTGTTATTCATTGATTAAGTAATAAATCTGAAAATTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATTACCGTTTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTT # Questionable array : NO Score: 6.22 # Score Detail : 1:0, 2:3, 3:0, 4:0.96, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [75.0-68.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.65,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 2 217730-218430 **** Predicted by CRISPRDetect 2.4 *** >NZ_PJGD01000002.1 Escherichia coli strain 3731 NODE_2_length_316543_cov_107.448, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================= ================== 217730 29 100.0 32 ............................. CGACAAAATTCTCAAAACTCGATCAGGAAAAT 217791 29 100.0 32 ............................. CCACCGTTTTCGCCCACCAGGGCGCACAACCC 217852 29 100.0 32 ............................. GAAAAAGAGAAGGTAGAGAAAGCGGAATCTGG 217913 29 100.0 32 ............................. CAGGTCTATCGGGCGATCAATAAAATCGGTCA 217974 29 100.0 32 ............................. GCGCACCGTTGCGTCGAAAAGGCGCTGGAGAT 218035 29 100.0 32 ............................. TACGCTTACACAACGGGCGAATATTTTAACGG 218096 29 100.0 32 ............................. GAACCCAATAGTGAAATACAGCATCATTTTTT 218157 29 100.0 32 ............................. ACCTGGAGGCGAAAAAGGCGCTTCGACGTAAA 218218 29 100.0 33 ............................. GAGGCCTATATCTCTAACCGCATCGGGCTGCGC 218280 29 100.0 32 ............................. GGGCAAATATAAATTCCAGCGTGCTTCATGAA 218341 29 100.0 32 ............................. CTGCGTAGCGACCTTTGCTCTCAATTTCGTTG 218402 29 100.0 0 ............................. | ========== ====== ====== ====== ============================= ================================= ================== 12 29 100.0 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GTCCTTGCTGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGTGATAGTGGTCACCGGGGGCGCGGCGGATGAGCATGGTCGTGGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGCTCGCAATCTGGCTACTGGAAGTGCGTGCCGGTGTGTATGTTGGTGATACATCAAAACGTATTCGGGAGATGATCTGGCAACAAATTACCCAACTGGCTGGTTGCGGAAATGTGGTGATGGCCTGGGCGACCAATACCGAATCGGGTTTTGAATTTCAGACCTGGGGAGAAAACAGACGTATTCCGGTGGATTTGGATGGGTTACGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAGGTTATGTGTTCTTTAAAAATAAGGAAATGTTTGAATTTAGTTGGTAGATTGTTGATGTGGAATAAATTTGTTTAAAAACAGATATGTATGCTTAGT # Right flank : GGGCGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCCAACCCATCAAGACGCCTTCGCCAACTCTTTCACCAGAGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGCTGTTGATTATCAAACCTGACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCGGTGCCTTTCGGATAGTTTTTATTCAAACCAGGATTGGTCGTCGCGGCTATTCCCTGACTGCGCGCAATTGCCAGTGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAAC # Questionable array : NO Score: 6.26 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: F [73.3-40.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.51,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //