Array 1 2987-151 **** Predicted by CRISPRDetect 2.4 *** >NZ_QKOE01000009.1 Azoarcus communis SWub3 = DSM 12120 strain SWub3 NODE_9_length_165098_cov_137.769741, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================= ================== 2986 29 100.0 32 ............................. CCAGTTGCACCACCTTGGCATCGGGTTCATTC 2925 29 100.0 32 ............................. TTCCTCCTTGCGCGACCCGAACCCCACGACGA 2864 29 100.0 32 ............................. TCGTGGTCGAATCCCTCGATGGTGCGAATAAT 2803 29 100.0 32 ............................. TCCCCGGTGATATTGAGCGCATCCGAGCTGTC 2742 29 100.0 32 ............................. CCACTCGCCGTGCTCGCTGATCCTCTCGTAGA 2681 29 96.6 32 ............................A TGGGCGACACGATTACTCCGGTGCGTGCTCTG 2620 29 100.0 32 ............................. CCTGGGCCGCCACTTTGCAGAGATTTCCTGCA 2559 29 100.0 32 ............................. AACGTAGGCTGATCCTCGCCGAAGTCGTGCCG 2498 29 100.0 32 ............................. CAACAATCAGACCCTCAAACGGGCGCCTCCAA 2437 29 100.0 32 ............................. CGAGCTACCACGAAACCCTGCGCGGCATCCAG 2376 29 100.0 32 ............................. AGGTACGGCTGCATATCCCTACCGCAAAAACC 2315 29 100.0 32 ............................. TTACAGCAACACCCGGCGCAGGCCACGTCGTA 2254 29 100.0 32 ............................. CTTGACATACATGGCCTTCTGAGCGACGCGCC 2193 29 100.0 32 ............................. CAGGTTGCCGCAATCCAAGCCGCCCAGGACGC 2132 29 100.0 32 ............................. GCGACCCGCGGCGGGGCAATGCACCTGACCGG 2071 29 100.0 32 ............................. AAATGGCCGACCCCGAGCAAGCCGCAGAAGCC 2010 29 100.0 33 ............................. TCACGCTGCCGCCGGCTGGCAGATTTGGCACGA 1948 29 96.6 32 ............................T GCTCACGCAGGGGCGCCGTGGGGGGTTTAGCC 1887 29 100.0 32 ............................. GCATCCCGCGCCACGGGCGCGAGCTCGCACAG 1826 29 100.0 32 ............................. CGCGGGCGGTAGGTCGCGGGGCGGACGAACAG 1765 29 96.6 32 ............................C GGATTCACCTGTACCGGTGATGCTGTCCAGTG 1704 29 100.0 32 ............................. AATTGGTCGAGCATTCAGGGGGTTGCCAAGCA 1643 29 100.0 32 ............................. GCCGGCCTGCTGGCAGGGTGAGGACGGGATGA 1582 29 100.0 32 ............................. CGGGTCAGACTGGCACCATCCGTATCGATATA 1521 29 100.0 32 ............................. CGCCACCAACTACGCCGACAATCGGCTAGTGA 1460 29 100.0 32 ............................. ATCGAGGGCCTGCTCGGCACGCCACAGCGCCT 1399 29 100.0 32 ............................. GTGATCGCGCGCGATCACGGCATGTTTGTCGT 1338 29 100.0 32 ............................. ACGGCGATCATCCCCGCGCTCGCAGAGCTCGA 1277 29 100.0 32 ............................. TCACGCTCCAAACCCCTGTCTTCTAGTGCCGT 1216 29 100.0 32 ............................. TTCCGATGACGAAATCGTAGGCCGACATCGGT 1155 29 100.0 32 ............................. CGCAGCGCCATCATCCGGAGATATCGCCTACG 1094 29 100.0 32 ............................. ACTTCTGCCGTCGCCGCTGTCGTCGATCCGGC 1033 29 100.0 32 ............................. TCGGTTTTGAGCGTTCGCAGGCGCTTGGTCAT 972 29 100.0 32 ............................. TGGGAAATCGCCATCGGGTATGTCTCCGACCG 911 29 100.0 32 ............................. CACGCTCCCCGAGTGGGGGCAGGGAAAGGGTT 850 29 100.0 32 ............................. ATTTTCTCAACTTGCATACTTGCACAGTCCGA 789 29 96.6 32 ............................A GATGGCGATCGGAGGCGCCGCGGCTTTCCATC 728 29 100.0 32 ............................. AACGCCTCTTTCAGCGTATCCCATATCCAGGC 667 29 100.0 32 ............................. CAGTGCTTCCGTGCTGGCGCGCTGGCCGTTGG 606 29 100.0 32 ............................. CAGATTGAACTCCCGCAGTACCGTGCCGGCCT 545 29 100.0 32 ............................. TGGGCAGGCAACTGGGCGGCCGCGCACGCGCT 484 29 100.0 32 ............................. CAGCCGTTCAGACGGTGCATCACGCCTATATG 423 29 100.0 32 ............................. CGGCTCTTTCAGTCGGCCCGAAGCGTTGTAGA 362 29 100.0 32 ............................. TCACGACGGCAGAAAGGGTCGATGCCTGGGCC 301 29 100.0 32 ............................. GTCGCGTGATGTACAGCCATCCCGTACTGACC 240 29 100.0 32 ............................. CAGGCCGCTCTAGAGGCCCTTTTCGAAAAATG 179 29 100.0 0 ............................. | ========== ====== ====== ====== ============================= ================================= ================== 47 29 99.7 32 GTGTTCCCCGCATGCGCGGGGATGAACCG # Left flank : CTGGAACGCATCATCCCCGATATCGAAACCATCCTTTCAGCTGGTGAAATTAGCCGTCCCGACTCATCAGATGGTGTCGTCGGACCCGCCATTCCAAACCCCGAGAACATCGGCGATGCTGGTCATCGTTCTTGAAAATGCCCCACCCCGCCTGCGGGGTCGCCTTGCAGTGTGGCTGCTGGAGGTGCGCGCAGGCGTGTATGTTGGTAACTACTCGAACAAGGTTCGGGAACATATCTGGAAGCAAGTGGAAGAAGGCATCGAAGAAGGTAATGCCGTCATGGTTTGGCGGACTAACAATGAAGCCGGTTTCGAGTTTCTTACCTTGGGAGCGAACCGACGCATGCCTGCAGACTGGGACGGCGTTCGCTTAGTTAGCTTTCACCCCGAACCCAAGCAGGACGCTCCTTAATAATTCGAAAATACCATCTGCATCATGAGTCACCGGTAGAATCTGACGCATCCTTTTTTCCATGTATTTTCATGGAGATGGAGGAAGT # Right flank : GGTCGCCATCTGAGTGCCGAATGTCAGGCGACACCGTAAATTGACCCCCTGGCGACACGAGGAATGAACCCCCTCGTTCGCAAGGAGGCCATCGTTGGAAACGAAGCACATCGAGGTAGATCCTGTACGCCGTGTAGGTTGCGGAGGGCAG # Questionable array : NO Score: 6.25 # Score Detail : 1:0, 2:3, 3:0, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCATGCGCGGGGATGAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTGTTCCCCGCATGCGCGGGGATGAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.10,-12.70] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [1-1] Score: 0/0.41 # AT richness analysis in flanks prediction: R [40.0-56.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0.37,5.14 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 992-2120 **** Predicted by CRISPRDetect 2.4 *** >NZ_QKOE01000029.1 Azoarcus communis SWub3 = DSM 12120 strain SWub3 NODE_29_length_33022_cov_139.531418, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================= ================== 992 29 100.0 32 ............................. CGGGGTGTGCAGGAAGCCGACAAGCAAGCGCA 1053 29 100.0 32 ............................. TAGGCAAGACCCAGCAAACTTTGCAGTGAGGG 1114 29 100.0 32 ............................. GACGTGGAGCGCGATGGCATGGGCGACCAGAT 1175 29 100.0 32 ............................. TGATGGGATTCCATTGGAAAACACTTGGATAC 1236 29 100.0 32 ............................. CGGCCGGGGTGATCGGGCCAGGGCGCGCGGCG 1297 29 100.0 32 ............................. CATGGGGCGCGCATTCTAGCCAAGCCATAGCG 1358 29 100.0 32 ............................. AGCGCATAACCGAGACCACCACGGCTCCCGAT 1419 29 100.0 33 ............................. ATCACCGGCGGGGGCGAGGGGTCGCACCACCTC 1481 29 96.6 32 .C........................... TCCAGCGGCGCGAGCCGCTCGACACGCCTGCA 1542 29 100.0 32 ............................. ACATCTGGCAGCAGGCCCACGACCGCGCGTTT 1603 29 100.0 32 ............................. AGCAAGCCGCGCCCGATACTTATCATCTAGTC 1664 29 100.0 32 ............................. AGCCGATCCCAACGAGAACAACCAAAGGAACC 1725 29 100.0 32 ............................. GGCGCCGGTGCGTAGGCGGGCGACGCCTCAAC 1786 29 100.0 32 ............................. CGCGCCATCGCGGACCTGCTCGAGATCCAGCA 1847 29 100.0 32 ............................. CGCCAGGCTCATTTGCCGCTTTAGAAATTCGG 1908 29 100.0 32 ............................. CTCGCGGCTGCGCGCGCTCTGGTGCTTTCGGC 1969 29 100.0 32 ............................. GTGGGCAAGAGCCGCGCCTACATCTATGGGCG 2030 29 96.6 32 ...........................T. ACCCAGGGCATGATTTTCCCGATTCCGTGCCA 2091 29 89.7 0 ........................GC..C | A [2114] ========== ====== ====== ====== ============================= ================================= ================== 19 29 99.1 32 GTGTTCCCCGCATGCGCGGGGATGAACCG # Left flank : GAAGCGGTCGATCGTGGCTACACGGTGCTCTTCACCTCGGGGGCGGCGCTCATGGCCGGACTCACGAAGGCGCACGCCGACGGAAGGCTTGAGGAGAAACTGCTGCAGATCTCAAAGCCGAAACTGCTGATCATCGATGAACTGGGCTACCTGCCGCTCGAGCCAGCTGCCGCACACCTGTTCTTCCAACTCGTGTCGCGGCGCTATGAGCGGGCCAGCATGCTGATCACCAGCAACCGGCCGGTCGGCGAGTGGGGGCAGGTGTTTGGCGATGCGGTGGCTGCCACCGCCATCCTCGACCGGCTGTTGCATCACAGCCAGGTGGTCACGATTCGGGGCGACAGCTACCGGCTGCGCGACAAGCGCCGCAGCGGCCTTCTGCAGAAGGCCGCTGCCCCCACCCCGATCACATCAGAGAGTTGATTAACCCAGGGGGCCAATTCCAGATGTCGTCAGGGGGTTAAATCTGCGTGTCGCTTGACACCGAACGCTCGAACCAC # Right flank : CGATCAGATGTCGCCAAACCTTTGACGGGCGATACCGTCATTGATACCAATCTGGATCATGGCCTATGCTCCAGTCGAGCGCGGTGCGAAACCGGTAGGGACGTGCCCCTGCGACCGACGGTAGCTACGCCGAGACAGCCGATCAAGTCAGAGTGTTGAAAAACCGCGCTCACTCACCCCATTTCCCATTCGTCAGTTGATTACCTCATCCACTACGACAACCACCTCCGCGCAGCTTGTTCAGCCTTTTCGCGTCGTGCTGCACTAGCCATCTTTCATTCAAAACGTATTCCCGGCATTCGTTGCGACAAACCACATGATGGGACGCTGGCCGTGAAATAGGCATGACCAATTGCGAAGTATGAATTGGTACGAGCTGCCTCCGTGGCACCACGGCGGGGTTGGCACGCCGCCCTACGACCGTACGCGGTCCATGAGGGGACGGGTGACGCGGGGGTCGAGGGGCGGCGCGCCCCTGCCCTCAGAGATTCCCCTTCCCC # Questionable array : NO Score: 6.21 # Score Detail : 1:0, 2:3, 3:0, 4:0.95, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCATGCGCGGGGATGAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [4,5] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCATGCGCGGGGATGAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-5] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [45.0-51.7]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.28,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //