Array 1 1-4186 **** Predicted by CRISPRDetect 2.4 *** >NZ_VVJW01000003.1 Akkermansia muciniphila strain BIOML-A28 scaffold3_size321570, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ================================= =================================== ================== 1 33 100.0 34 ................................. AGAAGGGCCGGGGTTAAACCCGGCTGACTAGCTT 68 33 100.0 34 ................................. CCATGTGTGCACCATGTCCAAATCCTCAAGGGCG 135 33 100.0 34 ................................. TGAAACGTCATAGTTAACGCCCTGCCACATGTGC 202 33 100.0 34 ................................. CCATGTGTGCACCATGTCCAAATCCTCAAGGGCG 269 33 100.0 34 ................................. TGAAACGTCATAGTTAACGCCCTGCCACATGTGC 336 33 100.0 34 ................................. TCCATCTTGCAGTTTGATTTGAGTGATCATGATT 403 33 100.0 34 ................................. AGTGGGTGTGAGGTAGTAGCGGTGGACGAAAAAC 470 33 100.0 34 ................................. CCCGGTAGGTAAAATAGACCAGACCGCCGTTACA 537 33 100.0 33 ................................. TGAGACGTCATAGTTAACGCCCTGCCACATGTG 603 33 100.0 34 ................................. TTTAGTTAGTGGTTTTGTTGTCCGGGGGTTGCCC 670 33 100.0 34 ................................. CGTCACCTATGGGGCGGGGTTGCCGTGTGATAAG 737 33 100.0 34 ................................. CACACATGCACCATGTCCAAATCCTCAAGGGCGG 804 33 100.0 34 ................................. TGGAACAAGGAATCCCTACTAAAATGGTAGGGAT 871 33 100.0 34 ................................. CATGTCATGCCGTTTTCGTTGGTTTTAAGTGGAT 938 33 100.0 34 ................................. TTGTTGGTGGTTTTAGTTTAACTGATTTTATTAT 1005 33 100.0 34 ................................. TCCGGTGGGAAGGATTGCCCAGACCGCCGTGACA 1072 33 100.0 34 ................................. TTCATAATGTTTCCTTTCTTGTTGGTGGTTTTAG 1139 33 100.0 33 ................................. CACATATGCACCATGTCCAAATCCTCAAGGGCG 1205 33 100.0 34 ................................. TCGCATGTTTGTGTAAGTCTCCTTGTGGCTCTTA 1272 33 100.0 34 ................................. CATATCAGGCCGTTTTCGTTGGTTTTCAGCGGAT 1339 33 100.0 34 ................................. ACCGCAAAAGTCACGATAGGAACCGCAATCGGAC 1406 33 100.0 35 ................................. CCGCCTCGTAGGTCTCGCATTCCGCTGGTGTAAGA 1474 33 100.0 34 ................................. CCCGGTGGGAAGGATTGCCCAGACCGCCGTGACA 1541 33 100.0 34 ................................. AACGGCCCCCGTAGGATAAATGACGGTGCTATAT 1608 33 100.0 35 ................................. TATGCGGACTGAGGTAGGCAGGCCGGTAGCATATA 1676 33 100.0 32 ................................. GCGTCCAGAATATAGGCGTAGGCTTCAACATT 1741 33 100.0 35 ................................. CCCGGTAGGCAAGATTGACCACACCGCCGTGACAC 1809 33 100.0 33 ................................. GATTCATGGCAACGGGGGTAAGTGGCTTTTCGC 1875 33 100.0 33 ................................. TGCCAGTAATAGCAAGTGTAAGGGCCGTTTCTT 1941 33 100.0 34 ................................. GCACCGGAATCGGTATATTTCGGACGGCCTGACC 2008 33 100.0 34 ................................. AATATCCCTGATGCTGAATAACAAAACCACTAAA 2075 33 100.0 34 ................................. ATAATTGTTTCCTTTCTTTAGTGGTTTTAGTTTA 2142 33 100.0 34 ................................. CGGACGGCCTGACCCCTCCACCAGCAACGGAATC 2209 33 100.0 34 ................................. TCTTAGCGAAAAGCCGGATTCCTTTCCAGGTCAT 2276 33 100.0 33 ................................. TTGGTAGTCACAACATGAAAGCCGTTATGCCTT 2342 33 100.0 34 ................................. CCTTCCATTCACCCCGGATGACTATTTCCGAGGA 2409 33 100.0 34 ................................. CCTGGGATGTTGCCCCCGCCCTTGCGGATTTGGA 2476 33 100.0 34 ................................. ACCTTCTTGCAATTTGATTTGATTGTTCATTGTT 2543 33 100.0 34 ................................. CCCGGTCTCTTATATAGTGAGGGGAGTTCTTTTG 2610 33 100.0 34 ................................. AAGAAATTGAGGATTCAAACTACCGCCTTTGCGA 2677 33 100.0 34 ................................. TCCTGGGACGTTGCCCCCGCCCTTGCGGATTTGG 2744 33 100.0 34 ................................. CCCGGAGACCAAGGCATCCCATTTCCCTACATGT 2811 33 100.0 34 ................................. CGGTGTGGGAGATATACCGTTTCCGTTGCATAAC 2878 33 100.0 34 ................................. TGCCAGCTTGCGAACCTTTATGGAATTGAGATTG 2945 33 100.0 35 ................................. TCCTATCGAACGCACGTTGCCACCATTGACCGAGC 3013 33 100.0 34 ................................. CCGTGACAGGGCGGGGGCGTCATCATGCGGGTTC 3080 33 100.0 35 ................................. ATCTTCCACACGTGCCAAGTAACTGAGGGCATAGG 3148 33 100.0 33 ................................. CGGCACCGTTTTCGTAGACCGCCCATGTGAGAG 3214 33 100.0 34 ................................. ATCATCCGCAATGTTGAAGGCCTTGCGGGCCGCA 3281 33 100.0 35 ................................. CGTGGGATAGGGTGAATCGTGAATCCAAAGGAATC 3349 33 100.0 34 ................................. TTGCGGAAACAATAGACCGCCCTGTTGACCGCCC 3416 33 100.0 34 ................................. AATATCGTCCATTTGCCGGGTAGTGGTTTGTGAA 3483 33 100.0 34 ................................. ATCGTACATGGGCAAGCCATAATCGGAACAAAAA 3550 33 100.0 35 ................................. CCCGGTGCGGGTGATTTGCGTGTAGGCAACCGGCC 3618 33 100.0 34 ................................. AGCTTCAACGTTGAACGCGCGGCGGGCCGCATAT 3685 33 100.0 33 ................................. AATTTGATTTGATTATTCATTGTTTTATTTCTT 3751 33 100.0 34 ................................. TAATCCAGAAGTTAATGATATCACGGGGACGGGG 3818 33 100.0 34 ................................. CATGACATAGAACGCACGTACCTTCCGTCTAATC 3885 33 100.0 34 ................................. CCCGGTGGGTAATATGGACCATACCGCTGTGACA 3952 33 100.0 33 ................................. TAGAGAGCGAGGCAGGCCGCCCGGTTGCCTATT 4018 33 100.0 34 ................................. CCGGCGTAGGACAATCTCCCATGTGCTATGCGGC 4085 33 100.0 34 ................................. GGAAGCCTCAGCACATCACACATCGCCGCCTGTA GC [4104] 4154 33 97.0 0 .........T....................... | ========== ====== ====== ====== ================================= =================================== ================== 63 33 100.0 34 GTCGCACTCCGCAAGGAGTGCGTGGATTGAAAC # Left flank : | # Right flank : CCTGAATGGTGATTGTTATTTGCGGGTTTCCGCCCGTCGCACTCCTTGCAGAGTGCATGAAGAGGACGACGAGGCGGTGGGCTGGGGCGGCCTGTTGTCCTCTTGCAAGGGGATGGCAACAATGTTGCAGACATAGCACAGGAGAGGTGGTCAAGGGCGTTTTTCCACAATGCTTGATAATTTTTCTACAAAGTGTTGCTACGTTCCAACTCCTTAAAGAATCCAATTTCCAAAAAAATCCAGGATTTTTGAATCTGGAAGGCTTGTCAGTTAATTCCTCTATTTATTTTTCAGAAAGGGGGTGCTGATGCAGGTTGAGCTCCGCAGGAGGAATGGCCTGGAGCAGAGCCAGGATGGGGTCCTGCGGTGCGGCGCCTATTTCCCGCTGGCGTTTGAATTCCTCCTCTTCCTGCTGAAGGGCCGCCAGTTTGGTGTCTTCCACCAGCGCCTTGAATTTGTCCGCCTGAGAGTATTCTCCGTCCTCGTCCCGGAAAACGCGG # Questionable array : NO Score: 9.26 # Score Detail : 1:0, 2:3, 3:3, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTCGCACTCCGCAAGGAGTGCGTGGATTGAAAC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: F Score: 4.5/4.5 # A,T distribution in repeat prediction: F [8,6] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTCGCTCTCCGCAAGGAGGGCGTGGATTGAAAC with 94% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-7.00,-5.90] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-3] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [0.0-0.0]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [10.15,0 Confidence: HIGH] # Array family : I-C [Matched known repeat from this family], // Array 1 281804-284235 **** Predicted by CRISPRDetect 2.4 *** >NZ_VVJW01000002.1 Akkermansia muciniphila strain BIOML-A28 scaffold2_size383866, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== =============================== =================================== ================== 281804 31 100.0 34 ............................... TTTTAGTTTAACTTATTTTATTACCTTGTCAACT 281869 31 100.0 34 ............................... AACGTTTTTGGTAGTCACAATGTGGAATCCGTTG 281934 31 100.0 33 ............................... TATCTTGTTGTCAAATGTTTTTTGTTTTTGTAT 281998 31 100.0 34 ............................... ATCGCAAAAGTCACGATAGGAACCGCAATCGGAT 282063 31 100.0 33 ............................... CTATTGTTTAATGTTTAGTGTTTTATTGTTTGG 282127 31 100.0 33 ............................... CATGTCATGCCGTTTTCGTTGGTTTTCAGTGGA 282191 31 100.0 35 ............................... ATGATTTAGTTTCCTTTCTTTTAGGGGGTTATGCG 282257 31 100.0 34 ............................... AACATTGCGGATGAATAACTTATGATTGTGTCAT 282322 31 100.0 34 ............................... AGCGTCCAGAATATAGGCGTAGGCTTCAACATTT 282387 31 100.0 33 ............................... ATCCTACGGGAGACGTCGACACAATCATGGAGA 282451 31 100.0 34 ............................... AGGAATATGCTGACCTTAAATCCTTCTTCACCCG 282516 31 100.0 34 ............................... ATACACCGCCCATGTGAGAGTTTCTCCAGCGTTT 282581 31 100.0 34 ............................... CATCTTGTTGTTCATAATGTTTTCCTTTCTTTTA 282646 31 100.0 34 ............................... CCGCCTCGTAGGTCTCGCATTCCGCCGGGGTTAA 282711 31 100.0 34 ............................... AACGTAAGCGAAAAGGCCCGCCCCCTCGTAAAAG 282776 31 100.0 34 ............................... TGCGTCATCATCTACGTTAAAGGCCTTGCGGGCC 282841 31 100.0 33 ............................... AAGGCCAAAAGAAGCCAAAAGAATCCAAAGGAA 282905 31 100.0 34 ............................... GCACCGGAATCGGTATATTTCGGACGGCCTGACC 282970 31 100.0 34 ............................... CCGCCTCGTAGGTCTCGCATTCAGCGGGTGTGAG 283035 31 100.0 34 ............................... AAAGCCAAAAGAAGCCAAAAGAAGCCAAAAGAAG 283100 31 100.0 34 ............................... TCCCTACCCTTTTAGTAGGGATTCCTTGTTCCTA 283165 31 100.0 34 ............................... CTACTAAACCTATCGGAATTGTGATTGAAGGCAT 283230 31 100.0 34 ............................... CGTTTCCCTGGTGGGGTAGATAACGGTGGAATAT 283295 31 100.0 34 ............................... AATTTGTATGACACAATCGTAAGTTATTCAGCAT 283360 31 100.0 34 ............................... TGTTTCCCCGGTGGGGTAAATAACGGAGGCATAT 283425 31 100.0 34 ............................... CCGCCGTAGGACAATCTCCCATGTGCTATGCGCC 283490 31 100.0 34 ............................... TATAGCCAATTGTGAGAGTTTCCGGTTTGTAACA 283555 31 100.0 34 ............................... TGCAGCCAGGACTAGGGATAAGATTCCCGCGCCG 283620 31 100.0 34 ............................... AGCATCTAAAATATGTGCGTAAGCTTCGACACGG 283685 31 100.0 34 ............................... ACATTATATCATAAAAATAAGAGAGTTATGAACA 283750 31 100.0 34 ............................... GGTCCCTACAAATTACCAAGTCCCCCTTCTTAAA 283815 31 100.0 34 ............................... TTATCGTCCAACGAAATAATGACGTCCTTCCACG 283880 31 100.0 34 ............................... TACTTGGAGCGGAAACTTTCTACGGCATGGTCGA 283945 31 100.0 34 ............................... CGGAATGCTCGACGGCAATCTTCAACGCGTTATT 284010 31 100.0 34 ............................... TTTTTAGCCGGCGCCGCAACAGGGGCGGCTTCCG 284075 31 100.0 34 ............................... TATTTGTTTGGTTTGTGATTGTGCGGGGGAACAA 284140 31 100.0 34 ............................... CAACAAACCGATACGTCCCCCAAGTTGTTTCTTA 284205 31 90.3 0 .........T...................GT | ========== ====== ====== ====== =============================== =================================== ================== 38 31 99.7 34 GTCGCACCCACACGGGTGCGTGAATTGAAAC # Left flank : ACTGCCCTGTAAGAGCGAGGCAAGCTTCTTCTATTGAGCCTTAAACAGGAGCCGTCCGTAAGGGCGGCTCCTGTTGCATGGCGGCGCAAGTGGAATGCAGGTCAGTTGACTGCCATTAAGGTGGCAGCAAGGCATGTATATAGATCCGGATATTCCTGTTTTCCGGTAGGTGAATTCCTGGTTTTGTGAAGGAATGAGGAAGATGTTTCTGGACGCAGTAAAAAAGAACGGTTAGCATCAATGAGCCAAATTGAGTGCAAGCTGGCTCGGACGTGAAGTAACTGAGGTTTTCTAAGATGATGCCACATCGGATGCTGCGCTTGCGCCAACCTCAAGCTCACAGAAAATTTCCGGGAATCCGGCGCATGCTGTAAGCGATTGAGAAATGTAGCTTGACAGAAAAATTCCCTTTGATCAGGCCTGACCGCATCTGGTTCTTGCATCAGGTTGGCGCAAGACCTTTGTTGCATGTTTGATACTCAAACCGTATTCGTCAGGCC # Right flank : TTTTTATGTATGTTTGACATATTTAAATACGAATCGTTGTTTTATTAAGGGAGGGAGATATATTGAAATAAGGGTATTTGTTGCTGAAGAATGAAAGATGGATTTTTGTGTTTTTTGTTGAATGCAATATTTTGACGGAGGAAGTTGGTTTTTGGCTGATAGAGGAGAAATGACGGGAAGTATTATTGGACGAATATGAAAAGATATTGTCTGGTTTCCGGGTTATGTTTGTGCCTGTTCATGGGGATATTTTCATGTATTCCGGAAGCTCCGGAGCATAAGACGGAAAAAGATGCCTCAGCCATTGTTCTGGATGAGGATTCGGATATAATCTGGAGGAATTTTTCTTTGGGCCGCGTTTCATTCGATGATAAGGCTCCGCATAGTAAAGGTTCCCGCATTTTTCACCGATTGATTCCAGATACGGAGGTTTATATCCGTCAGTTGTCCCGTATTGTTCTCCATACGTTGTATGAGAGTCCTGAAGAATGTATTGTTCC # Questionable array : NO Score: 9.25 # Score Detail : 1:0, 2:3, 3:3, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTCGCACCCACACGGGTGCGTGAATTGAAAC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: F Score: 4.5/4.5 # A,T distribution in repeat prediction: F [8,5] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTCGCACCCTCACGGGTGCGTGGATTGAAAC with 94% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-5.00,-7.70] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-3] Score: 0.41/0.41 # AT richness analysis in flanks prediction: R [51.7-73.3]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [9.78,0.64 Confidence: HIGH] # Array family : I-C [Matched known repeat from this family], // Array 1 547-45 **** Predicted by CRISPRDetect 2.4 *** >NZ_VVJW01000007.1 Akkermansia muciniphila strain BIOML-A28 scaffold7_size118865, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ================================= ================================== ================== 546 33 100.0 34 ................................. AGGAGAATTGTTGTAACTGCTTAATGGTTATACA 479 33 100.0 34 ................................. ACCCGTAGGTAAAATAGACCAGACCGCCGTTACA 412 33 100.0 34 ................................. GATATATTTGCATGGGACGTTGCCCCCGCCCTCG 345 33 100.0 34 ................................. ATCCGCAATGTTAAAGGCTCTACGGGCGGCGTAG 278 33 100.0 34 ................................. CCATGTGTGCACCATGTCCAAATCCTCAAGGGCG 211 33 100.0 34 ................................. TGAAACGTCATAGTTAACGCCTTCCCATGTGTGC 144 33 100.0 34 ................................. CTTTCTTTTAGGGGGTTATGCGGGGGTGATTCCC 77 33 100.0 0 ................................. | ========== ====== ====== ====== ================================= ================================== ================== 8 33 100.0 34 GTCGCACTCCGCAAGGAGTGCGTGGATTGAAAC # Left flank : GCTGTTGTCATGTATATTCTCATTACGTATGATGTAGCTACGGATGATAAGGCCGGGCAGCGGCGGTTGCGGCAAGTTGCCCGAGCCTGTGAAAATGTCGGACAGAGAGTGCAGAATTCCGTATTTGAATGTGAATTGACTCCTGCCCAATTGGTTGACATTAGGAACAAGCTGCTTAAGATTATTGATAACGAGAGTGACAGTCTCAGAATTTATCATATGGGGTCCAATTGGCATCATAAAATAGAACAATTGGGTAAGGAGAAAAGCTATGACATCTCCGGTCCCTTGATTATTTAAAGACTGTGGAACATGGCCTGTGCGCCAACCTCAAGCTCACACGAATTCCCCGGCAGGTCGGCGATTGGTGTAATGCATTGAGAATAGAAGATTGACAGATAAATACTCAGAAAGTGTAACCGTGCAATGACGGTCTTTTTAGGGAGGTTGGCGCAAAGTATGATTTGCTCTGTTGAGTAGTAATGTATAAAGTCGGTTGC # Right flank : CAGAAGGGCCGGGGTTAAACCCGGCTGACTAGCTTGTCGCACTCC # Questionable array : NO Score: 9.26 # Score Detail : 1:0, 2:3, 3:3, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTCGCACTCCGCAAGGAGTGCGTGGATTGAAAC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: R Score: 4.5/4.5 # A,T distribution in repeat prediction: R [6,8] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTCGCTCTCCGCAAGGAGGGCGTGGATTGAAAC with 94% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-5.90,-7.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: R [28.3-56.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,10.01 Confidence: HIGH] # Array family : I-C [Matched known repeat from this family], //