Array 1 118230-122750 **** Predicted by CRISPRDetect 2.4 *** >NZ_VVKI01000002.1 Akkermansia muciniphila strain BIOML-A40 scaffold2_size503871, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ================================= =================================== ================== 118230 33 100.0 34 ................................. AGGAGAATTGTTGTAACTGCTTAATGGTTATACA 118297 33 100.0 34 ................................. ACCCGTAGGTAAAATAGACCAGACCGCCGTTACA 118364 33 100.0 34 ................................. GATATATTTGCATGGGACGTTGCCCCCGCCCTCG 118431 33 100.0 34 ................................. ATCCGCAATGTTAAAGGCTCTACGGGCGGCGTAG 118498 33 100.0 34 ................................. CCATGTGTGCACCATGTCCAAATCCTCAAGGGCG 118565 33 100.0 34 ................................. TGAAACGTCATAGTTAACGCCTTCCCATGTGTGC 118632 33 100.0 34 ................................. CTTTCTTTTAGGGGGTTATGCGGGGGTGATTCCC 118699 33 100.0 34 ................................. AGAAGGGCCGGGGTTAAACCCGGCTGACTAGCTT 118766 33 100.0 34 ................................. CCATGTGTGCACCATGTCCAAATCCTCAAGGGCG 118833 33 100.0 34 ................................. TGAAACGTCATAGTTAACGCCCTGCCACATGTGC 118900 33 100.0 34 ................................. TCCATCTTGCAGTTTGATTTGAGTGATCATGATT 118967 33 100.0 34 ................................. AGTGGGTGTGAGGTAGTAGCGGTGGACGAAAAAC 119034 33 100.0 34 ................................. CCCGGTAGGTAAAATAGACCAGACCGCCGTTACA 119101 33 100.0 33 ................................. TGAGACGTCATAGTTAACGCCCTGCCACATGTG 119167 33 100.0 34 ................................. TTTAGTTAGTGGTTTTGTTGTCCGGGGGTTGCCC 119234 33 100.0 34 ................................. CGTCACCTATGGGGCGGGGTTGCCGTGTGATAAG 119301 33 100.0 34 ................................. CACACATGCACCATGTCCAAATCCTCAAGGGCGG 119368 33 100.0 34 ................................. TGGAACAAGGAATCCCTACTAAAATGGTAGGGAT 119435 33 100.0 34 ................................. CATGTCATGCCGTTTTCGTTGGTTTTAAGTGGAT 119502 33 100.0 34 ................................. TTGTTGGTGGTTTTAGTTTAACTGATTTTATTAT 119569 33 100.0 34 ................................. TCCGGTGGGAAGGATTGCCCAGACCGCCGTGACA 119636 33 100.0 34 ................................. TTCATAATGTTTCCTTTCTTGTTGGTGGTTTTAG 119703 33 100.0 33 ................................. CACATATGCACCATGTCCAAATCCTCAAGGGCG 119769 33 100.0 34 ................................. TCGCATGTTTGTGTAAGTCTCCTTGTGGCTCTTA 119836 33 100.0 34 ................................. CATATCAGGCCGTTTTCGTTGGTTTTCAGCGGAT 119903 33 100.0 34 ................................. ACCGCAAAAGTCACGATAGGAACCGCAATCGGAC 119970 33 100.0 35 ................................. CCGCCTCGTAGGTCTCGCATTCCGCTGGTGTAAGA 120038 33 100.0 34 ................................. CCCGGTGGGAAGGATTGCCCAGACCGCCGTGACA 120105 33 100.0 34 ................................. AACGGCCCCCGTAGGATAAATGACGGTGCTATAT 120172 33 100.0 35 ................................. TATGCGGACTGAGGTAGGCAGGCCGGTAGCATATA 120240 33 100.0 32 ................................. GCGTCCAGAATATAGGCGTAGGCTTCAACATT 120305 33 100.0 35 ................................. CCCGGTAGGCAAGATTGACCACACCGCCGTGACAC 120373 33 100.0 33 ................................. GATTCATGGCAACGGGGGTAAGTGGCTTTTCGC 120439 33 100.0 33 ................................. TGCCAGTAATAGCAAGTGTAAGGGCCGTTTCTT 120505 33 100.0 34 ................................. GCACCGGAATCGGTATATTTCGGACGGCCTGACC 120572 33 100.0 34 ................................. AATATCCCTGATGCTGAATAACAAAACCACTAAA 120639 33 100.0 34 ................................. ATAATTGTTTCCTTTCTTTAGTGGTTTTAGTTTA 120706 33 100.0 34 ................................. CGGACGGCCTGACCCCTCCACCAGCAACGGAATC 120773 33 100.0 34 ................................. TCTTAGCGAAAAGCCGGATTCCTTTCCAGGTCAT 120840 33 100.0 33 ................................. TTGGTAGTCACAACATGAAAGCCGTTATGCCTT 120906 33 100.0 34 ................................. CCTTCCATTCACCCCGGATGACTATTTCCGAGGA 120973 33 100.0 34 ................................. CCTGGGATGTTGCCCCCGCCCTTGCGGATTTGGA 121040 33 100.0 34 ................................. ACCTTCTTGCAATTTGATTTGATTGTTCATTGTT 121107 33 100.0 34 ................................. CCCGGTCTCTTATATAGTGAGGGGAGTTCTTTTG 121174 33 100.0 34 ................................. AAGAAATTGAGGATTCAAACTACCGCCTTTGCGA 121241 33 100.0 34 ................................. TCCTGGGACGTTGCCCCCGCCCTTGCGGATTTGG 121308 33 100.0 34 ................................. CCCGGAGACCAAGGCATCCCATTTCCCTACATGT 121375 33 100.0 34 ................................. CGGTGTGGGAGATATACCGTTTCCGTTGCATAAC 121442 33 100.0 34 ................................. TGCCAGCTTGCGAACCTTTATGGAATTGAGATTG 121509 33 100.0 35 ................................. TCCTATCGAACGCACGTTGCCACCATTGACCGAGC 121577 33 100.0 34 ................................. CCGTGACAGGGCGGGGGCGTCATCATGCGGGTTC 121644 33 100.0 35 ................................. ATCTTCCACACGTGCCAAGTAACTGAGGGCATAGG 121712 33 100.0 33 ................................. CGGCACCGTTTTCGTAGACCGCCCATGTGAGAG 121778 33 100.0 34 ................................. ATCATCCGCAATGTTGAAGGCCTTGCGGGCCGCA 121845 33 100.0 35 ................................. CGTGGGATAGGGTGAATCGTGAATCCAAAGGAATC 121913 33 100.0 34 ................................. TTGCGGAAACAATAGACCGCCCTGTTGACCGCCC 121980 33 100.0 34 ................................. AATATCGTCCATTTGCCGGGTAGTGGTTTGTGAA 122047 33 100.0 34 ................................. ATCGTACATGGGCAAGCCATAATCGGAACAAAAA 122114 33 100.0 35 ................................. CCCGGTGCGGGTGATTTGCGTGTAGGCAACCGGCC 122182 33 100.0 34 ................................. AGCTTCAACGTTGAACGCGCGGCGGGCCGCATAT 122249 33 100.0 33 ................................. AATTTGATTTGATTATTCATTGTTTTATTTCTT 122315 33 100.0 34 ................................. TAATCCAGAAGTTAATGATATCACGGGGACGGGG 122382 33 100.0 34 ................................. CATGACATAGAACGCACGTACCTTCCGTCTAATC 122449 33 100.0 34 ................................. CCCGGTGGGTAATATGGACCATACCGCTGTGACA 122516 33 100.0 33 ................................. TAGAGAGCGAGGCAGGCCGCCCGGTTGCCTATT 122582 33 100.0 34 ................................. CCGGCGTAGGACAATCTCCCATGTGCTATGCGGC 122649 33 100.0 34 ................................. GGAAGCCTCAGCACATCACACATCGCCGCCTGTA GC [122668] 122718 33 97.0 0 .........T....................... | ========== ====== ====== ====== ================================= =================================== ================== 68 33 100.0 34 GTCGCACTCCGCAAGGAGTGCGTGGATTGAAAC # Left flank : GCTGTTGTCATGTATATTCTCATTACGTATGATGTAGCTACGGATGATAAGGCCGGGCAGCGGCGGTTGCGGCAAGTTGCCCGAGCCTGTGAAAATGTCGGACAGAGAGTGCAGAATTCCGTATTTGAATGTGAATTGACTCCTGCCCAATTGGTTGACATTAGGAACAAGCTGCTTAAGATTATTGATAACGAGAGTGACAGTCTCAGAATTTATCATATGGGGTCCAATTGGCATCATAAAATAGAACAATTGGGTAAGGAGAAAAGCTATGACATCTCCGGTCCCTTGATTATTTAAAGACTGTGGAACATGGCCTGTGCGCCAACCTCAAGCTCACACGAATTCCCCGGCAGGTCGGCGATTGGTGTAATGCATTGAGAATAGAAGATTGACAGATAAATACTCAGAAAGTGTAACCGTGCAATGACGGTCTTTTTAGGGAGGTTGGCGCAAAGTATGATTTGCTCTGTTGAGTAGTAATGTATAAAGTCGGTTGC # Right flank : CCTGAATGGTGATTGTTATTTGCGGGTTTCCGCCCGTCGCACTCCTTGCAGAGTGCATGAAGAGGACGACGAGGCGGTGGGCTGGGGCGGCCTGTTGTCCTCTTGCAAGGGGATGGCAACAATGTTGCAGACATAGCACAGGAGAGGTGGTCAAGGGCGTTTTTCCACAATGCTTGATAATTTTTCTACAAAGTGTTGCTACGTTCCAACTCCTTAAAGAATCCAATTTCCAAAAAAATCCAGGATTTTTGAATCTGGAAGGCTTGTCAGTTAATTCCTCTATTTATTTTTCAGAAAGGGGGTGCTGATGCAGGTTGAGCTCCGCAGGAGGAATGGCCTGGAGCAGAGCCAGGATGGGGTCCTGCGGTGCGGCGCCTATTTCCCGCTGGCGTTTGAATTCCTCCTCTTCCTGCTGAAGGGCCGCCAGTTTGGTGTCTTCCACCAGCGCCTTGAATTTGTCCGCCTGAGAGTATTCTCCGTCCTCGTCCCGGAAAACGCGG # Questionable array : NO Score: 9.26 # Score Detail : 1:0, 2:3, 3:3, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTCGCACTCCGCAAGGAGTGCGTGGATTGAAAC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: F Score: 4.5/4.5 # A,T distribution in repeat prediction: F [8,6] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTCGCTCTCCGCAAGGAGGGCGTGGATTGAAAC with 94% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-7.00,-5.90] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-3] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [56.7-48.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [10.15,0 Confidence: HIGH] # Array family : I-C [Matched known repeat from this family], // Array 1 281804-284235 **** Predicted by CRISPRDetect 2.4 *** >NZ_VVKI01000003.1 Akkermansia muciniphila strain BIOML-A40 scaffold3_size383866, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== =============================== =================================== ================== 281804 31 100.0 34 ............................... TTTTAGTTTAACTTATTTTATTACCTTGTCAACT 281869 31 100.0 34 ............................... AACGTTTTTGGTAGTCACAATGTGGAATCCGTTG 281934 31 100.0 33 ............................... TATCTTGTTGTCAAATGTTTTTTGTTTTTGTAT 281998 31 100.0 34 ............................... ATCGCAAAAGTCACGATAGGAACCGCAATCGGAT 282063 31 100.0 33 ............................... CTATTGTTTAATGTTTAGTGTTTTATTGTTTGG 282127 31 100.0 33 ............................... CATGTCATGCCGTTTTCGTTGGTTTTCAGTGGA 282191 31 100.0 35 ............................... ATGATTTAGTTTCCTTTCTTTTAGGGGGTTATGCG 282257 31 100.0 34 ............................... AACATTGCGGATGAATAACTTATGATTGTGTCAT 282322 31 100.0 34 ............................... AGCGTCCAGAATATAGGCGTAGGCTTCAACATTT 282387 31 100.0 33 ............................... ATCCTACGGGAGACGTCGACACAATCATGGAGA 282451 31 100.0 34 ............................... AGGAATATGCTGACCTTAAATCCTTCTTCACCCG 282516 31 100.0 34 ............................... ATACACCGCCCATGTGAGAGTTTCTCCAGCGTTT 282581 31 100.0 34 ............................... CATCTTGTTGTTCATAATGTTTTCCTTTCTTTTA 282646 31 100.0 34 ............................... CCGCCTCGTAGGTCTCGCATTCCGCCGGGGTTAA 282711 31 100.0 34 ............................... AACGTAAGCGAAAAGGCCCGCCCCCTCGTAAAAG 282776 31 100.0 34 ............................... TGCGTCATCATCTACGTTAAAGGCCTTGCGGGCC 282841 31 100.0 33 ............................... AAGGCCAAAAGAAGCCAAAAGAATCCAAAGGAA 282905 31 100.0 34 ............................... GCACCGGAATCGGTATATTTCGGACGGCCTGACC 282970 31 100.0 34 ............................... CCGCCTCGTAGGTCTCGCATTCAGCGGGTGTGAG 283035 31 100.0 34 ............................... AAAGCCAAAAGAAGCCAAAAGAAGCCAAAAGAAG 283100 31 100.0 34 ............................... TCCCTACCCTTTTAGTAGGGATTCCTTGTTCCTA 283165 31 100.0 34 ............................... CTACTAAACCTATCGGAATTGTGATTGAAGGCAT 283230 31 100.0 34 ............................... CGTTTCCCTGGTGGGGTAGATAACGGTGGAATAT 283295 31 100.0 34 ............................... AATTTGTATGACACAATCGTAAGTTATTCAGCAT 283360 31 100.0 34 ............................... TGTTTCCCCGGTGGGGTAAATAACGGAGGCATAT 283425 31 100.0 34 ............................... CCGCCGTAGGACAATCTCCCATGTGCTATGCGCC 283490 31 100.0 34 ............................... TATAGCCAATTGTGAGAGTTTCCGGTTTGTAACA 283555 31 100.0 34 ............................... TGCAGCCAGGACTAGGGATAAGATTCCCGCGCCG 283620 31 100.0 34 ............................... AGCATCTAAAATATGTGCGTAAGCTTCGACACGG 283685 31 100.0 34 ............................... ACATTATATCATAAAAATAAGAGAGTTATGAACA 283750 31 100.0 34 ............................... GGTCCCTACAAATTACCAAGTCCCCCTTCTTAAA 283815 31 100.0 34 ............................... TTATCGTCCAACGAAATAATGACGTCCTTCCACG 283880 31 100.0 34 ............................... TACTTGGAGCGGAAACTTTCTACGGCATGGTCGA 283945 31 100.0 34 ............................... CGGAATGCTCGACGGCAATCTTCAACGCGTTATT 284010 31 100.0 34 ............................... TTTTTAGCCGGCGCCGCAACAGGGGCGGCTTCCG 284075 31 100.0 34 ............................... TATTTGTTTGGTTTGTGATTGTGCGGGGGAACAA 284140 31 100.0 34 ............................... CAACAAACCGATACGTCCCCCAAGTTGTTTCTTA 284205 31 90.3 0 .........T...................GT | ========== ====== ====== ====== =============================== =================================== ================== 38 31 99.7 34 GTCGCACCCACACGGGTGCGTGAATTGAAAC # Left flank : ACTGCCCTGTAAGAGCGAGGCAAGCTTCTTCTATTGAGCCTTAAACAGGAGCCGTCCGTAAGGGCGGCTCCTGTTGCATGGCGGCGCAAGTGGAATGCAGGTCAGTTGACTGCCATTAAGGTGGCAGCAAGGCATGTATATAGATCCGGATATTCCTGTTTTCCGGTAGGTGAATTCCTGGTTTTGTGAAGGAATGAGGAAGATGTTTCTGGACGCAGTAAAAAAGAACGGTTAGCATCAATGAGCCAAATTGAGTGCAAGCTGGCTCGGACGTGAAGTAACTGAGGTTTTCTAAGATGATGCCACATCGGATGCTGCGCTTGCGCCAACCTCAAGCTCACAGAAAATTTCCGGGAATCCGGCGCATGCTGTAAGCGATTGAGAAATGTAGCTTGACAGAAAAATTCCCTTTGATCAGGCCTGACCGCATCTGGTTCTTGCATCAGGTTGGCGCAAGACCTTTGTTGCATGTTTGATACTCAAACCGTATTCGTCAGGCC # Right flank : TTTTTATGTATGTTTGACATATTTAAATACGAATCGTTGTTTTATTAAGGGAGGGAGATATATTGAAATAAGGGTATTTGTTGCTGAAGAATGAAAGATGGATTTTTGTGTTTTTTGTTGAATGCAATATTTTGACGGAGGAAGTTGGTTTTTGGCTGATAGAGGAGAAATGACGGGAAGTATTATTGGACGAATATGAAAAGATATTGTCTGGTTTCCGGGTTATGTTTGTGCCTGTTCATGGGGATATTTTCATGTATTCCGGAAGCTCCGGAGCATAAGACGGAAAAAGATGCCTCAGCCATTGTTCTGGATGAGGATTCGGATATAATCTGGAGGAATTTTTCTTTGGGCCGCGTTTCATTCGATGATAAGGCTCCGCATAGTAAAGGTTCCCGCATTTTTCACCGATTGATTCCAGATACGGAGGTTTATATCCGTCAGTTGTCCCGTATTGTTCTCCATACGTTGTATGAGAGTCCTGAAGAATGTATTGTTCC # Questionable array : NO Score: 9.25 # Score Detail : 1:0, 2:3, 3:3, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTCGCACCCACACGGGTGCGTGAATTGAAAC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: F Score: 4.5/4.5 # A,T distribution in repeat prediction: F [8,5] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTCGCACCCTCACGGGTGCGTGGATTGAAAC with 94% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-5.00,-7.70] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-3] Score: 0.41/0.41 # AT richness analysis in flanks prediction: R [51.7-73.3]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [9.78,0.64 Confidence: HIGH] # Array family : I-C [Matched known repeat from this family], //