Array 1 118252-122771    **** Predicted by CRISPRDetect 2.4 ***
>NZ_VVKG01000004.1 Akkermansia muciniphila strain BIOML-A38 scaffold4_size504136, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                    Spacer_Sequence                      Insertion/Deletion
==========  ======  ======  ======  =================================  ===================================  ==================
    118252      33   100.0      34  .................................  AGGAGAATTGTTGTAACTGCTTAATGGTTATACA
    118319      33   100.0      34  .................................  ACCCGTAGGTAAAATAGACCAGACCGCCGTTACA
    118386      33   100.0      34  .................................  GATATATTTGCATGGGACGTTGCCCCCGCCCTCG
    118453      33   100.0      34  .................................  ATCCGCAATGTTAAAGGCTCTACGGGCGGCGTAG
    118520      33   100.0      34  .................................  CCATGTGTGCACCATGTCCAAATCCTCAAGGGCG
    118587      33   100.0      34  .................................  TGAAACGTCATAGTTAACGCCTTCCCATGTGTGC
    118654      33   100.0      34  .................................  CTTTCTTTTAGGGGGTTATGCGGGGGTGATTCCC
    118721      33   100.0      34  .................................  AGAAGGGCCGGGGTTAAACCCGGCTGACTAGCTT
    118788      33   100.0      34  .................................  CCATGTGTGCACCATGTCCAAATCCTCAAGGGCG
    118855      33   100.0      34  .................................  TGAAACGTCATAGTTAACGCCCTGCCACATGTGC
    118922      33   100.0      34  .................................  TCCATCTTGCAGTTTGATTTGAGTGATCATGATT
    118989      33   100.0      34  .................................  AGTGGGTGTGAGGTAGTAGCGGTGGACGAAAAAC
    119056      33   100.0      34  .................................  CCCGGTAGGTAAAATAGACCAGACCGCCGTTACA
    119123      33   100.0      33  .................................  TGAGACGTCATAGTTAACGCCCTGCCACATGTG
    119189      33   100.0      34  .................................  TTTAGTTAGTGGTTTTGTTGTCCGGGGGTTGCCC
    119256      33   100.0      34  .................................  CGTCACCTATGGGGCGGGGTTGCCGTGTGATAAG
    119323      33   100.0      34  .................................  CACACATGCACCATGTCCAAATCCTCAAGGGCGG
    119390      33   100.0      34  .................................  TGGAACAAGGAATCCCTACTAAAATGGTAGGGAT
    119457      33   100.0      34  .................................  CATGTCATGCCGTTTTCGTTGGTTTTAAGTGGAT
    119524      33   100.0      34  .................................  TTGTTGGTGGTTTTAGTTTAACTGATTTTATTAT
    119591      33   100.0      34  .................................  TCCGGTGGGAAGGATTGCCCAGACCGCCGTGACA
    119658      33   100.0      34  .................................  TTCATAATGTTTCCTTTCTTGTTGGTGGTTTTAG
    119725      33   100.0      33  .................................  CACATATGCACCATGTCCAAATCCTCAAGGGCG
    119791      33   100.0      34  .................................  TCGCATGTTTGTGTAAGTCTCCTTGTGGCTCTTA
    119858      33   100.0      34  .................................  CATATCAGGCCGTTTTCGTTGGTTTTCAGCGGAT
    119925      33   100.0      34  .................................  ACCGCAAAAGTCACGATAGGAACCGCAATCGGAC
    119992      33   100.0      35  .................................  CCGCCTCGTAGGTCTCGCATTCCGCTGGTGTAAGA
    120060      33   100.0      34  .................................  CCCGGTGGGAAGGATTGCCCAGACCGCCGTGACA
    120127      33   100.0      34  .................................  AACGGCCCCCGTAGGATAAATGACGGTGCTATAT
    120194      33   100.0      34  .................................  CACACATGCACCATGTCCAAATCCTCAAGGGCGG
    120261      33   100.0      32  .................................  GCGTCCAGAATATAGGCGTAGGCTTCAACATT
    120326      33   100.0      35  .................................  CCCGGTAGGCAAGATTGACCACACCGCCGTGACAC
    120394      33   100.0      33  .................................  GATTCATGGCAACGGGGGTAAGTGGCTTTTCGC
    120460      33   100.0      33  .................................  TGCCAGTAATAGCAAGTGTAAGGGCCGTTTCTT
    120526      33   100.0      34  .................................  GCACCGGAATCGGTATATTTCGGACGGCCTGACC
    120593      33   100.0      34  .................................  AATATCCCTGATGCTGAATAACAAAACCACTAAA
    120660      33   100.0      34  .................................  ATAATTGTTTCCTTTCTTTAGTGGTTTTAGTTTA
    120727      33   100.0      34  .................................  CGGACGGCCTGACCCCTCCACCAGCAACGGAATC
    120794      33   100.0      34  .................................  TCTTAGCGAAAAGCCGGATTCCTTTCCAGGTCAT
    120861      33   100.0      33  .................................  TTGGTAGTCACAACATGAAAGCCGTTATGCCTT
    120927      33   100.0      34  .................................  CCTTCCATTCACCCCGGATGACTATTTCCGAGGA
    120994      33   100.0      34  .................................  CCTGGGATGTTGCCCCCGCCCTTGCGGATTTGGA
    121061      33   100.0      34  .................................  ACCTTCTTGCAATTTGATTTGATTGTTCATTGTT
    121128      33   100.0      34  .................................  CCCGGTCTCTTATATAGTGAGGGGAGTTCTTTTG
    121195      33   100.0      34  .................................  AAGAAATTGAGGATTCAAACTACCGCCTTTGCGA
    121262      33   100.0      34  .................................  TCCTGGGACGTTGCCCCCGCCCTTGCGGATTTGG
    121329      33   100.0      34  .................................  CCCGGAGACCAAGGCATCCCATTTCCCTACATGT
    121396      33   100.0      34  .................................  CGGTGTGGGAGATATACCGTTTCCGTTGCATAAC
    121463      33   100.0      34  .................................  TGCCAGCTTGCGAACCTTTATGGAATTGAGATTG
    121530      33   100.0      35  .................................  TCCTATCGAACGCACGTTGCCACCATTGACCGAGC
    121598      33   100.0      34  .................................  CCGTGACAGGGCGGGGGCGTCATCATGCGGGTTC
    121665      33   100.0      35  .................................  ATCTTCCACACGTGCCAAGTAACTGAGGGCATAGG
    121733      33   100.0      33  .................................  CGGCACCGTTTTCGTAGACCGCCCATGTGAGAG
    121799      33   100.0      34  .................................  ATCATCCGCAATGTTGAAGGCCTTGCGGGCCGCA
    121866      33   100.0      35  .................................  CGTGGGATAGGGTGAATCGTGAATCCAAAGGAATC
    121934      33   100.0      34  .................................  TTGCGGAAACAATAGACCGCCCTGTTGACCGCCC
    122001      33   100.0      34  .................................  AATATCGTCCATTTGCCGGGTAGTGGTTTGTGAA
    122068      33   100.0      34  .................................  ATCGTACATGGGCAAGCCATAATCGGAACAAAAA
    122135      33   100.0      35  .................................  CCCGGTGCGGGTGATTTGCGTGTAGGCAACCGGCC
    122203      33   100.0      34  .................................  AGCTTCAACGTTGAACGCGCGGCGGGCCGCATAT
    122270      33   100.0      33  .................................  AATTTGATTTGATTATTCATTGTTTTATTTCTT
    122336      33   100.0      34  .................................  TAATCCAGAAGTTAATGATATCACGGGGACGGGG
    122403      33   100.0      34  .................................  CATGACATAGAACGCACGTACCTTCCGTCTAATC
    122470      33   100.0      34  .................................  CCCGGTGGGTAATATGGACCATACCGCTGTGACA
    122537      33   100.0      33  .................................  TAGAGAGCGAGGCAGGCCGCCCGGTTGCCTATT
    122603      33   100.0      34  .................................  CCGGCGTAGGACAATCTCCCATGTGCTATGCGGC
    122670      33   100.0      34  .................................  GGAAGCCTCAGCACATCACACATCGCCGCCTGTA   GC [122689]
    122739      33    97.0       0  .........T.......................  |
==========  ======  ======  ======  =================================  ===================================  ==================
        68      33   100.0      34  GTCGCACTCCGCAAGGAGTGCGTGGATTGAAAC

# Left flank :   GCTGTTGTCATGTATATTCTCATTACGTATGATGTAGCTACGGATGATAAGGCCGGGCAGCGGCGGTTGCGGCAAGTTGCCCGAGCCTGTGAAAATGTCGGACAGAGAGTGCAGAATTCCGTATTTGAATGTGAATTGACTCCTGCCCAATTGGTTGACATTAGGAACAAGCTGCTTAAGATTATTGATAACGAGAGTGACAGTCTCAGAATTTATCATATGGGGTCCAATTGGCATCATAAAATAGAACAATTGGGTAAGGAGAAAAGCTATGACATCTCCGGTCCCTTGATTATTTAAAGACTGTGGAACATGGCCTGTGCGCCAACCTCAAGCTCACACGAATTCCCCGGCAGGTCGGCGATTGGTGTAATGCATTGAGAATAGAAGATTGACAGATAAATACTCAGAAAGTGTAACCGTGCAATGACGGTCTTTTTAGGGAGGTTGGCGCAAAGTATGATTTGCTCTGTTGAGTAGTAATGTATAAAGTCGGTTGC
# Right flank :  CCTGAATGGTGATTGTTATTTGCGGGTTTCCGCCCGTCGCACTCCTTGCAGAGTGCATGAAGAGGACGACGAGGCGGTGGGCTGGGGCGGCCTGTTGTCCTCTTGCAAGGGGATGGCAACAATGTTGCAGACATAGCACAGGAGAGGTGGTCAAGGGCGTTTTTCCACAATGCTTGATAATTTTTCTACAAAGTGTTGCTACGTTCCAACTCCTTAAAGAATCCAATTTCCAAAAAAATCCAGGATTTTTGAATCTGGAAGGCTTGTCAGTTAATTCCTCTATTTATTTTTCAGAAAGGGGGTGCTGATGCAGGTTGAGCTCCGCAGGAGGAATGGCCTGGAGCAGAGCCAGGATGGGGTCCTGCGGTGCGGCGCCTATTTCCCGCTGGCGTTTGAATTCCTCCTCTTCCTGCTGAAGGGCCGCCAGTTTGGTGTCTTCCACCAGCGCCTTGAATTTGTCCGCCTGAGAGTATTCTCCGTCCTCGTCCCGGAAAACGCGG

# Questionable array : NO    Score: 9.26
#   Score Detail : 1:0, 2:3, 3:3, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GTCGCACTCCGCAAGGAGTGCGTGGATTGAAAC
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         F    Score: 4.5/4.5
#   A,T distribution in repeat prediction:     F [8,6]    Score: 0.37/0.37
#   Reference repeat match prediction:         F [matched GTCGCTCTCCGCAAGGAGGGCGTGGATTGAAAC with 94% identity]    Score: 4.5/4.5
#   Secondary Structural analysis prediction:  F [-7.00,-5.90]    Score: 0.37/0.37
#   Array degeneracy analysis prediction:      F [0-3]    Score: 0.41/0.41
#   AT richness analysis in flanks prediction: NA [56.7-48.3]%AT    Score: 0/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         F [10.15,0   Confidence: HIGH]

# Array family : I-C [Matched known repeat from this family],
//

Array 1 281804-284235    **** Predicted by CRISPRDetect 2.4 ***
>NZ_VVKG01000002.1 Akkermansia muciniphila strain BIOML-A38 scaffold2_size384327, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                  Spacer_Sequence                      Insertion/Deletion
==========  ======  ======  ======  ===============================  ===================================  ==================
    281804      31   100.0      34  ...............................  TTTTAGTTTAACTTATTTTATTACCTTGTCAACT
    281869      31   100.0      34  ...............................  AACGTTTTTGGTAGTCACAATGTGGAATCCGTTG
    281934      31   100.0      33  ...............................  TATCTTGTTGTCAAATGTTTTTTGTTTTTGTAT
    281998      31   100.0      34  ...............................  ATCGCAAAAGTCACGATAGGAACCGCAATCGGAT
    282063      31   100.0      33  ...............................  CTATTGTTTAATGTTTAGTGTTTTATTGTTTGG
    282127      31   100.0      33  ...............................  CATGTCATGCCGTTTTCGTTGGTTTTCAGTGGA
    282191      31   100.0      35  ...............................  ATGATTTAGTTTCCTTTCTTTTAGGGGGTTATGCG
    282257      31   100.0      34  ...............................  AACATTGCGGATGAATAACTTATGATTGTGTCAT
    282322      31   100.0      34  ...............................  AGCGTCCAGAATATAGGCGTAGGCTTCAACATTT
    282387      31   100.0      33  ...............................  ATCCTACGGGAGACGTCGACACAATCATGGAGA
    282451      31   100.0      34  ...............................  AGGAATATGCTGACCTTAAATCCTTCTTCACCCG
    282516      31   100.0      34  ...............................  ATACACCGCCCATGTGAGAGTTTCTCCAGCGTTT
    282581      31   100.0      34  ...............................  CATCTTGTTGTTCATAATGTTTTCCTTTCTTTTA
    282646      31   100.0      34  ...............................  CCGCCTCGTAGGTCTCGCATTCCGCCGGGGTTAA
    282711      31   100.0      34  ...............................  AACGTAAGCGAAAAGGCCCGCCCCCTCGTAAAAG
    282776      31   100.0      34  ...............................  TGCGTCATCATCTACGTTAAAGGCCTTGCGGGCC
    282841      31   100.0      33  ...............................  AAGGCCAAAAGAAGCCAAAAGAATCCAAAGGAA
    282905      31   100.0      34  ...............................  GCACCGGAATCGGTATATTTCGGACGGCCTGACC
    282970      31   100.0      34  ...............................  CCGCCTCGTAGGTCTCGCATTCAGCGGGTGTGAG
    283035      31   100.0      34  ...............................  AAAGCCAAAAGAAGCCAAAAGAAGCCAAAAGAAG
    283100      31   100.0      34  ...............................  TCCCTACCCTTTTAGTAGGGATTCCTTGTTCCTA
    283165      31   100.0      34  ...............................  CTACTAAACCTATCGGAATTGTGATTGAAGGCAT
    283230      31   100.0      34  ...............................  CGTTTCCCTGGTGGGGTAGATAACGGTGGAATAT
    283295      31   100.0      34  ...............................  AATTTGTATGACACAATCGTAAGTTATTCAGCAT
    283360      31   100.0      34  ...............................  TGTTTCCCCGGTGGGGTAAATAACGGAGGCATAT
    283425      31   100.0      34  ...............................  CCGCCGTAGGACAATCTCCCATGTGCTATGCGCC
    283490      31   100.0      34  ...............................  TATAGCCAATTGTGAGAGTTTCCGGTTTGTAACA
    283555      31   100.0      34  ...............................  TGCAGCCAGGACTAGGGATAAGATTCCCGCGCCG
    283620      31   100.0      34  ...............................  AGCATCTAAAATATGTGCGTAAGCTTCGACACGG
    283685      31   100.0      34  ...............................  ACATTATATCATAAAAATAAGAGAGTTATGAACA
    283750      31   100.0      34  ...............................  GGTCCCTACAAATTACCAAGTCCCCCTTCTTAAA
    283815      31   100.0      34  ...............................  TTATCGTCCAACGAAATAATGACGTCCTTCCACG
    283880      31   100.0      34  ...............................  TACTTGGAGCGGAAACTTTCTACGGCATGGTCGA
    283945      31   100.0      34  ...............................  CGGAATGCTCGACGGCAATCTTCAACGCGTTATT
    284010      31   100.0      34  ...............................  TTTTTAGCCGGCGCCGCAACAGGGGCGGCTTCCG
    284075      31   100.0      34  ...............................  TATTTGTTTGGTTTGTGATTGTGCGGGGGAACAA
    284140      31   100.0      34  ...............................  CAACAAACCGATACGTCCCCCAAGTTGTTTCTTA
    284205      31    90.3       0  .........T...................GT  |
==========  ======  ======  ======  ===============================  ===================================  ==================
        38      31    99.7      34  GTCGCACCCACACGGGTGCGTGAATTGAAAC

# Left flank :   ACTGCCCTGTAAGAGCGAGGCAAGCTTCTTCTATTGAGCCTTAAACAGGAGCCGTCCGTAAGGGCGGCTCCTGTTGCATGGCGGCGCAAGTGGAATGCAGGTCAGTTGACTGCCATTAAGGTGGCAGCAAGGCATGTATATAGATCCGGATATTCCTGTTTTCCGGTAGGTGAATTCCTGGTTTTGTGAAGGAATGAGGAAGATGTTTCTGGACGCAGTAAAAAAGAACGGTTAGCATCAATGAGCCAAATTGAGTGCAAGCTGGCTCGGACGTGAAGTAACTGAGGTTTTCTAAGATGATGCCACATCGGATGCTGCGCTTGCGCCAACCTCAAGCTCACAGAAAATTTCCGGGAATCCGGCGCATGCTGTAAGCGATTGAGAAATGTAGCTTGACAGAAAAATTCCCTTTGATCAGGCCTGACCGCATCTGGTTCTTGCATCAGGTTGGCGCAAGACCTTTGTTGCATGTTTGATACTCAAACCGTATTCGTCAGGCC
# Right flank :  TTTTTATGTATGTTTGACATATTTAAATACGAATCGTTGTTTTATTAAGGGAGGGAGATATATTGAAATAAGGGTATTTGTTGCTGAAGAATGAAAGATGGATTTTTGTGTTTTTTGTTGAATGCAATATTTTGACGGAGGAAGTTGGTTTTTGGCTGATAGAGGAGAAATGACGGGAAGTATTATTGGACGAATATGAAAAGATATTGTCTGGTTTCCGGGTTATGTTTGTGCCTGTTCATGGGGATATTTTCATGTATTCCGGAAGCTCCGGAGCATAAGACGGAAAAAGATGCCTCAGCCATTGTTCTGGATGAGGATTCGGATATAATCTGGAGGAATTTTTCTTTGGGCCGCGTTTCATTCGATGATAAGGCTCCGCATAGTAAAGGTTCCCGCATTTTTCACCGATTGATTCCAGATACGGAGGTTTATATCCGTCAGTTGTCCCGTATTGTTCTCCATACGTTGTATGAGAGTCCTGAAGAATGTATTGTTCC

# Questionable array : NO    Score: 9.25
#   Score Detail : 1:0, 2:3, 3:3, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GTCGCACCCACACGGGTGCGTGAATTGAAAC
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         F    Score: 4.5/4.5
#   A,T distribution in repeat prediction:     F [8,5]    Score: 0.37/0.37
#   Reference repeat match prediction:         F [matched GTCGCACCCTCACGGGTGCGTGGATTGAAAC with 94% identity]    Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-5.00,-7.70]    Score: 0.37/0.37
#   Array degeneracy analysis prediction:      F [0-3]    Score: 0.41/0.41
#   AT richness analysis in flanks prediction: R [51.7-73.3]%AT    Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         F [9.78,0.64   Confidence: HIGH]

# Array family : I-C [Matched known repeat from this family],
//