Array 1 94971-97172 **** Predicted by CRISPRDetect 2.4 ***
>NZ_CYYK01000014.1 Parabacteroides distasonis strain 2789STDY5608822, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                                  Spacer_Sequence                  Insertion/Deletion
==========  ======  ======  ======  ===============================================  ===============================  ==================
     94971      47    91.5      30  A......C.......................GA..............  TTCTCTGCGTTTTGGTTCAATGCGATTGCA
     95048      47   100.0      30  ...............................................  TTTGTGCGCTACCAAAAGGCCCTGTCCCTA
     95125      47   100.0      30  ...............................................  TCAGATTGAGTTTATGACCAAGCTATACGA
     95202      47   100.0      30  ...............................................  TTCCTTTTCCACGTTATCGATGTAGTATGT
     95279      47   100.0      30  ...............................................  TCCGTAAGACGGGAGATAGACCGCCTCACG
     95356      47   100.0      30  ...............................................  ACCGTCCCCTTGGAAAGAGTCATGCTTTGT
     95433      47   100.0      31  ...............................................  TTCAGCTTTTTGGATTTTTCCAGTTCCTCTG
     95511      47   100.0      30  ...............................................  TCTCTTTCACTTTTTCCTCTGATAGCTTCG
     95588      47   100.0      30  ...............................................  AGCCGATCCTTCACGCTTACCCACGGGGAT
     95665      47   100.0      29  ...............................................  TCCTTAGAAAGAGATAATACTACACTACC
     95741      47   100.0      30  ...............................................  GCGGCAGTGCCTAAAGCCATTTTAATAGTA
     95818      47   100.0      30  ...............................................  CTGACTCGGGAAACGAGGAATTACAAAAGA
     95895      47   100.0      30  ...............................................  ATAAAGCGGGTTGCGTAGACTTGTCGAATA
     95972      47   100.0      30  ...............................................  TGTTTCTGGCGAAAGGGTCCACGATCACCA
     96049      47   100.0      30  ...............................................  GCAAGTTCTATATCAAAGAATGACTTGGCT
     96126      47   100.0      30  ...............................................  TAACACTGTCCGATATATTCTTGCAAACCA
     96203      47   100.0      30  ...............................................  TACTCAATATTTGATCCGACACATCTATTA
     96280      47   100.0      30  ...............................................  CCCGAGTATCTAGGTATCGTGAGGTTCCAA
     96357      47   100.0      30  ...............................................  TTTCGGATTGTACAAGTCACAGGTAGACGC
     96434      47   100.0      30  ...............................................  AAGGCCGGTCACCTTAGCGATGCCAAGCGC
     96511      47   100.0      29  ...............................................  TTATTTGAGTTGTCGTACTGATGTAATTC
     96587      47   100.0      30  ...............................................  GCAATTTGCGAGAATGACTTGGGGCCAAAA
     96664      47   100.0      30  ...............................................  GATCGCGTTGGCCATTGCGAATCTCAGATG
     96741      47   100.0      30  ...............................................  TTATAGGAAACTCTATAGTATGCTTCATGC
     96818      47   100.0      30  ...............................................  CAACCGACGTAATAATATCGTTCTTTAATC
     96895      47   100.0      30  ...............................................  AGCCGATCCTTCACGCTTACCCACGGGGAT
     96972      47   100.0      30  ...............................................  CATCCTCTATTTCGTTCATTTTAAGCTTCT
     97049      47   100.0      30  ...............................................  CATATATTCCTATTCCTTGTGGATGAAATG
     97126      47   100.0       0  ...............................................  |
==========  ======  ======  ======  ===============================================  ===============================  ==================
        29      47    99.7      30  GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC

# Left flank :   TCTAAACGCTTGGAGAATCGATTATTTCGAGGCAAGGTACGACTTGCTTCCAAATTCGAAAATATTTTTTCACAATAATCTATTGAACAACGCATTACAAATTTTCACTGATTTATTCTAGCGAAAGATTTCGTCTTCTGTCAACCTCAGTCGGACAAACAAGCGACTTTTTATCTGAAAACAATAAGCTTTTCATCCGAAAACAATCGGATTTCCGGTCGGAAACAATAAGGATCGCAGTCGAAAACAGCATACAACGCCGTTATAAACACCTTACTATTGGATCATAATCCCTCTACAAAGCCTATTCATACACTAAAAAATGGCGAAATACGCTGTTTCCGCCTTCCTTTTCTCTCTATTTTCGCATTTCCATAAAAAGCCTTTATCAGCTAAAAGTCTCCATTTTGAGCAAAATGACAGAAAAAGAGATATTTTGATGACATAAAAGAGAATGGCTTACTTTTTGGTGTAGCTGCCCCAATATTTTGCAATATTTT
# Right flank :  CATATTATCTATTAAAAACTATTAATCAATGGATTACAAAGCACATCAGAATAAAAAAAACAGATATTGTTTCCATTAAAAATCCCGCTTGACAGCGGGATTTTTCTTTTAGAACAACTCCAATTGCTGTCCAGGAGTATTCACTTCCACCGTTTTCTTTCCATAAAAAAGTTCAATATTCCCAAACTGCTTATCCGTAATACACATGATTCCGACATTCCCATGCTCCGGAAGAAAAGATTTAACTCTTTTTACATGTACGGCCGCATTCTCGCTACTAGCACAATGGCGGACATAAATAGAAAACTGAAACATCGTAAATCCATCTCTTTGTAGATTTTTCCTGAAATCCACATACGCCTTTTTCTCCTTCTTCGTCTCGGTAGGCAAATCAAACAACACAAGTACCCACATAACACGATATTCGCTAAACCGATCCATATTACATTTCCGGATAGGAGATTCTACGCAATTCGCCACTAAAGCACTTATACAGAGAA

# Questionable array : NO    Score: 6.25
#   Score Detail : 1:0, 2:3, 3:0, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     NA [Repeat is AT rich:63.83%AT]
#   Reference repeat match prediction:         F [matched GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC with 92% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  F [-2.10,-0.90] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      R [4-0] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: R [66.7-80.0]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
#   ----------------------------------------------------------------------------
#   Final direction:         F [4.87,0.68   Confidence: HIGH]

# Array family : II-C [Matched known repeat from this family],
//

Array 1 237911-240526 **** Predicted by CRISPRDetect 2.4 ***
>NZ_CYYK01000007.1 Parabacteroides distasonis strain 2789STDY5608822, whole genome shotgun sequence    Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence                    Spacer_Sequence                      Insertion/Deletion
==========  ======  ======  ======  =================================  ===================================  ==================
    237911      33    97.0      35  ................................T  ATAGTCGCTCCGACCGACTGATAGAAACTACCGGA
    237979      33    97.0      33  ................................G  TCCTGTATTGCCTCAATACTTCCGTTTTCGTCA
    238045      33    97.0      32  ................................G  CGGAAACGGATATATTGCGCAAAGCCGAGGAC
    238110      33    97.0      34  ................................A  TGTGTGTTTTTGTAATTAATAAATCAATGCAAAA
    238177      33   100.0      32  .................................  TTTTTTATCATTTCGGAATCATATTTGACATC
    238242      33    97.0      34  ................................G  TCATCATCAACACGGGCAGCGTGGAGGAGGAGGA
    238309      33    97.0      32  ................................A  TCCTGTATACGGGGTTGCCGGATGTGTCCGTG
    238374      33   100.0      35  .................................  ATTTTTTGCTTACCTTTGCAGTAAACAAAATAGAT
    238442      33    97.0      35  ................................A  TAAAATCACCCACGCACTCACGCACGACAGCCAAG
    238510      33    97.0      32  ................................T  ATCCATGTATTATGAAGGTGGGCTATGGACGA
    238575      33    97.0      32  ................................A  TCCGGCTACGGTGCCCAGATAGGATCATCCGG
    238640      33    97.0      32  ................................T  GATTGAATAAGCCCCCCCTAGGGGTGAAACGT
    238705      33    97.0      33  ................................T  ATCGTTTCCATATCCTTATATTATTAATGTATA
    238771      33   100.0      32  .................................  ATTTTAGGCTCGAGGAACCCAAAGGCGTTCTT
    238836      33   100.0      33  .................................  CGTCTTGCTCTACTTCGCAAAACCTAGAGTCTA
    238902      33   100.0      34  .................................  GTTTACGTGAGGACTTGTTGCAAGTCTTGCAGGA
    238969      33    97.0      33  ................................T  TTCATTGTGATCAAACTTGGTTATTAATAGGTC
    239035      33    97.0      33  ................................T  TGCATGATAGGATAGAGAAATGGATATGTGCGC
    239101      33    97.0      34  ................................T  CCCATATAGAGGATCCCCGCCTTTTTCGTGACCG
    239168      33    97.0      34  ................................A  TAATACATGGATAGAACGCAAGGAGTTCGAGAAT
    239235      33   100.0      34  .................................  ACCCAGACCCAATAAGTAAGATTAGATTTGGTAG
    239302      33   100.0      32  .................................  CGTCAAGTAAGGCCGATTCCGGCCTCACAGTT
    239367      33    97.0      33  ................................T  TGTATTGTGCGAAGATACGGTATTGCCTCAATC
    239433      33   100.0      32  .................................  ATGTACCCGTTGCTTAAATCCGGTGATATAAT
    239498      33   100.0      33  .................................  GTGATGATTATGAAAGATTGATTCGATTGTGTA
    239564      33    97.0      33  ................................A  TTTTAGATATGATGGCCTTGGCGCTCTCGGGCG
    239630      33   100.0      33  .................................  TTGAAAGTAAAGATGCCTGAGCGTTTTGACCGA
    239696      33    97.0      34  ................................A  GACCAATCATCAGAATATACACGTCCATCTGAGC
    239763      33    97.0      35  ................................A  TGTCAATATAGCCGTCCCTGATCTGGACGAGAAAG
    239831      33    97.0      33  ................................T  AATATGTGAGGAGCGTAATACATATCAATAATC
    239897      33    97.0      33  ................................A  AGATCCTCCTTGCCCGATGTTGTCTTATCAATA
    239963      33    97.0      33  ................................T  AGTAGTGTCTACCTCGTAGGTATAAGGAGTGCC
    240029      33    97.0      33  ................................G  TGACAACAGGTAGAATGTCACGAATTACCCCTA
    240095      33    97.0      33  ................................T  CTTATCCTTATCCTCGGCGAAAGGCTCGGAGAA
    240161      33   100.0      34  .................................  ACAAGTGGGAACGGCTTACTACCCATGTCGCAAG
    240228      33   100.0      34  .................................  TCCGAGGGCTATCTCCTTGAACCACTCATATAGG
    240295      33   100.0      33  .................................  TATGCCAAGCATGTGGCGTTAGGGCTTGACTTT
    240361      33    97.0      33  ................................T  CCTCCAGTACCTTTTTAAGCTGATATAGGCTCA
    240427      33    97.0      34  ................................G  AGTACAGTCTTGTTGTTAATACTTTTCCTAGAAG
    240494      33    97.0       0  ................................A  |
==========  ======  ======  ======  =================================  ===================================  ==================
        40      33    98.0      33  GTCGCACCCCGTGTGGGTGCGTGGATTGAAACC

# Left flank :   TGATAATTATCCGGTATTTCTAATAAAATGATTTACGATTATGCATATTCTTGTGACTTATGATGTGGATACTACGAGCAAAGAAGGAGCTCGCCGCCTACGACATGTGGCTAAGGCTTGCATAGATTATGGCCAAAGGGTACAGAATTCTGTCTTTGAGTGTGAGGTGACAGAAGCACAATATTGTCTCTTGATTGAACGAATCAAGCGTATTATTGATATGTCTCTTGATAGCGTTAGATTTTATATTCTCAATAAAAACGAAAATAAAAGGGTAAAAGTGATAGGTGTTGAAACTGCTTACAAAGTTAATAATGCTCTTATCATATAATTTATGCGAATGTGGAGTATTACGAAAAAAGTAGTATTTTCGCACCCCTTAATAATTAGCAGATTAACCCGTCTATAAGGCGATTGCAGCCATTAAGCTGAAATAAAAATGAGAATTCGCATATTAATAGGTCTAATTTATTGACTTATAATATGTAACTTTGCACACT
# Right flank :  ATTTGTAAAGCGCAAGCTTTTTCATAAAAGATATACTTACGTGATGAAGTATATCTTCTATGGGACTTGAAGTGTTCTTCTGCGACGAAGGGATTAGTACTTCGTAAAAATAACTTTTTAAAGTCACTGACACTGACATATAATTTCGCTTAGTTATTTCTGCAAAAATCATTCATATGTAAACAAGTTTCGTATATTCGCGACCTTATAATAGTTTAAATATGGCAGAGGAACTACGGATTAAAAATGGTGATAAACAGGAAATGTATGAGACGTTGCTCCCGCAAATCGCCTCATTGGTAGGTAACGAGACCGACCTAATCGCTAACATGGCGAACATCGCCGCAGCACTCAAGCAGACTTTCGGTTTCTTTTGGGTAGGTTTCTACCGGGTCATAGACAATCAGTTGGTATTAGCGCCTTTTCAAGGCCCTATCGCCTGTACACGAATCAAATACGGAAAAGGGGTATGCGGCACGGCTTGGAAGGAGACCCGTACG

# Questionable array : NO    Score: 5.59
#   Score Detail : 1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.43,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GTCGCACCCCGTGTGGGTGCGTGGATTGAAACC
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         F Score: 4.5/4.5
#   A,T distribution in repeat prediction:     R [5,7] Score: 0.37/0.37
#   Reference repeat match prediction:         F [matched GTCGCACCCTGCGTGGGTGCGTGGATTGAAACA with 94% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-4.20,-4.60] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      NA [0-0] Score: 0/0.41
#   AT richness analysis in flanks prediction: NA [73.3-71.7]%AT Score: 0/0.27
#   Longer leader analysis prediction:         NA
#   ----------------------------------------------------------------------------
#   Final direction:         F [9,0.74   Confidence: HIGH]

# Array family : I-C [Matched known repeat from this family],
//
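
The numbers in the two records above can be cross-checked with simple arithmetic. The Python sketch below is not part of CRISPRDetect; every function and variable name in it is hypothetical. It assumes that the reported Score is the sum of the nine Score Detail components, that each row's Position plus its repeat and spacer lengths gives the next row's Position, and that the array end is the last Position plus the repeat length minus 1. All three relations hold for the values shown in this report, but they are inferred from the data here, not taken from the tool's documentation.

```python
import re

# Score Detail strings copied from the two records in this report.
DETAIL_CONTIG_14 = "1:0, 2:3, 3:0, 4:0.99, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,"
DETAIL_CONTIG_07 = "1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.43,"


def parse_score_detail(detail):
    """Turn 'index:value' pairs into a {index: value} dictionary."""
    pairs = re.findall(r"(\d+):(-?\d+(?:\.\d+)?)", detail)
    return {int(index): float(value) for index, value in pairs}


def total_score(detail):
    """Sum the components (assumption: the Score is their plain sum)."""
    return round(sum(parse_score_detail(detail).values()), 2)


def next_start(position, repeat_len, spacer_len):
    """Expected Position of the following repeat (assumed relation)."""
    return position + repeat_len + spacer_len


def array_end(last_position, repeat_len):
    """Expected inclusive end coordinate of the array (assumed relation)."""
    return last_position + repeat_len - 1


if __name__ == "__main__":
    print(total_score(DETAIL_CONTIG_14))   # compare with "Score: 6.25"
    print(total_score(DETAIL_CONTIG_07))   # compare with "Score: 5.59"
    print(next_start(94971, 47, 30))       # compare with the second row, 95048
    print(array_end(97126, 47))            # compare with the header, 97172
    print(array_end(240494, 33))           # compare with the header, 240526
```

A mismatch in any of these checks would usually point to a transcription or extraction error in the table, not to a problem with the prediction itself.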