Array 1 33824-33087 **** Predicted by CRISPRDetect 2.4 ***
>NZ_WKMS01000023.1 Parabacteroides distasonis strain BIOML-A16 scaffold23_size95268, whole genome shotgun sequence Array_Orientation: Reverse

  Position Repeat    %id Spacer Repeat_Sequence                                 Spacer_Sequence                Insertion/Deletion
========== ====== ====== ====== =============================================== ============================== ==================
     33823     47   91.5     30 A......C.......................GA.............. TTCTCTGCGTTTTGGTTCAATGCGATTGCA
     33746     47  100.0     30 ............................................... ACAACAAGACTGTACTCGAAGGAGGAGTTA
     33669     47  100.0     30 ............................................... ATTCGCTTAAAGCATGAGCCGATACGTTGA
     33592     47  100.0     30 ............................................... TGGGTTTGATATACATGACACCCAAAAGGA
     33515     47  100.0     29 ............................................... ACACAAAAGCCGGATGCGTTGGCGCAATC
     33439     47  100.0     30 ............................................... CGTCGTTTCTCAGATAAAAATAAAGCGATT
     33362     47  100.0     29 ............................................... GCTTTCGCACAGACTTATTTTGCGGTACA
     33286     47  100.0     29 ............................................... GTTGATTTGATTAAAAGAACAGGGAGACT
     33210     47  100.0     30 ............................................... CGCTTTGGATATAATGAGCCTATAGAGTAC
     33133     47  100.0      0 ............................................... |
========== ====== ====== ====== =============================================== ============================== ==================
        10     47   99.2     30 GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC

# Left flank : TCTAAACGCTTGGAGAATCGATTATTTCGAGGCAAGGTACGACTTGCTTCCAAATTCGAAAATATTTTTTCACAATAATCTATTGAACAACGCATTACAAATTTTCACTGATTTATTCTAGCGAAAGATTTCGTCTTCTGTCAACCTCAGTCGGACAAACAAGCGACTTTTTATCTGAAAACAATAAGCTTTTCATCCGAAAACAATCGGATTTCCGGTCGGAAACAATAAGGATCGCAGTCGAAAACAGCATACAACGCCGTTATAAACACCTTACTATTGGATCATAATCCCTCTACAAAGCCTATTCATACACTAAAAAATGGCGAAATACGCTGTTTCCGCCTTCCTTTTCTCTCTATTTTCGCATTTCCATAAAAAGCCTTTATCAGCTAAAAGTCTCCATTTTGAGCAAAATGACAGAAAAAGAGATATTTTGATGACATAAAAGAGAATGGCTTACTTTTTGGTGTAGCTGCCCCAATATTTTGCAATATTTT
# Right flank : ATATTATCTATTAAAAACTATTAATCAATGGATTACAAAGCACATCAGAATAAAAAAAACAGATATTGTTTCCATTAAAAATCCCGCTTGACAGCGGGATTTTTCTTTTAGAACAACTCCAATTGCTGTCCAGGAGTATTCACTTCCACCGTTTTCTTTCCATAAAAAAGTTCAATATTCCCAAACTGCTTATCCGTAATACACATGATTCCGACATTCCCATGCTCCGGAAGAAAAGATTTAACTCTTTTTACATGTACGGCCGCATTCTCGCTACTAGCACAATGGCGGACATAAATAGAAAACTGAAACATCGTAAATCCATCTCTTTGTAGATTTTTCCTGAAATCCACATACGCCTTTTTCTCCTTCTTCGTCTCGGTAGGCAAATCAAACAACACAAGTACCCACATAACACGATATTCGCTAAACCGATCCATATTACATTTCCGGATAGGAGATTCTACGCAATTCGCCACTAAAGCACTTATACAGAGAAG

# Questionable array : NO Score: 6.22
# Score Detail : 1:0, 2:3, 3:0, 4:0.96, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat : GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: NA Score: 0/4.5
# A,T distribution in repeat prediction: NA [Repeat is AT rich:63.83%AT]
# Reference repeat match prediction: R [matched GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC with 92% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: R [-0.90,-2.10] Score: 0.37/0.37
# Array degeneracy analysis prediction: F [0-4] Score: 0.41/0.41
# AT richness analysis in flanks prediction: F [80.0-66.7]%AT Score: 0.27/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: R [0.68,4.87 Confidence: HIGH]

# Array family : II-C [Matched known repeat from this family],
//

Array 1 93010-92170 **** Predicted by CRISPRDetect 2.4 ***
>NZ_WKMS01000012.1 Parabacteroides distasonis strain BIOML-A16 scaffold12_size129426, whole genome shotgun sequence Array_Orientation: Reverse

  Position Repeat    %id Spacer Repeat_Sequence                   Spacer_Sequence                        Insertion/Deletion
========== ====== ====== ====== ================================= ====================================== ==================
     93009     33   97.0     38 ................................C TGCACGAAAGCGAGCATCGTCAACAAGGCGATTGACTC
     92938     33  100.0     32 ................................. ATATTACTTGAAGATTATGATATTGAGGCGGC
     92873     33   97.0     34 ................................A ACAGCACGCCGCAAGATACGACGGATTACATAAC
     92806     33  100.0     37 ................................. ACAGAACTGTGGACGGTCTGTGCCTCGATATACGAAG
     92736     33  100.0     34 ................................. ATTATGAGATTATTGGAGGAAGTTAATACCACCA
     92669     33  100.0     35 ................................. ATTATGAGATTATTGGAGGAAGTTAATACCACCAA
     92601     33   97.0     34 ................................A GGATACTGAAAACGGAGAATAACAATCATGGCTA
     92534     33   97.0     34 ................................C ACGTGGAACAAAGTAGCGGAAGGTCTTAGGCTTT
     92467     33   97.0     32 ................................A CAATACCCAAAGCGTTTATGAGTATTACCTAA
     92402     33   97.0     34 ................................T TACCATTGGTCGAATACCCGATCTTGACAAGACC
     92335     33   97.0     33 ................................C AAGTCATCCACGTTTTTTTGAAACTCCGGATGA
     92269     33   97.0     34 ................................T TTGGTAAGCTCAACGTAAAACTCAATTTGAGACT
     92202     33   97.0      0 ................................A |
========== ====== ====== ====== ================================= ====================================== ==================
        13     33   97.9     34 GTCGCACCCCGTGTGGGTGCGTGGATTGAAACG

# Left flank : TGATAATTATCCGGTATTTCTAATAAAATGATTTACGATTATGCATATTCTTGTGACTTATGATGTGGATACTACGAGCAAAGAAGGAGCTCGCCGCCTACGACATGTGGCTAAGGCTTGCATAGATTATGGCCAAAGGGTACAGAATTCTGTCTTTGAGTGTGAGGTGACAGAAGCACAATATTGTCTCTTGATTGAACGAATCAAGCGTATTATTGATATGTCTCTTGATAGCGTTAGATTTTATATTCTCAATAAAAACGAGAATAAAAGGGTAAAAGTGATAGGTGTTGAAACTGCTTACAAAGTTAATAATGCTCTTATCATATAATTTATGCGAATGTGGAGTATTACGAAAAAAGTAGTATTTTCGCACCCCTTAATAATTAGCAGATTAACCCGTCTTTAAGGCGATTGCAGCCATTAAGCTGAAATAAAAATGAGAATTCGCATATTAATAGGTCTAATTTATTGACTTATAATATGTAACTTTGCACACT
# Right flank : TTTGTAAAGCGCAAGCTTTTTCATAAAAGATATACTTACGTGATGAAGTATATCTTCTATGGGACTTGAAGTGTTCTTCTGCGACGAAGGGATTAGTACTTCGTAAAAATAACTTTTTAAAGTCACTGACTCTGACATATAATTTCGCTTAGTTATTTCTGCAAAAATCATTCATATGTAAACAAGTTTCGTATATTCGCGACCTTATAATAGTTTAAATATGGCAGAGGAACTACGGATTAAAAACGGTGATAAACAGGAGATGTATGAGACATTGCTCCCGCAAATCGCCTCATTGGTAGGTAACGAGACCGACCTAATCGCTAACATGGCGAACATCGCCGCAGCACTCAAGCAGACTTTCGGTTTCTTTTGGGTAGGTTTCTACCGGGTCATAGACAATCAGTTGGTATTAGCGCCTTTTCAAGGCCCTATCGCCTGTACACGTATAAAATACGGAAAAGGGGTATGCGGCACGGCCTGGAAGGAGGCTCGTACGA

# Questionable array : NO Score: 5.65
# Score Detail : 1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.49,
# Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat : GTCGCACCCCGTGTGGGTGCGTGGATTGAAACG
# Alternate repeat : NA

# Directional analysis summary from each method:
# Motif ATTGAAA(N) match prediction: R Score: 4.5/4.5
# A,T distribution in repeat prediction: F [7,5] Score: 0.37/0.37
# Reference repeat match prediction: R [matched GTCGCACCCTGCGTGGGTGCGTGGATTGAAACA with 94% identity] Score: 4.5/4.5
# Secondary Structural analysis prediction: F [-4.60,-4.20] Score: 0.37/0.37
# Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41
# AT richness analysis in flanks prediction: NA [71.7-73.3]%AT Score: 0/0.27
# Longer leader analysis prediction: NA
# ----------------------------------------------------------------------------
# Final direction: R [0.74,9 Confidence: HIGH]

# Array family : I-C [Matched known repeat from this family],
//