Array 1 110813-110076  **** Predicted by CRISPRDetect 2.4 ***
>NZ_WKMV01000011.1 Parabacteroides distasonis strain BIOML-A13 scaffold11_size235621, whole genome shotgun sequence  Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                                  Spacer_Sequence                 Insertion/Deletion
==========  ======  ======  ======  ===============================================  ==============================  ==================
    110812      47    91.5      30  A......C.......................GA..............  TTCTCTGCGTTTTGGTTCAATGCGATTGCA
    110735      47   100.0      30  ...............................................  ACAACAAGACTGTACTCGAAGGAGGAGTTA
    110658      47   100.0      30  ...............................................  ATTCGCTTAAAGCATGAGCCGATACGTTGA
    110581      47   100.0      30  ...............................................  TGGGTTTGATATACATGACACCCAAAAGGA
    110504      47   100.0      29  ...............................................  ACACAAAAGCCGGATGCGTTGGCGCAATC
    110428      47   100.0      30  ...............................................  CGTCGTTTCTCAGATAAAAATAAAGCGATT
    110351      47   100.0      29  ...............................................  GCTTTCGCACAGACTTATTTTGCGGTACA
    110275      47   100.0      29  ...............................................  GTTGATTTGATTAAAAGAACAGGGAGACT
    110199      47   100.0      30  ...............................................  CGCTTTGGATATAATGAGCCTATAGAGTAC
    110122      47   100.0       0  ...............................................
==========  ======  ======  ======  ===============================================  ==============================  ==================
        10      47    99.2      30  GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC

# Left flank :   TCTAAACGCTTGGAGAATCGATTATTTCGAGGCAAGGTACGACTTGCTTCCAAATTCGAAAATATTTTTTCACAATAATCTATTGAACAACGCATTACAAATTTTCACTGATTTATTCTAGCGAAAGATTTCGTCTTCTGTCAACCTCAGTCGGACAAACAAGCGACTTTTTATCTGAAAACAATAAGCTTTTCATCCGAAAACAATCGGATTTCCGGTCGGAAACAATAAGGATCGCAGTCGAAAACAGCATACAACGCCGTTATAAACACCTTACTATTGGATCATAATCCCTCTACAAAGCCTATTCATACACTAAAAAATGGCGAAATACGCTGTTTCCGCCTTCCTTTTCTCTCTATTTTCGCATTTCCATAAAAAGCCTTTATCAGCTAAAAGTCTCCATTTTGAGCAAAATGACAGAAAAAGAGATATTTTGATGACATAAAAGAGAATGGCTTACTTTTTGGTGTAGCTGCCCCAATATTTTGCAATATTTT
# Right flank :  ATATTATCTATTAAAAACTATTAATCAATGGATTACAAAGCACATCAGAATAAAAAAAACAGATATTGTTTCCATTAAAAATCCCGCTTGACAGCGGGATTTTTCTTTTAGAACAACTCCAATTGCTGTCCAGGAGTATTCACTTCCACCGTTTTCTTTCCATAAAAAAGTTCAATATTCCCAAACTGCTTATCCGTAATACACATGATTCCGACATTCCCATGCTCCGGAAGAAAAGATTTAACTCTTTTTACATGTACGGCCGCATTCTCGCTACTAGCACAATGGCGGACATAAATAGAAAACTGAAACATCGTAAATCCATCTCTTTGTAGATTTTTCCTGAAATCCACATACGCCTTTTTCTCCTTCTTCGTCTCGGTAGGCAAATCAAACAACACAAGTACCCACATAACACGATATTCGCTAAACCGATCCATATTACATTTCCGGATAGGAGATTCTACGCAATTCGCCACTAAAGCACTTATACAGAGAAG

# Questionable array : NO  Score: 6.22
#   Score Detail : 1:0, 2:3, 3:0, 4:0.96, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC
# Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     NA [Repeat is AT rich:63.83%AT]
#   Reference repeat match prediction:         R [matched GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC with 92% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-0.90,-2.10] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      F [0-4] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: F [80.0-66.7]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0.68,4.87 Confidence: HIGH]

# Array family : II-C [Matched known repeat from this family],
//

Array 1 93010-92170  **** Predicted by CRISPRDetect 2.4 ***
>NZ_WKMV01000016.1 Parabacteroides distasonis strain BIOML-A13 scaffold16_size129426, whole genome shotgun sequence  Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                    Spacer_Sequence                         Insertion/Deletion
==========  ======  ======  ======  =================================  ======================================  ==================
     93009      33    97.0      38  ................................C  TGCACGAAAGCGAGCATCGTCAACAAGGCGATTGACTC
     92938      33    97.0      32  ................................G  ATATTACTTGAAGATTATGATATTGAGGCGGC
     92873      33   100.0      34  .................................  ACAGCACGCCGCAAGATACGACGGATTACATAAC
     92806      33    97.0      37  ................................G  ACAGAACTGTGGACGGTCTGTGCCTCGATATACGAAG
     92736      33    97.0      34  ................................G  ATTATGAGATTATTGGAGGAAGTTAATACCACCA
     92669      33    97.0      35  ................................G  ATTATGAGATTATTGGAGGAAGTTAATACCACCAA
     92601      33   100.0      34  .................................  GGATACTGAAAACGGAGAATAACAATCATGGCTA
     92534      33    97.0      34  ................................C  ACGTGGAACAAAGTAGCGGAAGGTCTTAGGCTTT
     92467      33   100.0      32  .................................  CAATACCCAAAGCGTTTATGAGTATTACCTAA
     92402      33    97.0      34  ................................T  TACCATTGGTCGAATACCCGATCTTGACAAGACC
     92335      33    97.0      33  ................................C  AAGTCATCCACGTTTTTTTGAAACTCCGGATGA
     92269      33    97.0      34  ................................T  TTGGTAAGCTCAACGTAAAACTCAATTTGAGACT
     92202      33   100.0       0  .................................
==========  ======  ======  ======  =================================  ======================================  ==================
        13      33    97.9      34  GTCGCACCCCGTGTGGGTGCGTGGATTGAAACA

# Left flank :   TGATAATTATCCGGTATTTCTAATAAAATGATTTACGATTATGCATATTCTTGTGACTTATGATGTGGATACTACGAGCAAAGAAGGAGCTCGCCGCCTACGACATGTGGCTAAGGCTTGCATAGATTATGGCCAAAGGGTACAGAATTCTGTCTTTGAGTGTGAGGTGACAGAAGCACAATATTGTCTCTTGATTGAACGAATCAAGCGTATTATTGATATGTCTCTTGATAGCGTTAGATTTTATATTCTCAATAAAAACGAGAATAAAAGGGTAAAAGTGATAGGTGTTGAAACTGCTTACAAAGTTAATAATGCTCTTATCATATAATTTATGCGAATGTGGAGTATTACGAAAAAAGTAGTATTTTCGCACCCCTTAATAATTAGCAGATTAACCCGTCTTTAAGGCGATTGCAGCCATTAAGCTGAAATAAAAATGAGAATTCGCATATTAATAGGTCTAATTTATTGACTTATAATATGTAACTTTGCACACT
# Right flank :  TTTGTAAAGCGCAAGCTTTTTCATAAAAGATATACTTACGTGATGAAGTATATCTTCTATGGGACTTGAAGTGTTCTTCTGCGACGAAGGGATTAGTACTTCGTAAAAATAACTTTTTAAAGTCACTGACTCTGACATATAATTTCGCTTAGTTATTTCTGCAAAAATCATTCATATGTAAACAAGTTTCGTATATTCGCGACCTTATAATAGTTTAAATATGGCAGAGGAACTACGGATTAAAAACGGTGATAAACAGGAGATGTATGAGACATTGCTCCCGCAAATCGCCTCATTGGTAGGTAACGAGACCGACCTAATCGCTAACATGGCGAACATCGCCGCAGCACTCAAGCAGACTTTCGGTTTCTTTTGGGTAGGTTTCTACCGGGTCATAGACAATCAGTTGGTATTAGCGCCTTTTCAAGGCCCTATCGCCTGTACACGTATAAAATACGGAAAAGGGGTATGCGGCACGGCCTGGAAGGAGGCTCGTACGA

# Questionable array : NO  Score: 5.65
#   Score Detail : 1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.49,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTCGCACCCCGTGTGGGTGCGTGGATTGAAACA
# Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         R Score: 4.5/4.5
#   A,T distribution in repeat prediction:     F [7,5] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GTCGCACCCTGCGTGGGTGCGTGGATTGAAACA with 94% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  F [-4.60,-4.20] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      NA [0-0] Score: 0/0.41
#   AT richness analysis in flanks prediction: NA [71.7-73.3]%AT Score: 0/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0.74,9 Confidence: HIGH]

# Array family : I-C [Matched known repeat from this family],
//