Array 1 31068-30715 **** Predicted by CRISPRDetect 2.4 *** >NZ_WKMQ01000040.1 Parabacteroides distasonis strain BIOML-A18 scaffold40_size31076, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== =============================================== ============================== ================== 31067 47 100.0 30 ............................................... CGTCGTTTCTCAGATAAAAATAAAGCGATT 30990 47 100.0 29 ............................................... GCTTTCGCACAGACTTATTTTGCGGTACA 30914 47 100.0 29 ............................................... GTTGATTTGATTAAAAGAACAGGGAGACT 30838 47 100.0 30 ............................................... CGCTTTGGATATAATGAGCCTATAGAGTAC 30761 47 100.0 0 ............................................... | ========== ====== ====== ====== =============================================== ============================== ================== 5 47 100.0 30 GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC # Left flank : GGCGCAAT # Right flank : ATATTATCTATTAAAAACTATTAATCAATGGATTACAAAGCACATCAGAATAAAAAAAACAGATATTGTTTCCATTAAAAATCCCGCTTGACAGCGGGATTTTTCTTTTAGAACAACTCCAATTGCTGTCCAGGAGTATTCACTTCCACCGTTTTCTTTCCATAAAAAAGTTCAATATTCCCAAACTGCTTATCCGTAATACACATGATTCCGACATTCCCATGCTCCGGAAGAAAAGATTTAACTCTTTTTACATGTACGGCCGCATTCTCGCTACTAGCACAATGGCGGACATAAATAGAAAACTGAAACATCGTAAATCCATCTCTTTGTAGATTTTTCCTGAAATCCACATACGCCTTTTTCTCCTTCTTCGTCTCGGTAGGCAAATCAAACAACACAAGTACCCACATAACACGATATTCGCTAAACCGATCCATATTACATTTCCGGATAGGAGATTCTACGCAATTCGCCACTAAAGCACTTATACAGAGAAG # Questionable array : NO Score: 6.06 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:0.8, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: NA [Repeat is AT rich:63.83%AT] # Reference repeat match prediction: R [matched GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC with 92% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-0.90,-2.10] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: F [80.0-5.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0.27,4.87 Confidence: HIGH] # Array family : II-C [Matched known repeat from this family], // Array 1 19602-18628 **** Predicted by CRISPRDetect 2.4 *** >NZ_WKMQ01000019.1 Parabacteroides distasonis strain BIOML-A18 scaffold19_size112758, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ================================= ====================================== ================== 19601 33 100.0 32 ................................. AGGTGGATGTTATCAATCCCAAGACGGGCAAA 19536 33 100.0 36 ................................. CTTCGCAAGGTGTTCCCGCATTGATACAGGAGCAAA 19467 33 97.0 38 ................................C TGCACGAAAGCGAGCATCGTCAACAAGGCGATTGACTC 19396 33 100.0 32 ................................. ATATTACTTGAAGATTATGATATTGAGGCGGC 19331 33 97.0 34 ................................A ACAGCACGCCGCAAGATACGACGGATTACATAAC 19264 33 100.0 37 ................................. ACAGAACTGTGGACGGTCTGTGCCTCGATATACGAAG 19194 33 100.0 34 ................................. ATTATGAGATTATTGGAGGAAGTTAATACCACCA 19127 33 100.0 35 ................................. ATTATGAGATTATTGGAGGAAGTTAATACCACCAA 19059 33 97.0 34 ................................A GGATACTGAAAACGGAGAATAACAATCATGGCTA 18992 33 97.0 34 ................................C ACGTGGAACAAAGTAGCGGAAGGTCTTAGGCTTT 18925 33 97.0 32 ................................A CAATACCCAAAGCGTTTATGAGTATTACCTAA 18860 33 97.0 34 ................................T TACCATTGGTCGAATACCCGATCTTGACAAGACC 18793 33 97.0 33 ................................C AAGTCATCCACGTTTTTTTGAAACTCCGGATGA 18727 33 97.0 34 ................................T TTGGTAAGCTCAACGTAAAACTCAATTTGAGACT 18660 33 97.0 0 ................................A | ========== ====== ====== ====== ================================= ====================================== ================== 15 33 98.2 34 GTCGCACCCCGTGTGGGTGCGTGGATTGAAACG # Left flank : TGATAATTATCCGGTATTTCTAATAAAATGATTTACGATTATGCATATTCTTGTGACTTATGATGTGGATACTACGAGCAAAGAAGGAGCTCGCCGCCTACGACATGTGGCTAAGGCTTGCATAGATTATGGCCAAAGGGTACAGAATTCTGTCTTTGAGTGTGAGGTGACAGAAGCACAATATTGTCTCTTGATTGAACGAATCAAGCGTATTATTGATATGTCTCTTGATAGCGTTAGATTTTATATTCTCAATAAAAACGAGAATAAAAGGGTAAAAGTGATAGGTGTTGAAACTGCTTACAAAGTTAATAATGCTCTTATCATATAATTTATGCGAATGTGGAGTATTACGAAAAAAGTAGTATTTTCGCACCCCTTAATAATTAGCAGATTAACCCGTCTTTAAGGCGATTGCAGCCATTAAGCTGAAATAAAAATGAGAATTCGCATATTAATAGGTCTAATTTATTGACTTATAATATGTAACTTTGCACACT # Right flank : TTTGTAAAGCGCAAGCTTTTTCATAAAAGATATACTTACGTGATGAAGTATATCTTCTATGGGACTTGAAGTGTTCTTCTGCGACGAAGGGATTAGTACTTCGTAAAAATAACTTTTTAAAGTCACTGACTCTGACATATAATTTCGCTTAGTTATTTCTGCAAAAATCATTCATATGTAAACAAGTTTCGTATATTCGCGACCTTATAATAGTTTAAATATGGCAGAGGAACTACGGATTAAAAACGGTGATAAACAGGAGATGTATGAGACATTGCTCCCGCAAATCGCCTCATTGGTAGGTAACGAGACCGACCTAATCGCTAACATGGCGAACATCGCCGCAGCACTCAAGCAGACTTTCGGTTTCTTTTGGGTAGGTTTCTACCGGGTCATAGACAATCAGTTGGTATTAGCGCCTTTTCAAGGCCCTATCGCCTGTACACGTATAAAATACGGAAAAGGGGTATGCGGCACGGCCTGGAAGGAGGCTCGTACGA # Questionable array : NO Score: 5.80 # Score Detail : 1:0, 2:3, 3:0, 4:0.91, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.63, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTCGCACCCCGTGTGGGTGCGTGGATTGAAACG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: R Score: 4.5/4.5 # A,T distribution in repeat prediction: F [7,5] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTCGCACCCTGCGTGGGTGCGTGGATTGAAACA with 94% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-4.60,-4.20] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: NA [71.7-73.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0.74,9 Confidence: HIGH] # Array family : I-C [Matched known repeat from this family], // Array 1 117888-118165 **** Predicted by CRISPRDetect 2.4 *** >NZ_WKMQ01000017.1 Parabacteroides distasonis strain BIOML-A18 scaffold17_size118214, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== =============================================== ============================== ================== 117888 47 91.5 30 A......C.......................GA.............. TTCTCTGCGTTTTGGTTCAATGCGATTGCA 117965 47 100.0 30 ............................................... ACAACAAGACTGTACTCGAAGGAGGAGTTA 118042 47 100.0 30 ............................................... ATTCGCTTAAAGCATGAGCCGATACGTTGA 118119 47 100.0 0 ............................................... | ========== ====== ====== ====== =============================================== ============================== ================== 4 47 97.9 30 GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC # Left flank : TCTAAACGCTTGGAGAATCGATTATTTCGAGGCAAGGTACGACTTGCTTCCAAATTCGAAAATATTTTTTCACAATAATCTATTGAACAACGCATTACAAATTTTCACTGATTTATTCTAGCGAAAGATTTCGTCTTCTGTCAACCTCAGTCGGACAAACAAGCGACTTTTTATCTGAAAACAATAAGCTTTTCATCCGAAAACAATCGGATTTCCGGTCGGAAACAATAAGGATCGCAGTCGAAAACAGCATACAACGCCGTTATAAACACCTTACTATTGGATCATAATCCCTCTACAAAGCCTATTCATACACTAAAAAATGGCGAAATACGCTGTTTCCGCCTTCCTTTTCTCTCTATTTTCGCATTTCCATAAAAAGCCTTTATCAGCTAAAAGTCTCCATTTTGAGCAAAATGACAGAAAAAGAGATATTTTGATGACATAAAAGAGAATGGCTTACTTTTTGGTGTAGCTGCCCCAATATTTTGCAATATTTT # Right flank : TGGGTTTGATATACATGACACCCAAAAGGAGTTGTGATTTGCTTTCATT # Questionable array : NO Score: 5.76 # Score Detail : 1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTTGTGATTTGCTTTCATTTTAGTAACTTTGAGCCATTGGAAACAGC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: NA [Repeat is AT rich:64.58%AT] # Reference repeat match prediction: F [matched GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC with 92% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-2.10,-0.90] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [5-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [66.7-51.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.14,0.41 Confidence: HIGH] # Array family : II-C [Matched known repeat from this family], //