Array 1 83166-86780 **** Predicted by CRISPRDetect 2.4 *** >NZ_JAAIND010000006.1 Enterocloster clostridioformis strain MSK.2.80 NODE_6_length_157455_cov_226.234, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ================================ ===================================== ================== 83166 32 100.0 35 ................................ CAGAATCTTGGTGAGTGTGCAGCGCGTCATCCGAA 83233 32 100.0 37 ................................ CTGCTGATGAAAAACGTATTTTTTGAAGCAGGCTTGC 83302 32 100.0 34 ................................ CAGTCAGAAAACAACAAATATTGCACGGATGAAG 83368 32 100.0 33 ................................ ATCAAATATGGGATAATCAAAATCAAAAATCTT 83433 32 100.0 34 ................................ ATGATATCTCATTTCCGTAACTTGTTTGAACATG 83499 32 100.0 36 ................................ AAGCTGTCCCTGGGGGGTATTTGAAACAGCCGTCAG 83567 32 100.0 34 ................................ ATCACGGAGTTTGCGATAATCTTCACGCTCGTCA 83633 32 100.0 35 ................................ TCACAGTTCCAGCGGAGTATCTCGTAAGTGTGTTA 83700 32 100.0 34 ................................ CTCCATTATCTCTACTGTAATCTCCTACCCAATC 83766 32 100.0 34 ................................ ATCTGACGTATAAACCGCTTCGATGATTTTGCTA 83832 32 100.0 35 ................................ CATTAAAACTGTCACTTACCACGGACGACCACAAC 83899 32 100.0 35 ................................ CGGTATAACAGATTCTACTTCCAAATCTCCTATAC 83966 32 100.0 35 ................................ CTCCGATAACCGACAACGTATATTTGTCTGTGGGA 84033 32 100.0 36 ................................ CGCCATTGCATACTTACTTTTAGTTGGCATCTTTAA 84101 32 100.0 35 ................................ CTCCGATAACCGACAACGTATATTTGTCTGTGGGA 84168 32 100.0 36 ................................ CGCCATTGCATACTTACTTTTAGTTGGCATCTTTAA 84236 32 100.0 34 ................................ CTCCGATAACCGACAACGTATATTTGTCTGTGGG 84302 32 100.0 34 ................................ CTCCGCCATGCTTGATACCATATTGTATCCTTGC 84368 32 100.0 33 ................................ CTCCAACCTGCTACAAAATGCGTCTGTTAACTC 84433 32 100.0 35 ................................ CGGTATAACAGATTCTACTTCCAAATCTCCTATAC 84500 32 100.0 35 ................................ TTAACAAAGGTGGCAAAAGGTGAGACCGACATCGT 84567 32 100.0 36 ................................ CATCGTTGTTGTGCTCCCCGTTCAGGATGTTATTGT 84635 32 100.0 36 ................................ TGTTTTTTCCTTGTCCACCGGCCTCACCTCCTCCCG 84703 32 100.0 34 ................................ CCGCATTTATTTAAAGCAGAACCGGGAGCAGATT 84769 32 100.0 34 ................................ AACCAGCCGGCTATGCCGGCCGCGGAGTAAGCAA 84835 32 100.0 34 ................................ AGTACTTTTTCAAAGAGGTTATGGCACAGCTTTC 84901 32 100.0 35 ................................ TGGGAGTACCTGCCGGTCACCGTGCAGACCAAGGC 84968 32 100.0 34 ................................ CTACAGGAACGCAGGAGCCGGCCATCCAGAAGGG 85034 32 100.0 34 ................................ CAATAAACACACATCTACATTGTACTATACCAAC 85100 32 100.0 34 ................................ AGATGAGGACCTGCAGGACATCCGCAGCGAAGTG 85166 32 100.0 35 ................................ CTGATTGGGGTAGCTCCACCAGTGGTATCAAGAAA 85233 32 100.0 34 ................................ AGATGTTCCGCATCATCAATAAACTGTTGCCCCA 85299 32 100.0 33 ................................ ATCAACTCCAAAATAAATCAGGCCGTCTTCCTG 85364 32 100.0 34 ................................ TACTTTTGCAAGGATAGCCGCAAGTGTGGTCTTT 85430 32 100.0 33 ................................ CAAACACATCTTTATACCATCATCATAACACAG 85495 32 100.0 34 ................................ AGAGGAACATGATGGAAAATCTGCTACGGAAGTG 85561 32 100.0 34 ................................ AAGCAGTCCGTAAGCGGTGTCGGGAGCAAGCTGG 85627 32 100.0 35 ................................ CGTGATGATTAATATAGTACCGGCTCACGCGAAGG 85694 32 100.0 34 ................................ TGCCATGACTTATTATGTTGCAGTTCTTGAAGCC 85760 32 100.0 35 ................................ TGTTTCCAATCAGACAGTTCTTTCTGCATCGTCTC 85827 32 100.0 33 ................................ TCAAAGAGTAAAAAAAGAGGGGTTGGTTATCAG 85892 32 100.0 35 ................................ TTACTGTCAAAATAGATGCCTATACCCATGGTAAT 85959 32 100.0 33 ................................ GGAAATGTGCTATATGCGGCCGGGATGGAGAGA 86024 32 100.0 34 ................................ TGCGAATGGCAAAAGAGATTGAAGCGTTGCCACC 86090 32 100.0 33 ................................ GCACCAGGCTAACAGCACGTTGTAATCCTCACG 86155 32 100.0 34 ................................ ATATGGGCAGTGCCGGCCGTACATCCTGGGGAGA 86221 32 100.0 34 ................................ TGCCCATCGGGAACTCCTCCAGCTGCTTAAGCAG 86287 32 100.0 35 ................................ AGGTCTGTGATTCCCAGCGACTTAAGCTCTGCCAG 86354 32 100.0 35 ................................ CGGTATGGTATCCATGCATACTGATTGGCATTAAT 86421 32 100.0 34 ................................ AGGTTGATGTCCTCTATGCTGCCATAAATACTAT 86487 32 100.0 34 ................................ AAAGCATTCGGCTAACTCATGCAGTACGCTCATT 86553 32 100.0 33 ................................ ATCTACCGTACAGGTATCCATACTATACATGGG 86618 32 100.0 34 ................................ GGAACGGGAGATGATGAAGTGTACTGGAACGGGC 86684 32 100.0 33 ................................ GGTCAGAAGAAATCCACCCAAAGGGATATATAT 86749 32 100.0 0 ................................ | ========== ====== ====== ====== ================================ ===================================== ================== 55 32 100.0 34 GTCTCCACCCTCGTGGTGGAGTGGATTGAAAT # Left flank : TGCTGGTACTGATTACATATGATGTGAATACGGAAACCGCCGCAGGCAGGGCCAGGTTGAGAAAGGTTGCAAAGCAATGTGTCAATTATGGCACCCGCGTCCAGAACTCCGTATTTGAATGTATTTTGGATAATGCCCAGCTGATAAAGCTAAAGGCGATACTGACAGATATAATTGATGAACAGACGGACAGCCTTAGGTTTTACAACCTTGGGAATAAACACACTACAAAAGTGAACCATGTTGGGATGAATAAGGGGATTAATGTGGAAGAACCGCTGATATTCTGACCTGGCCGCGGGCGGTAAAGCGGTATAAAGTATTTTGGTGCGAACCTGAAGCAAACATGGATTTACGGGGAGATTCGCACCGGAAAATTTGCACAATGAGAACGGATTTGCGAGTGTGCAGGGATGGTTGGCCGTGATTTGAGGGGGTGGGATATGATTTGTTTGTGGAAATGTGTTGAAGAATAGTAGTGGTTATTGTGTAGTATTGCT # Right flank : TTCCGGGGTAGTCTCATAGCCGTTCACATAATAGCGTCTCCGCTCTCGCGTCGGATCAAGTTAAAAATCCGCCAATTACAGAATTACATCCAGTTGCCCACTCCTCTGCCCTTTTCGCAAACGCCGCAATCCCTTTTGAACTGCAAAAAAAGGTTAGGTAAAACTAATTCCGTACAACAAAAAAGATGGGGCAAGCCCCATCCCTTCCATAATCTTGTTAGAATGTACAAAGAATGCATACTTCCATTATCCCTTCCATGCCCGAATCTCTTTAAGCCCCGGCACCAGCACCTTAGCGCTGGTTTCCAGAATGATCCGTCCCTTTTCAGCTGTGGCGCCTGAGGGGTCTCCGCCGATTCCGATGGGTTTGCCGTCTGCTGTTTTCCAATCCTCGGATACCCAGCCAATGGATACATTTCCGTAAGGTTTCAGCGCCTGATTGTCTTTAAATGCAGTGGGAATTCCTGCCTCTGAGGTCTCTGACTTTACCAGGGACGGGT # Questionable array : NO Score: 9.26 # Score Detail : 1:0, 2:3, 3:3, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTCTCCACCCTCGTGGTGGAGTGGATTGAAAT # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: F Score: 4.5/4.5 # A,T distribution in repeat prediction: R [6,9] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTCTCCACCCTCGTGGTGGAGTGGATTGAAAT with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-5.40,-4.10] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: F [68.3-45.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [9.64,0.37 Confidence: HIGH] # Array family : I-C [Matched known repeat from this family], // Array 1 12473-12684 **** Predicted by CRISPRDetect 2.4 *** >NZ_JAAIND010000077.1 Enterocloster clostridioformis strain MSK.2.80 NODE_77_length_15493_cov_140.737, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ================================ ================================================================ ================== 12473 32 100.0 28 ................................ GGTCTAAGTAATACCACTGTCCTCCCAG 12533 32 87.5 28 CCT............................C CGTCCAGATAATACCATCTTCCATTAAT 12593 32 81.2 28 ...............CT....TT.A......C CGTTCATATAATACCATTTGCCGTTGAT 12653 32 78.1 0 ...............C.T...GATA...T... | ========== ====== ====== ====== ================================ ================================================================ ================== 4 32 86.7 49 GGCCTGCCAGCCGGTGAGCATAGCGCCGCTGG # Left flank : TGACCGTAATACCTTCAGACAAGCGCTTTCCCAGGGCCAGACGTGTCCCGCGCACCTTGATGATAACGCTTCCGCTGCCTTTCTTATTCAATATATTCACCGGCGTCCCCTCATTGACCCCCAGGGCCTCAAGACGTCTGGTAATACTCTCATCCACTTGCACGCCCGATACTACATACCGGCCGCCGATTTCACACTCGTTCAGCTTTTTCTGCATTGCTGCCTGTCTCCTTTTGTAGTTGCATTTTAACCATATTCATTGTATGCTTATTGCGAAAAAATGTCAATAAAAAGAGTTTGTTATGACAAACTCAATTTCAAAAAAGAGGATGCAGCGAATCTCTCCGCTGCATCCTTTCTATACGTTTCGGTTCTATTAATATTGATTTTAGGCTTTACTGCACCCAAGCTCCGGAACCATCCACATATTTTCCATCCGGCGTCCATGTATTATAATACATGGCTCCGTTGGTGCCCAGATAGTAGTACTTGCCATTGAT # Right flank : GACGGATTCAGATAATACCAGGTTCCGTCCAGCTTCTGCCAGCCGGTGAGAATATAGCCCTTGTTATTAAAATAATACCATGCGCCGTTAATCTGCTTCCAGCAGGATGCCGGATATCCGCCTGTATCATAAGCATACCACACGCCTGTACCGTCCTGCTCCCAATGCCCGCTTCCGCCATAGCGGCCAAGCTCATCCTCATCAATATACAGGGAGCTGGAATCCTCGGACCATTCACCTGTCTTATTGTTGTAACGGGAAATGGCCCTTACACGGAAGGTATAGTCTCCTTCCCGGTTAAAATAGCCTGCAAAGTTATAGGAGGTTCCTGTGGTGCTGACCGTAGTGACTGTCTTGTCGTCACGGAGGAGTTTCACATCGTAGCTCTTAGCGCCCTCAATCTCCTCCCACTCAGCCGTGGTGTCACTCCACTCCAGATCCGACGCGCCCTGAAGCTTACCGCCGATACGTTTTAGCTCAGCATACACCTCAATGTACTG # Questionable array : NO Score: 3.39 # Score Detail : 1:0, 2:3, 3:0, 4:0.34, 5:-1.5, 6:0.25, 7:0.01, 8:0.6, 9:0.69, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GGCCTGCCAGCCGGTGAGCATAGCGCCGCTGG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [4,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched CTGCTGCCAGCCGGTGAGCATGGCGCCGGAGG with 96% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-0.50,-1.20] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-8] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [55.0-56.7]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.28,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 18224-20140 **** Predicted by CRISPRDetect 2.4 *** >NZ_JAAIND010000022.1 Enterocloster clostridioformis strain MSK.2.80 NODE_22_length_80121_cov_173.939, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ================================= ===================================== ================== 18224 33 100.0 34 ................................. CAGAGCAGGATAAGCCCGGCCGCCAGGGCCGCAA 18291 33 100.0 35 ................................. CCGCATAAACATAACCTCCATCTTCGGGGAACGAC 18359 33 100.0 36 ................................. ACATGTTGTCGGTGATGGGAACAATGGACGGAGTAG 18428 33 100.0 33 ................................. CCTATCAAAGAATCAACTTCGCACTCTATTATC 18494 33 100.0 33 ................................. CCTCACCGTCATATCGTAAATCAGTTATAAAAG 18560 33 100.0 34 ................................. CCGTACAGATTGTTAAGGTATAGCTTTGCAAGCT 18627 33 100.0 33 ................................. CTTGCCTGCCTGCTGTACCAGTTCCTTAATCCG 18693 33 100.0 34 ................................. AGCCGTTCCAGGGACACATGCAGCTCCAGGCGTT 18760 33 100.0 33 ................................. TGTTGTAACCACTGTGCCGGGGCCAACAGTTAT 18826 33 100.0 35 ................................. GGACCAGCGCATCCATCTCAGGGTATCCTCTCCTC 18894 33 100.0 33 ................................. CCCCTCCATGGTCAGGGCTTCCAGGCGGATGTC 18960 33 100.0 34 ................................. TGGAGAAACAGTCGGAGTATATCTCCTGCGAGGG 19027 33 100.0 34 ................................. CTACAGGAACGCAGGAGCCGGCCATCCAGGAGGG 19094 33 100.0 34 ................................. ACTGCAACACTATCACAATGCCAAACTCCGACCA 19161 33 100.0 34 ................................. CGTATACTCAGACGGTCAGTGTACCGGGAGCAAA 19228 33 100.0 35 ................................. TGCAAAGGATGGAAAAATGCAAGAGGTTGCCGGAT 19296 33 100.0 33 ................................. CTAAAAATAGGTATAAAAAATACAGCTCCGCAA 19362 33 100.0 34 ................................. ACACTCATGTCGAAAATCAACATCCTGATAGTCT 19429 33 100.0 35 ................................. TGTGTTAGAAGATGCGAAAGTCTATGTTGGTTGGG 19497 33 100.0 35 ................................. TGTCCGGTAGACTTATATATATACTGCATGGGACT 19565 33 100.0 37 ................................. AAAGTTGGCCTGTATCTCAATCATAAATCGCTCCTGG 19635 33 100.0 34 ................................. CTCGCTCCCAGGCTTAGCCTCGTAGCTGTACACC 19702 33 100.0 34 ................................. ATAGCCTGTGCCACCTTCCACCAGGGAAGGCCGT 19769 33 100.0 35 ................................. ACAATATGCAAACAAGGTAACGTTTTTTGATACCC 19837 33 100.0 35 ................................. CTGTTCTCATGGTTCTTTCCTTTCCGATTCTTGTT 19905 33 100.0 36 ................................. TGCCAAACTGACCAATTACCGGATTGAGCAGCTAGA 19974 33 100.0 34 ................................. TTTTTTAGTCCCTGGTGAGGAGCCCGGCTGGAAA 20041 33 100.0 34 ................................. ATCCCTCATTGCCTGGAACGCGTCTCTCGCCGTA 20108 33 100.0 0 ................................. | ========== ====== ====== ====== ================================= ===================================== ================== 29 33 100.0 34 GTCGAGGCCCGCGAGGGCCTTGTGGATTGAAAT # Left flank : TCCTGGAATTTCCGGAGAAATGTGGATTTCTGGCAGTGGCCGGAACAGCGGTGGCCCTGAAAGCGGGAAATCTGGCAGCTGTAATGGAACGCCAGGGCAGGGTATTGCGCATTTACGATGTCAATGATAAGAAGGAAGTGATGGCGGAATTTGTCCGTGCGTTCAAGCAGGGAGCTATATATCCGGAACAAAAGAGGCTTGTTTTGAAAGAGTATCCGGCGGAGGCGGCAGAGGCACTGCAGCATGCGGGATATATGCGTGAAATGAAGGATTATGTAGTGTATAAGTAGAGTGTGATTTCTGTGAGGCGGAATCCCATCAAACCAGGTGCGAATGCCAAGCTAACATGAAATGTCCGGGAGATTCGCACCTGAAAAACAGAGGAAAAGCCGTGAAAATAGGCTGGGTGAATGGATGGAGGAGGTTGAAATAAGGAGGGGGGGGGGGCATAATGCATAAAAAGTTAGGGGGATGTGGAGGGATTGTTGGATAGTTTTGCT # Right flank : TATAACATGCTTTATATACATATGCTTTTCCTTTCCACCACTGCTGGATATGTGGTTTATTTTAGCTGGCCGATTTCGTGATAGGCCTCCAGAATACCATTTTTGGCTGCTTTTTATACAGTATCAATATCAACCCAATACCAACACTAACCCAATACCACCCCAATACTAACACCAATTCTTAATCATGCAGAACCATTAGGCAATTGCGAAACTCACTGACCAGAATTTATGCACATCGTATAGGAATGAATCTAAATTGAATATATTAAGCAAGATATTTTGTGCAAGGTGAATATAAAAATTCAGAGTTTCGCAATTACCTAATGCAGAACCATTTCTGAAAGGAGAGGGAAAAGCCATGAAACAGATGTCATTATTCGATGACAGAGAAGTATACAATCCCCTGGCCAGCAGGCTGCGTCCGGATGACCTGGATGGATTTGTGGGACAGGAGCATCTTTTGGGAAAGGGGAAGCTGCTGCGCCAGCTCATAGAGC # Questionable array : NO Score: 9.26 # Score Detail : 1:0, 2:3, 3:3, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTCGAGGCCCGCGAGGGCCTTGTGGATTGAAAT # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: F Score: 4.5/4.5 # A,T distribution in repeat prediction: R [6,7] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTCGAGGCCCGCGAGGGCCTTGTGGATTGAAAT with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-11.00,-10.10] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: R [53.3-66.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [9.37,0.64 Confidence: HIGH] # Array family : I-C [Matched known repeat from this family], //