Array 1 105026-104387    **** Predicted by CRISPRDetect 2.4 ***
>NZ_NPZM01000134.1 Shigella sonnei strain ECCRETH04 36-CREC058-1_NODE_16.ctg_1, whole genome shotgun sequence    Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
    105025      29   100.0      32  .............................  TCGGCTCGTAGAAATGCAGATTCGCCGGATTA
    104964      29   100.0      32  .............................  ATAAAACAACAGGTGAGTTGGGTTTTGAGTAT
    104903      29   100.0      32  .............................  ACAATCAGATCACGATCGATGCCTCGCGTGAG
    104842      29   100.0      32  .............................  ATTTTCGCAGCGGGTAACCGCGCCCGGCGATC
    104781      29   100.0      32  .............................  TTTATGTGGAGTATCAGTGCCTGGGTGGAAAG
    104720      29   100.0      32  .............................  TGGGGCTGAGCATGGCTAAACAACCGAGGCGC
    104659      29   100.0      32  .............................  GTCAACTGGCACCACAGGATGATGGCGGCGAA
    104598      29    96.6      32  ............C................  CAGGCGCTCGACGCGGCACGCAGTCGGGCACA
    104537      29    96.6      32  ............C................  TTGTCGGGCTTGTTCGGCGCGGGTTGTTAACT
    104476      29    96.6      32  ............C................  CCAGCGAAGGACTTGCTCGTGTGGTTAAAGCC
    104415      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  ================================  ==================
        11      29    99.1      32  GTGTTCCCCGCGTCAGCGGGGATAAACCG

# Left flank :   GTGCTTGCCGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGCGATAGCGGTCACCGGGGACGCGGCGGATGAGTATGGTCGTCGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGCTCGCGATCTGGCTCCTGGAAGTGCGTGCCGGTGTTTATGTCGGAGATACGTCCAAACGTATTCGGGAGATGATCTGGCAGCAAATCTCTCAACTGGCAGGTTGCGGAAATGTGGTAATGGCCTGGGCGACCAATACCGAGTCGGGTTTTGAATTTCAGACCTGGGGTGAAAATAGACGTATTCCGGTGGATTTGGATGGGGTGCGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAAGTTATCTGTTCTTTAAAAATAAGGAAATGTTTTAATTTAGTTGGTAGATTGTTGATGCGGAATAAATTTGTTTAAAAACAGTTATGTATGCTTAGT
# Right flank :  GACGCACTGGATGCGATGATGGACATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCCAGCTCTCATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCTTTCACCAGCGGTAGCATTATCCGTATAACATCACGGCAGCGACGTTCTATTCTTCCAGGAAGCGCCTTATCAATATGTTGTTGATTATCAAGCCTGACATCGTGCCAGCTTGTTCCCGCAGGGAAGGCGGCGGTTTTTGCACGTTGCTGATAGCCATCCTTATTCCCAAGATTCCAGTTAGTTGCCTCCACCGAAAGTACCGCAATACCGGCTTTGTCAAAAACTTCCGCGTCGTTACAGCACCCGGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACAA

# Questionable array : NO    Score: 6.21
#   Score Detail : 1:0, 2:3, 3:0, 4:0.95, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTGTTCCCCGCGTCAGCGGGGATAAACCG
# Alternate repeat :   GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:          NA    Score: 0/4.5
#   A,T distribution in repeat prediction:      NA [5,5]    Score: 0.37/0.37
#   Reference repeat match prediction:          R [matched GTGTTCCCCGCGTCAGCGGGGATAAACCG with 100% identity]    Score: 4.5/4.5
#   Secondary Structural analysis prediction:   R [-12.00,-13.50]    Score: 0.37/0.37
#   Array degeneracy analysis prediction:       NA [0-0]    Score: 0/0.41
#   AT richness analysis in flanks prediction:  R [40.0-71.7]%AT    Score: 0.27/0.27
#   Longer leader analysis prediction:          NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.14   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 10315-9310    **** Predicted by CRISPRDetect 2.4 ***
>NZ_NPZM01000084.1 Shigella sonnei strain ECCRETH04 36-CREC058-1_NODE_21.ctg_1, whole genome shotgun sequence    Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
     10314      29   100.0      32  .............................  TTTGCTGAGTGTGCATACACACTTGAACATCA
     10253      29   100.0      32  .............................  CTATCAGTAACAACCTGGTAAATATCGGTTTT
     10192      29   100.0      32  .............................  TGCACGCCCTGCCGGAATCTCCCCTCGCTCTC
     10131      29   100.0      32  .............................  AGTAAGCCGGTTGATTATCGCCAGGGATATTA
     10070      29   100.0      32  .............................  GGAAAATAAACGCGTTATTCCTTGATGGGTGC
     10009      29   100.0      32  .............................  AGATTCGCGCAATGTACGGTAAAGACAGCACC
      9948      29   100.0      32  .............................  GGATTCTGCACTCTTTGATGTAATGGTACTGC
      9887      29   100.0      32  .............................  TTTGTTTTGATGATTTATCAGTATATGAATTG
      9826      29   100.0      32  .............................  ATTTTTGCGCGGCCTGCATGAAACCAGTGGCA
      9765      29    96.6      32  A............................  GTACCGTTGAGCAATGGTGGGTAAATGGTGCT
      9704      29   100.0      32  .............................  ATTTAGATACGGCACACAGGAATGCGCTGGAT
      9643      29   100.0      32  .............................  CATGGGAGTCCAGATTACAAATTCCAGAATGG
      9582      29   100.0      32  .............................  CGCCAGTCTGAGGTTTTCCGGTCATACATCGT
      9521      29   100.0      32  .............................  ACCGTTGCTACACATATGATCAGGCATGTGCC
      9460      29   100.0      32  .............................  GACGATGAGACGCCCTGGTGTGCCGCGTTCGT
      9399      29   100.0      32  .............................  CGTTTAGCTCCGCAATGTTGGATATCACTGAT
      9338      29    89.7       0  ...........A........A.......A  |
==========  ======  ======  ======  =============================  ================================  ==================
        17      29    99.2      32  GAGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   CTGGATGAATTGCTTGCGACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATTTAAATATTGCCTGATTACACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTCTTTGTCGCCTCTAAAAAACCTCCATTTTGCCCATCCTGGACTAATCATTATCATTTTCTACAAATTCTGTGGCGTTAATTTTTCGTTGGAGTGAAAATTATTACGTCGGAGTTTGGTGGATTTTAGTCGGTATAGAATTACTTTAAATATTTGGCTTTTCAATCAATGAATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGTTTCTGTTAAGTTGGTAGATTGTGACTGACTTAAAAAATCAATAATTAATAATAGGTTATGTTTAGT
# Right flank :  CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTCGTATTAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGCTCTCCATATATGAGAAGTTCACAATTATCGATACAAAAAATCAAATTTAATCAAAGTGTTATTTGTATAATCCTTAAACCGTTAAGAAATTTTAACATATTATTTTTTTAATATTAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTGTATTTTTTAAGCTTGTTATTCATTGATTAAGTAATAAATCTGGAAATTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAATCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTTA

# Questionable array : NO    Score: 6.22
#   Score Detail : 1:0, 2:3, 3:0, 4:0.96, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GAGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:          NA    Score: 0/4.5
#   A,T distribution in repeat prediction:      R [3,6]    Score: 0.37/0.37
#   Reference repeat match prediction:          R [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity]    Score: 4.5/4.5
#   Secondary Structural analysis prediction:   R [-12.00,-13.50]    Score: 0.37/0.37
#   Array degeneracy analysis prediction:       R [3-0]    Score: 0.41/0.41
#   AT richness analysis in flanks prediction:  R [66.7-76.7]%AT    Score: 0.27/0.27
#   Longer leader analysis prediction:          NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.92   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//