Array 1 4175619-4174919    **** Predicted by CRISPRDetect 2.4 ***
>NZ_UGDW01000001.1 Escherichia coli strain NCTC9031, whole genome shotgun sequence    Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
   4175618      29    96.6      32  ..........................T..  AGCCAGACACATTGCTTTATCGCGCCACGCTG
   4175557      29    96.6      32  ..........................T..  GCGAGTAAGAGTATTGAGTATTTTAATCTCAT
   4175496      29    96.6      32  ..........................T..  GCCATAGACTTTATTAACTTCATTGGGGGTTA
   4175435      29    96.6      32  ..........................T..  CGTATCACAACCGGCCTCAGTACGTATCAGAA
   4175374      29    96.6      32  ..........................T..  AGGGGCAGGGAGTGACGCTGGAGCAGGCGATC
   4175313      29   100.0      32  .............................  CGCAGTTCGCTACCGAGCGTGGTTTGCTGGTC
   4175252      29   100.0      32  .............................  GCGCACAGCGACGCGAAATCAACGAAAATTTA
   4175191      29   100.0      32  .............................  CGATCACCGGCGGCGACCTAGTGGAGATTGAC
   4175130      29    96.6      32  .......................C.....  TCACCGAACAGGAGGGGCAGAGTGCTGGCCGA
   4175069      29   100.0      32  .............................  CAGGGAAACCGTTCGTACTTATCGACCTCAAC
   4175008      29   100.0      32  .............................  AAGGGGACGGCTACGGGACGCCGCCTATTGAC
   4174947      29    96.6       0  ............T................  |
==========  ======  ======  ======  =============================  ================================  ==================
        12      29    98.0      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   GTGCTTGCCGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGCGATAGCGGTCACCGGGGACGCGGCGGATGAGTATGGTCGTCGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGATCGCGATCTGGCTCCTGGAAGTGCGTGCCGGTGTTTATGTCGGAGATACGTCCAAACGTATTCGGGAGATGATCTGGCAGCAAATCTCTCAACTGGCAGGTTGCGGAAATGTGGTAATGGCCTGGGCGACCAATACCGAGTCGGGTTTTGAATTTCAGACCTGGGGTGAAAATAGACGTATTCCGGTGGATTTGGATGGGGTGCGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAAGTTATCTGTTCTTTAAAAATAAGGAAATGTTTTAATTTAGTTGGTAGATTGTTGATGCGGAATAAATTTGTTTAAAAACAGTTATGTATGCTTAGT
# Right flank :  GACGCACTGGATGCGATGATGGATATCACTTAGAGTTCCCCGCCCCTGCGGTAGAACTCCCAGCTCTCATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCAGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAATTTGTTGCTTCTACCGAAAGTACGGCAATACCGGCTTTGTCGAAAACTTCGGCATCATTACAACAGCCAGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACAA

# Questionable array : NO    Score: 5.85
#   Score Detail : 1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.69,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   GTGTTCCCCGCGCCAGCGGGGATAAATCG

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     R [4,5] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      R [2-0] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: R [43.3-71.7]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.92   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 2 4202082-4201321    **** Predicted by CRISPRDetect 2.4 ***
>NZ_UGDW01000001.1 Escherichia coli strain NCTC9031, whole genome shotgun sequence    Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
   4202081      29   100.0      32  .............................  CTGCGGGCCTGAACGGCGAAAATGTTGGTGAC
   4202020      29   100.0      32  .............................  AGATTCGCGCAATGTACGGTAAAGACAGCACC
   4201959      29   100.0      32  .............................  GGATTCTGCACTCTTTGATGTAATGGTACTGC
   4201898      29   100.0      32  .............................  CCATTCTTGTTGTCGTTCATCAAGCGCAGGGG
   4201837      29    96.6      32  .....T.......................  TTTGTTTTGATGATTTATCAGTATATGAATTG
   4201776      29   100.0      32  .............................  GTATTGCAGAGCCTGATGGGGCATAAGTCGAT
   4201715      29   100.0      32  .............................  AGCTTTATTGTTTTCAGGGAAAATAACGCGGC
   4201654      29   100.0      32  .............................  CATGGGAGTCCAGATTACAAATTCCAGAATGG
   4201593      29   100.0      32  .............................  CGCCAGTCTGAGGTTTTCCGGTCATACATCGT
   4201532      29   100.0      32  .............................  CGCCAGTCTGAGGTTTTCCGGTCATACATCGT
   4201471      29   100.0      32  .............................  GACGATGAGACGCCCTGGTGTGCCGCGTTCGT
   4201410      29   100.0      32  .............................  CGTTTAGCTCCGCAATGTTGGATATCACTGAT
   4201349      29    89.7       0  ...........A........A.......A  |
==========  ======  ======  ======  =============================  ================================  ==================
        13      29    98.9      32  GAGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   CTGGATGAATTGCTTGCGACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATTTAAATATTGCCTGATTACACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTCTTTGTCGCCTCTAAAAAACCTCCATTTTGCCCATCCTGGACTAATCATTATCATTTTCTACAAATTCTGTGGCGTTAATTTTTCGTTGGAGTGAAAATTATTACGTCGGAGTTTGGTGGATTTTAGTCGGTATAGAATTACTTTAAATATTTGGCTTTTCAATCAATGAATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGTTTCTGTTAAGTTGGTAGATTGTGACTGACTTAAAAAATCAATAATTAATAATAGGTTATGTTTAGT
# Right flank :  CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTCGTATTAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGCTCTCCATATATGAGAAGTTCACAATTATCGATACAAAAAATCAAATTTAATCAAAGTGTTATTTGTATAATCCTTAAACCGTTAAGAAATTTTAACATATTATTTTTTTAATATTAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTGTATTTTTTAAGCTTGTTATTCATTGATTAAGTAATAAATCTGGAAATTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAATCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTTA

# Questionable array : NO    Score: 6.21
#   Score Detail : 1:0, 2:3, 3:0, 4:0.95, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GAGTTCCCCGCGCCAGCGGGGATAAACCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     R [3,6] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      R [3-1] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: R [66.7-76.7]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.92   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//