Array 1 35835-35982 **** Predicted by CRISPRDetect 2.4 *** >NZ_APYY01000272.1 Escherichia coli P0301867.8 gecP03018678.contig.274, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================ ================================ ================== 35835 28 100.0 32 ............................ TCGACGGGGTGCGGTAAAACCTTTGCGAACGC 35895 28 100.0 32 ............................ TTCACAGGTAACATACTCCATCCGCCCACCAT 35955 28 78.6 0 ................A...A.A.A.TG | ========== ====== ====== ====== ============================ ================================ ================== 3 28 92.9 32 GTTCACTGCCGTACAGGCAGCTTAGAAA # Left flank : AAATTCATCGTCGAGTTGCAGGTTCAGCTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGCTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGAGCTGACTTACGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTTAGGTAGGTTGGTCAAGTCCGTAATCTCGAAAGAGGTTACGGACTTTTTGTTTATGGGGTGGAGGAGGTTCAGACCCTTTTTTTAATGATGATGGTAAGTTGTTGATAATTAGTGCTGCGGGAAGGTAAGGATAAAAAAGGGTGCTGCAGGAGAATGGGATGGTTTTGCTTTATTAACAACGGGCTAAACGTGTAGTATTTGA # Right flank : GCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTGACTCGCTTCGCTCGCCCTGCGGGCAGCCCACTCACTGCGTTCGTGGTCTGTCCAACTGGCTGCGCCAGTTGTCGAACCCCGGTCGGGGCTTCTCACCCCCCCTGGAGTGCATTATGCGAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGCTACCAAACCTCATGGGTGGCAACGGGGCGCTACTATAGGGAGTTGGAGTAAAACGGTCAAGAAGAATTTTAATGATAATTATTGTTTGCTCATACTGTAAACAAGTTGTGCAGTATATCTACATCGAGACAAGTTACGGACTTATACTTCCAAAGTACTTCATACATATCACAAAATA # Questionable array : NO Score: 5.01 # Score Detail : 1:0, 2:3, 3:0, 4:0.65, 5:0, 6:0.25, 7:0.02, 8:0.4, 9:0.69, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: NA [Repeat is AT rich:51.72%AT] # Reference repeat match prediction: F [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-8.00,-7.70] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-7] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [58.3-53.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.28,0 Confidence: HIGH] # Array family : I-F [Matched known repeat from this family], // Array 1 76138-75377 **** Predicted by CRISPRDetect 2.4 *** >NZ_APYY01000445.1 Escherichia coli P0301867.8 gecP03018678.contig.449, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 76137 29 100.0 32 ............................. TGCAGATGTATTTGCCGGGCATTTAGGCGTGA 76076 29 100.0 32 ............................. GAAATAATGCTTTCCTTTTACAAACCGCGCTT 76015 29 100.0 32 ............................. CAATCAATAATATTAACCACATATTCAATTTC 75954 29 100.0 32 ............................. TGCAGATGTATTTGCCGGGCATTTAGGCGTGA 75893 29 100.0 32 ............................. GCACTGTAAATGAAATAGAGCGGCATTTATTG 75832 29 100.0 32 ............................. TATGACCGCACTCAACGTTATTACAGGCGCAA 75771 29 100.0 32 ............................. ACCACATCCACGCCGCCGATAGTCAGAGTGAT 75710 29 100.0 32 ............................. AATCCTGGTCGCTGGGTTATTTTGTATTGTGG 75649 29 100.0 32 ............................. AGATTGGTTTTCCAGTCTTTGGCGGTTGGCAA 75588 29 93.1 32 .T..........C................ CTGTGCCGGTAGCCCCCTCCCTGTTCAGCCTC 75527 29 93.1 32 .T..........C................ ATTTCTTTAATTATTTAGCTGATGCTTTTAAA 75466 29 93.1 32 .T..........C................ GGTAAAAACACGGTCTGAACCGACATTCATGT 75405 29 96.6 0 .T........................... | ========== ====== ====== ====== ============================= ================================ ================== 13 29 98.1 32 GAGTTCCCCGCGTCAGCGGGGATAAACCG # Left flank : | # Right flank : GGCGCACTGGATGCGATGATGGATATCACTTAGAATTCCCCGCCCCTGCGGTAGAACTCCCAGCTCCTATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTTCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCAGTGCCTTTCGGATAATTTTTATTCAAATCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACAA # Questionable array : NO Score: 6.16 # Score Detail : 1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGTCAGCGGGGATAAACCG # Alternate repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [4,6] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GAGTTCCCCGCGTCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.00,-13.50] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [1-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [43.3-0.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0.27,5.65 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 469-196 **** Predicted by CRISPRDetect 2.4 *** >NZ_APYY01000449.1 Escherichia coli P0301867.8 gecP03018678.contig.453, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 468 29 100.0 32 ............................. TCGCGGCCATCACCGGGCGCGGCGTGGATGTT 407 29 100.0 32 ............................. CCCTGTTTTCACATGTTTTGTCACGGAAAAAT 346 29 100.0 32 ............................. TATTATTCAGCATGAAAATGGCTCAATGCCAT 285 29 100.0 32 ............................. GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT 224 29 93.1 0 .A..........................A | ========== ====== ====== ====== ============================= ================================ ================== 5 29 98.6 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : CTGGATGAACTTTTGGCGACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACCCATAAATATTTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTCTTTGTCGCCTCTGAAAAACCTCCATTTTGCCCATCTTGGACTAATCATTATCATTCTCTACAAATTCTGTGGCGTTAATTTTTCGTTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTGGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA # Right flank : ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTAT # Questionable array : NO Score: 5.99 # Score Detail : 1:0, 2:3, 3:0, 4:0.93, 5:0, 6:0.25, 7:0.01, 8:0.8, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: NA [5,5] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-14.00,-14.90] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [2-1] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [68.3-73.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.28 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //