Array 1 144790-144937      **** Predicted by CRISPRDetect 2.4 ***
>NZ_JAJCFV010000004.1 Escherichia coli strain MSK.19.94 PMCINAEB_4, whole genome shotgun sequence      Array_Orientation: Forward

  Position  Repeat     %id  Spacer  Repeat_Sequence               Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  ============================  ================================  ==================
    144790      28   100.0      32  ............................  TCGACGGGGTGCGGTAAAACCTTTGCGAACGC
    144850      28   100.0      32  ............................  TTCACAGGTAACATACTCCACCCGCCCACCAT
    144910      28    78.6       0  ................A...A.A.A.TG  |
==========  ======  ======  ======  ============================  ================================  ==================
         3      28    92.9      32  GTTCACTGCCGTACAGGCAGCTTAGAAA

# Left flank :   AATTCATCGTCGAGTTGCAGGTTCAGCTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGCTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGAGCTGACTTACGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGATGAAGTCCGTAATCTCGAAAGAGGTTACGGACTTTTTATTTATGGGGGGGAGGAGGTTCAGACCCTTTTTTTGATGATGATGGTTAGTTGTTGATAATTAGTCCTGCTGGAAGGTAAGGATAAAAAAGGGTGGCAGCAGGAGAATGGGATGGTTTTGCTTTATTAACAACGGGCTAAACGTGTAGTATTTGA
# Right flank :  GCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTGACTCGCTTCGCTCGCCCTGCGGGCAGCCCACTCACTGCGTTCGTGGTCTGTCCAACTGGCTGCGCCAGTTGTCGAACCCCGGTCGGGGCTTCTCACCCCCCCTGGAGTGCATTATGCGAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTGACTCGCTTCGCTCGCCCTGCGGGCAGCCCACTCACTGCGTTCGTGGTCTGTCCAACTGGCTGCGCCAGTTGTCGAACCCCGGTCGGGGCTTCTCACCCCCCCTGGAGTGCATTATGCGAAAAAAAACTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGT

# Questionable array : NO   Score: 5.01
#   Score Detail : 1:0, 2:3, 3:0, 4:0.65, 5:0, 6:0.25, 7:0.02, 8:0.4, 9:0.69,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GTTCACTGCCGTACAGGCAGCTTAGAAA
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     NA [Repeat is AT rich:51.72%AT]
#   Reference repeat match prediction:         F [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  F [-8.00,-7.70] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      F [0-7] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: NA [58.3-53.3]%AT Score: 0/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         F [5.28,0   Confidence: HIGH]

# Array family : I-F [Matched known repeat from this family],
//

Array 1 60105-59649      **** Predicted by CRISPRDetect 2.4 ***
>NZ_JAJCFV010000011.1 Escherichia coli strain MSK.19.94 PMCINAEB_11, whole genome shotgun sequence      Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
     60104      29   100.0      32  .............................  GCGGCTGGGGTGGGCTGGTCGGCGATGCTTGC
     60043      29   100.0      32  .............................  GGCCGGACGGTAGCAGCAATTTTACGGCCATT
     59982      29   100.0      32  .............................  AGATTGGTTTTCCAGTCTTTGGCGGTTGGCAA
     59921      29    93.1      32  .T..........C................  CTGTGCCGGTAGCCCCCTCCCTGTTCAGCCTC
     59860      29    93.1      32  .T..........C................  TGTTTTAGAGATGTGTATTCAAAATCAGATAG
     59799      29   100.0      32  .............................  AAATTGATCCCAGGGTGATTATCGTGGGGATC
     59738      29    93.1      32  .T..........C................  GGTAAAAACACGGTCTGAACCGACATTCATGT
     59677      29    96.6       0  .T...........................  |
==========  ======  ======  ======  =============================  ================================  ==================
         8      29    97.0      32  GAGTTCCCCGCGTCAGCGGGGATAAACCG

# Left flank :   CGTGCTTGCTGCTGGAGAAATACAACCGCCGGCCCCACCTGAAGATGCACAGCCTGTTGCCATTCCGCTTCCTGTTTCACTGGGAGATGCAGGCCATCGGAGTAGCTGAAATGAGTATGTTGGTCGTGTTCACTGAAAATGTACCTCCGCGCTTACGAGGCAGATTAGCCATCTGGTTGTTGGAGGTACGTGCAGGGGTGTATGTAGGTGTTGTATCCGCAAGAATCCGTGAAATGATCTGGGAACAAATATCTGGACTGGCGGAAGAAGGCAATGTGGTGATGGCATGGGCTACGAATACGGAATCGGGATTTGAGTTCCAGACATTTGGGGTAAACAGGCGTACTCCGGTAGATTTGGATGGTTTAAGGTTGGTCTCTTTTTTACCTGTTTAAAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACTTTTAGA
# Right flank :  GGCGCACTGGATGCGATGATGGATATCACTTAGAATTCCCCGCCCCTGCGGTAGAACTCCCAGCTCCTATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTTCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCAGTGCCTTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACAA

# Questionable array : NO   Score: 6.09
#   Score Detail : 1:0, 2:3, 3:0, 4:0.85, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.98,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GAGTTCCCCGCGTCAGCGGGGATAAACCG
#   Alternate repeat :   GTTCCCCGCGCCAGCGGGGATAAACCG

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     R [4,5] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GTGTTCCCCGCGTCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-5.60,-7.20] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      R [1-0] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: R [43.3-68.3]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.92   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 2 87147-85653      **** Predicted by CRISPRDetect 2.4 ***
>NZ_JAJCFV010000011.1 Escherichia coli strain MSK.19.94 PMCINAEB_11, whole genome shotgun sequence      Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                    Insertion/Deletion
==========  ======  ======  ======  =============================  =================================  ==================
     87146      29   100.0      32  .............................  TAAATTACCCACAGAATCAGAATCTAAGACGG
     87085      29   100.0      32  .............................  GTCGCCGTTCCGGCGGTATGTGTCTGAAGCGC
     87024      29   100.0      32  .............................  GCGGTTATCGCTGAATCCACTATTCAGGTGAA
     86963      29   100.0      32  .............................  TCACCGGGTCAGATACTGATGTTATGGCTTAT
     86902      29   100.0      32  .............................  GGCTTATTAATATCACCCTTACCGACTACGGC
     86841      29   100.0      32  .............................  GGGCCGACAAAACGTGTCAGAGTTCTGAAATT
     86780      29   100.0      32  .............................  CCCGCAACAACTCAGGGAACCAGACACTAAAC
     86719      29   100.0      32  .............................  AGTCTTTAATCAAAATGGATTTTTATAATGAA
     86658      29   100.0      32  .............................  TCTGGTGTTTCTCTCGATGCGCGATATGTCTA
     86597      29   100.0      32  .............................  ATATCTGTCCTGTCCATGATCGGGAATACTCT
     86536      29   100.0      32  .............................  CTTGTACTCCAGAAAAAATATAGAAGCGCCAT
     86475      29   100.0      32  .............................  CCAGCGAACATTGACTCAGATGCCATTGCTGG
     86414      29   100.0      32  .............................  CCGCTAATGCGCGTATCTCATGGAAGGTTGGA
     86353      29   100.0      32  .............................  TTTTTGTACGTCATGATGCGTCTCCATGTAGC
     86292      29   100.0      33  .............................  CCCAGCCTGGCTTTCGGGTAAATCACCTAAAAC
     86230      29   100.0      32  .............................  ATTACGGAGCTGGGAATGACTCCCAGCAAAAT
     86169      29   100.0      32  .............................  TGGTACGGAACATGCTGTCTATATTTCGACCA
     86108      29   100.0      32  .............................  CCAGTGGAGCCGCCGCGAATATACACACGATT
     86047      29   100.0      32  .............................  GCGAGGGGCAGCCGTTCGCGCTGCATGTTGAT
     85986      29   100.0      32  .............................  ACGAATCTGAACAGACGTGTGACTTAATCGTT
     85925      29   100.0      32  .............................  AAAAAACAGTGGTACTACCGCCCCGCCGAACA
     85864      29   100.0      32  .............................  TAAGGCCGTCGCCGGATCAGCCTGGCTATGCC
     85803      29    93.1      32  .A.C.........................  TTCTTGCGGGTGTTGCAAATATTCTTCACGTA
     85742      29    93.1      32  .A.C.........................  GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT
     85681      29    89.7       0  .A.C........................A  |
==========  ======  ======  ======  =============================  =================================  ==================
        25      29    99.0      32  GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   TGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA
# Right flank :  CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATGTTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTT

# Questionable array : NO   Score: 6.21
#   Score Detail : 1:0, 2:3, 3:0, 4:0.95, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GTGTTCCCCGCGCCAGCGGGGATAAACCG
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     R [4,5] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      R [3-0] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: NA [68.3-75.0]%AT Score: 0/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.65   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//