Gene Information |
Locus tag | ECH74115_4123 |
Symbol | |
ID | 6971278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3819131 |
End bp | 3820507 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643387877 |
Product | transcriptional regulator |
Protein accession | YP_002272317 |
Protein GI | 209400373 |
COG category | [K] Transcription |
COG ID | [COG3710] DNA-binding winged-HTH domains |
TIGRFAM ID | |
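
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(3819131, 3820507)
print(length, protein_length(length))  # 1377 458
```

Both values agree with the Gene Length and Protein Length fields of this record.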
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0000465407 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACTTAG AAAATAAATT CTCATATCAT TTTCTTGAGG GATTAACGCT CACGGAAGAT GGAATTCTTA CTCAAGGAAA TGAGCAAGTT TATATTCCAC CGAAAGAGTT AGGTGTATTA ATAGTATTAC TTGAATCCGC TGGTCATGTC GTACTGAAAG ATATGATCAT CGAATCAGTA TGGAAAAATA TTATTGTTAG TGATGAGTCC CTGACAAGAT GTATCTATTC TTTGCGCTGC ATTTTTGAAA AAATTGGCTA TGATCGTTGC ATAGAAACAA TCTACCGGAA AGGTTATCGT TTCAGTGGGC AGGTTTTCAA AACTAAAATA AATGAAGATA ACACTTCAGA CTATTCTATC GCTATATTCC CTTTCACTAC TTCATTGAAA ACACTGGATC CATTAATACT TAATCAGGAA TTAGTACAAA TCATTTCAAA TAAAAAAATC GATGGCCTCT ATACCTATCC GATGGCTGCG ACAAATTTTT GTAATGATCA CATATCGCAA AATTCATTTT TGAGAAGATT CAATCCAGAT TATTTCGTTA CAGGAAGAAT AAACCAGAAT AATGCAGTGA ACACTTTATA CATTGAGTTG ACCGACGCTA AAAACCTTTT CCTCATCGCC AGTAATCATC TACCTGTTGA TGAACTACAT AATACATCAC AATTTATTAT AGAAAATATC CTTCAAACAG TACATAATCC ACAACGATCT GTAAGATTAA CTAAGCAGGA CCAAGGATAT AAGAATCATT ATTTATCAGA TGAAATGTTA GCCGGAAAGA AAGAACTTTA CGAGTTCACC CCTGAAAGCA TTTACAGGGC CATGACTATA TTTGATGGAT TACAAAATAA AAGTGATATA CAGACGCTAA AAACAGAATG TTATTGCCTT CTAGCGGAAT GCCATATGTC TTTGGCACTT CATGGAAAAA GTGAACTTGA ACTTGCTGCT CAAAAAGCAT TAGAGCTTTT AGATTATGTA TCAGACATAA CCACTGTCGA TGGAAAAATT TTAGCTATTA TGGGACTGAT AACTGGTCTG TCTGGACAAG CAAAAGTATC TCATATCTTA TTTGAACAGG CTAAGATACA CTCAACTGAT ATAGCCTCTC TCTACTACTA TAGGGCACTT GTCAACTTTC ATAATGAAAA AATTGAAGAG GCAAGGATTT GTATAGACAA ATCACTACAA CTCGAACCCA GAAGACGAAA AGCAGTTGTG ATAAAAGAAT GTGTAGATAT GTATGTGCCT AACCCGCTCA AAAAAAACAT GAAACTCTAC TATAAAGAAA CTGAGAGTGG AAGCCATCGA GTTATAATTG ACAACATTTT GAAATTAAAG CAGCTGACGA GAATTTGTAT GCGATAA
|
Protein sequence | MDLENKFSYH FLEGLTLTED GILTQGNEQV YIPPKELGVL IVLLESAGHV VLKDMIIESV WKNIIVSDES LTRCIYSLRC IFEKIGYDRC IETIYRKGYR FSGQVFKTKI NEDNTSDYSI AIFPFTTSLK TLDPLILNQE LVQIISNKKI DGLYTYPMAA TNFCNDHISQ NSFLRRFNPD YFVTGRINQN NAVNTLYIEL TDAKNLFLIA SNHLPVDELH NTSQFIIENI LQTVHNPQRS VRLTKQDQGY KNHYLSDEML AGKKELYEFT PESIYRAMTI FDGLQNKSDI QTLKTECYCL LAECHMSLAL HGKSELELAA QKALELLDYV SDITTVDGKI LAIMGLITGL SGQAKVSHIL FEQAKIHSTD IASLYYYRAL VNFHNEKIEE ARICIDKSLQ LEPRRRKAVV IKECVDMYVP NPLKKNMKLY YKETESGSHR VIIDNILKLK QLTRICMR
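
The relationship between the gene and protein sequences can be illustrated with a minimal sketch. It strips the 10-base spacing used in this record and translates codon by codon. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation; the alternative start codons that table 11 permits are not handled here, on the assumption that the CDS begins with ATG as this one does. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for elongation (alternative starts are not handled here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
fragment = "ATGGACTTAG AAAATAAATT CTCATATCAT"
print(translate(fragment))  # MDLENKFSYH
```

The translated fragment matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same functions would be the natural way to compare against the complete protein and the GC content field.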