Gene Information |
Locus tag | ECH74115_3982 |
Symbol | hypE |
ID | 6967485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3681383 |
End bp | 3682393 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643387751 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_002272194 |
Protein GI | 209396976 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
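The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported 1011 bp) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(3681383, 3682393)
print(gene_len)                  # 1011
print(protein_length(gene_len))  # 336
```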
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATAATA TCCAACTCGC CCACGGTAGC GGCGGCCAGG CGATGCAGCA ATTAATCAAC AGCCTGTTTA TGGAAGCCTT TGCCAACCCG TGGCTGGCAG AGCAGGAAGA TCAGGCACGT CTTGATCTGG CGCAGCTGGT AGCGGAAGGC GACCGTCTGG CGTTCTCCAC CGACAGTTAC GTTATTGACC CGCTGTTCTT CCCTGGCGGT AATATCGGCA AGCTGGCGAT TTGCGGCACC GCGAATGACG TTGCGGTCAG TGGCGCTATT CCGCGCTATC TCTCCTGTGG CTTTATCCTC GAAGAAGGAT TGCCGATGGA GACACTGAAA GCCGTAGTGA CCAGCATGGC AGAAACCGCC CGCACGGCAG GCATTGCCAT CGTTACTGGC GATACTAAAG TGGTGCAGCG CGGCGCGGCA GATAAACTGT TTATCAACAC CGCGGGCATG GGCGCAATTC CGACGAATAT TCACTGGGGC GCACAGACGC TAACCGCAGG CGATATATTG CTGGTTAGCG GTACACTCGG CGACCACGGG GCGACTATCC TTAACCTGCG TGAGCAGCTG GGGCTGGATG GCGAACTGGT CAGCGACTGC GCGGTGCTGA CGCCGCTTAT TCAGACGCTG CGTGACATTC CCGGCGTGAA AGCGCTGCGT GATGCCACCC GTGGTGGTGT AAACGCGGTG GTTCATGAGT TCGCGGCAGC CTGCGGTTGC GGTATTGAAA TTTCTGAATC AGCGCTGCCG GTTAAACCTG CCGTGCGCGG CGTTTGCGAA TTGCTGGGAC TGGACGCCCT GAACTTTGCC AACGAAGGCA AACTGGTGAT CGCCGTTGAA CGCAACGCGG CAGAGCAAGT GCTGGCAGCG TTACATTCCC ATCCACTGGG GAAAGACGCG GCGCTGATTG GTGAAGTGGT GGAACGTAAA GGTGTTCGTC TTGCCGGTCT GTATGGCGTG AAACGAACCC TCGATTTACC ACACGCCGAA CCGCTTCCGC GTATATGCTA A
|
Protein sequence | MNNIQLAHGS GGQAMQQLIN SLFMEAFANP WLAEQEDQAR LDLAQLVAEG DRLAFSTDSY VIDPLFFPGG NIGKLAICGT ANDVAVSGAI PRYLSCGFIL EEGLPMETLK AVVTSMAETA RTAGIAIVTG DTKVVQRGAA DKLFINTAGM GAIPTNIHWG AQTLTAGDIL LVSGTLGDHG ATILNLREQL GLDGELVSDC AVLTPLIQTL RDIPGVKALR DATRGGVNAV VHEFAAACGC GIEISESALP VKPAVRGVCE LLGLDALNFA NEGKLVIAVE RNAAEQVLAA LHSHPLGKDA ALIGEVVERK GVRLAGLYGV KRTLDLPHAE PLPRIC
|
| |
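The protein sequence can be reproduced from the gene sequence with translation table 11, as a rough sketch. The codon table below is the standard amino acid assignment, which table 11 shares. The set of start codons is my understanding of the NCBI table 11 initiators and should be treated as an assumption. The gene begins with GTG, which encodes valine internally but is read as methionine at the start position. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An alternative start codon is translated as methionine.
    if codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop the terminal stop codon, if present.
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGAATAATA TCCAACTCGC CCACGGTAGC"))  # MNNIQLAHGS
```

Passing the full gene sequence field to `translate_cds` should yield a string that can be compared against the protein sequence field once the spaces are removed from both.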