Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2855 |
Symbol | hypE |
ID | 6145066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2929146 |
End bp | 2930156 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641617724 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_001744879 |
Protein GI | 170681565 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.431476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATAATA TCCAACTCGC CCACGGTAGC GGCGGCCAGG CGATGCAGCA ATTAATCAAC AGCCTGTTTA TGGAAGCCTT TGCCAACCCG TGGCTGGCAG AGCAGGAAGA TCAGGCACGT CTTGATCTGG CGCAGCTGGT AGCGGAAGGC GACCGTCTGG CGTTCTCCAC CGACAGTTAC GTTATTGACC CGCTGTTCTT CCCTGGCGGT AATATCGGCA AGCTGGCGAT TTGCGGCACC GCGAATGACG TTGCGGTCAG TGGCGCTATT CCGCGCTATC TCTCCTGTGG CTTTATCCTC GAAGAAGGAT TGCCGATGGA GACACTGAAA GCCGTAGTGA CCAGCATGGC AGAAACCGCC CGCGCGGCAG GCATTGCCAT CGTTACTGGC GACACTAAAG TGGTGCAGCG CGGCGCGGCA GATAAACTGT TTATCAACAC CGCTGGCATG GGCGCAATTC CGGCGAATAT TCACTGGGGC GCACAAACGC TAACCGCAGG CGATGTTCTG CTGGTGAGTG GAACTCTCGG CGACCACGGG GCGACTATCC TTAACCTGCG TGAGCAGCTG GGGCTGGATG GCGAACTGGT CAGCGACTGT GCGGTACTAA CGCCGCTTAT TCAGACGCTG CGTGACATTC CCGGCGTGAA AGCGCTGCGT GATGCCACCC GTGGTGGTGT AAACGCGGTG GTACATGAGT TCGCGGCAGC CTGCGGTTGT GGTATTGAAC TTTCAGAAGC GGCACTGCCG GTTAAACCTG CTGTGCGTGG CGTTTGCGAG CTGCTGGGAC TGGACGCCCT TAACTTTGCC AACGAAGGCA AACTGGTGAT CGCCGTTGAA CGCAACGCGG CAGAGCAAGT GCTGGCAGCG TTACATTCCC ATCCACTGGG GAAAGACGCG GCGCTGATTG GTGAAGTGGT GGAACGTAAA GGTGTTCGTC TTGCCGGTCT GTATGGCGTG AAACGAACCC TCGATTTACC ACACGCCGAA CCGCTTCCGC GTATATGCTA A
|
Protein sequence | MNNIQLAHGS GGQAMQQLIN SLFMEAFANP WLAEQEDQAR LDLAQLVAEG DRLAFSTDSY VIDPLFFPGG NIGKLAICGT ANDVAVSGAI PRYLSCGFIL EEGLPMETLK AVVTSMAETA RAAGIAIVTG DTKVVQRGAA DKLFINTAGM GAIPANIHWG AQTLTAGDVL LVSGTLGDHG ATILNLREQL GLDGELVSDC AVLTPLIQTL RDIPGVKALR DATRGGVNAV VHEFAAACGC GIELSEAALP VKPAVRGVCE LLGLDALNFA NEGKLVIAVE RNAAEQVLAA LHSHPLGKDA ALIGEVVERK GVRLAGLYGV KRTLDLPHAE PLPRIC
|
| |