Gene Elen_3074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_3074 
Symbol 
ID8417409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3574207 
End bp3575229 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content69% 
IMG OID645026054 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_003183406 
Protein GI257792800 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000468886 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACCA CGGTCATGCT GGGGCACGGC AGCGGCGGGA CGATGATGAA GCGCATCATC 
GACGATGTGT TCTTCGCCGC GTACGCCGGC GACGAGCTGC TGCGCGGCGA CGACGCGGCG
GTGCTGCCCG CGCCCGCTCC GGGCGAGCGG CTGGCGTTCT CCACCGACAG CTTCGTGGTG
ACGCCGCATT TCTTCCCGGG CGGCGACATC GGACGCCTCG CCGTGTGCGG CACGGTGAAC
GACGTGGCCA CGAGCGGCGC CGTGCCGCGC TACCTCAGCT GCGGCTTCGT GCTGGAGGAG
GGCTTCCCCA TTGAGGATCT CAAGCGCATC TGCGCCTCCA TGGCGGAATG CGCGCAGGAG
GCCGGCGTGC ATCTGGTCAC CGGCGACACG AAAGTGGTGA ACCGCGGCCA CGGCGACGGC
GTGTACATCA ACACGAGCGG CGTGGGCACC ATTCCCGAAG GCGTGAACCT GGGTGGCGCG
CAGTGCAAGC CGGGCGACAA AGTGCTGGTC ACCGGCACGC TGGGTGATCA CGGCATCACC
ATCATGAGCT GCCGCGAGAG CTTGAGCTTC TCGGCCGATC TGGAAAGCGA CGCGGCCCCG
CTCAACCACC TCATCGCCGA GGTGTTGGCG GCGGCGCCGA ACACGCGCTG CTTCCGCGAC
CCGACGCGCG GCGGCCTGGC CTCCACGCTG AACGAGCTGG CTGCCCAGTC GAACACGGAC
ATCACGGTGG AGGAAGACGC CATCCCCGTG AAGCCGGCCG TGCAGGGCGC GTGCGAGATG
CTGGGCTACG ACGTGCTGCA GGTGGCGAAC GAGGGCAAGA TGGTGTGCGT TGTGGCGGCC
GAGGAGGCCG ACGCAGCGCT CGCGGCCATG CGCGCGAACC GGTACGGCGC CGATGCGGCC
ATCATCGGCG AGGTGTCGGC CGCCCGTCCC GAGCGCGGCT CCAAGGTGTT CCTGCGCACG
GCGTTCGGCG GTACGCGCAT CCTCGACATG CTGGTGGGCG AGCAATTGCC GCGCATTTGC
TAG
 
Protein sequence
MDTTVMLGHG SGGTMMKRII DDVFFAAYAG DELLRGDDAA VLPAPAPGER LAFSTDSFVV 
TPHFFPGGDI GRLAVCGTVN DVATSGAVPR YLSCGFVLEE GFPIEDLKRI CASMAECAQE
AGVHLVTGDT KVVNRGHGDG VYINTSGVGT IPEGVNLGGA QCKPGDKVLV TGTLGDHGIT
IMSCRESLSF SADLESDAAP LNHLIAEVLA AAPNTRCFRD PTRGGLASTL NELAAQSNTD
ITVEEDAIPV KPAVQGACEM LGYDVLQVAN EGKMVCVVAA EEADAALAAM RANRYGADAA
IIGEVSAARP ERGSKVFLRT AFGGTRILDM LVGEQLPRIC