Gene Msed_0933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0933 
Symbol 
ID5104363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp859929 
End bp860987 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content47% 
IMG OID640506836 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001191029 
Protein GI146303713 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.900318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTGC CCGATAGAAA GAACGTAATT ACCCTACTCC ACGGGGCTGG AGGGACGTAC 
ATGCATTCCC TGATAAGGGA CGTCTTTTTG AAGCTAAATG ATGGATTTGG CGAGGTGGGA
CTAGAAATGA TGGACGATGC AGCCGTGGTT AATGGAATAG TGTTCACTAC GGATTCGTTC
GTCATTAGGC CCATTTTCTT CAGAGGTGGA GATATAGGAA GATTAAGCGT GAGTGGCACA
GTAAATGATA TAGCGATGAT GGGGGGAGAT CCTCAGGCCC TGAGTCTTGG AGTAGTATTA
GAGGAAGGAT TCCCCAAGGA TATGTTGGAG AAGATAGTTG AAAGCATTAA GAAGACTGCT
GAGGAAGCGA ACGTCCACGT AGTTACTGGA GACACGAAGG TCATGGAGAG GGGAAACTTA
GATAAAATTG TCATTAACAC CGCCGGAATA GGTACAAGAC CTAGGCAATT GGATCACAAC
ATTGAGACCT TGAGGAAAAG TAGGCAACCT TCCCGCTGGT TAGTTCCCAC TAACCTTAGG
GATGGAGATA AAATTGTGGT CACCGGTACC CTTGGGGATC ATGCCATAGC GGTCCTATCT
TCAAGGGAAG GAGTGGGATT TGAGTCAAAT GTCATGTCCG ATGTTGCCCC TCTCAATAAG
ATGATCATGA ACCTACTGGA AGTAGGAGGA ATAGCTGATG CCAAGGATCC AACGAGAGGA
GGACTGGCAG ATCTGCTTCA GGACTGGTCG GAAAAGTCTG GACTTGGAAT CTTCATAAGG
GAGAGTGACA TCCCAGTGAA GGATGAGGTC AGAGCTGCCG TCGAGTTTCT AGGAATGGAC
GTTCTAGAGT TGGGTAATGA GGGAAAGGCC GTCTTAGCAG TTTCGCCTGA ATATGTTAAG
GACGTTATGG ACGCGTTACA TTCAGATCCG CTTGGGAAGG ACGCAACAAT AATAGGAGAG
GTCAGAAAGG ATTTAGAGGG GGTAATAATG GAGACTGTGG TCGGGGGAAA CAGGTATGTT
GGAAGACCCT TAGGGGATCC AGTTCCTAGA ATATGCTAG
 
Protein sequence
MELPDRKNVI TLLHGAGGTY MHSLIRDVFL KLNDGFGEVG LEMMDDAAVV NGIVFTTDSF 
VIRPIFFRGG DIGRLSVSGT VNDIAMMGGD PQALSLGVVL EEGFPKDMLE KIVESIKKTA
EEANVHVVTG DTKVMERGNL DKIVINTAGI GTRPRQLDHN IETLRKSRQP SRWLVPTNLR
DGDKIVVTGT LGDHAIAVLS SREGVGFESN VMSDVAPLNK MIMNLLEVGG IADAKDPTRG
GLADLLQDWS EKSGLGIFIR ESDIPVKDEV RAAVEFLGMD VLELGNEGKA VLAVSPEYVK
DVMDALHSDP LGKDATIIGE VRKDLEGVIM ETVVGGNRYV GRPLGDPVPR IC