Gene Msed_1241 details

Gene Information

Locus tag: Msed_1241
Symbol:
ID: 5104156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1219394
End bp: 1220740
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 52%
IMG OID: 640507132
Product: mercuric reductase
Protein accession: YP_001191325
Protein GI: 146304009
COG category: [C] Energy production and conversion
COG ID: [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes
TIGRFAM ID: [TIGR02053] mercuric reductase
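
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the Protein Length excludes the terminal stop codon; neither convention is stated in the record. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1219394
END_BP = 1220740


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")
```

Under those assumptions the results agree with the Gene Length (1347 bp) and Protein Length (448 aa) fields.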


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.446801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAAGC TGGCCATAAT CGGGTATGGG GCGGCCGGAT TTGCTGCAAT GATTAAGGCC 
AACGAGTTGG GAGTTAAACC TGTTCTAATA GGGAAGGGGG AAATAGGAGG AACTTGCGTT
AACGTAGGCT GTGTCCCTTC TAAGAGGATG TTATACATCG CAGAAATCTA CAAGAAAGCT
AGGGAGGTCA CGGGAAGCGA GGTATATCCG CCCTTCAGTT CCTTTCAGGA GAAAGATGGT
CTAGTTCAGG AGATGAGGAA GACGAAGTAT GAAGACCTCC TCTCATACTA TGACGTGGAA
CTGATTCAGG GCGAGGCCAG GTTCATCTCT CCCCACGCGG TAAAGGTGAA TGGCCAGGTA
ATTGAGGCCG AGAAATTCGT GATTGCAACT GGGTCATCTC CCCTAATCCC AAGGATACCT
GGCCTGGACA AGGTTGGGTT CTGGACCAAC AGAGAGGCCC TGTCCCCGGA CAGGAGAATT
GACTCCCTTG CAGTGATAGG TGGAAGGGCC CTGGCCCTCG AGTTCGCACA GATGTACTCT
AGGATGAAGG TTGAGGTCGC AATCCTTCAG AGGAGTCCCG TACTGATCCC AGACTGGGAG
CCTGAGGCCT CAGTAGAGGC AAGGAGGATC ATGGAAAATG ACGGAGTTGC TGTAGTCACA
GGTGTGAACG TCAAGGAGGT CAGGAAAGGG GCAGGAAAGA TCGTGATAAC GGACAAGGGC
GAAGTTGAGG CTGACGAGAT ACTCCTGGCC ACGGGGAGGA AGCCCAACGT TGACCTAGGA
CTTGAGAATG CTGGTGTTCG CCTAAATGAG AGGGGAGGAA TTAAGGTGGA CGACGAACTG
AGGACTGACA ACCCACACAT ATACGCGGCA GGGGACGTCT TGGGAGGTAA AATGTTGGAG
GCCCTAGCAG GTAGGCAAGG TTCAATTGCC ACAGAGAACG CGTTAACAGG TTCGCACAAA
AGAGTGGACG AGAACGCGGT CCCTCAGGTC ATCTTTACTC AGCCCAACCT TGCGAGGGTT
GGTCTCACGG AGGCTGAAGC TAGAGCTAAG GAAGGTGAAG TTGAGGCAAG GGTTCTTCCC
ATGAGCTCAG TGGCAAAGGC TGAGATCATC AACTCTAGAC TGGGTTTCGT CAAGATGGTA
ACCATGAACG GGAGGATCGT TGGAGTTCAT GCCGTCGGAG AGAACGTCGC GGAGATGATC
GGTGAAGCTG CTCTTGCGAT AAGGTTTGGG GCCACGGTGC ACGACCTCAT AGATACAGTG
CACATGTTTC CAACAATCGC CGAATCCCTC AGGTTAGTTG CACTGGCATT TAGGTCTGAC
GTGAGTAGGT TAAGCTGTTG TGTGTGA
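
A GC content figure such as the one listed under Gene Information can be recomputed from the sequence once the 10-base grouping spaces and line breaks are removed. The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the gene sequence is used as sample input, so its value need not equal the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence above (60 bases).
first_line = "ATGCATAAGC TGGCCATAAT CGGGTATGGG GCGGCCGGAT TTGCTGCAAT GATTAAGGCC"
sample = clean_sequence(first_line)
print(len(sample), round(gc_content(sample), 1))
```

Pasting the complete gene sequence into `clean_sequence` gives the input needed to check the reported whole-gene value.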
 
Protein sequence
MHKLAIIGYG AAGFAAMIKA NELGVKPVLI GKGEIGGTCV NVGCVPSKRM LYIAEIYKKA 
REVTGSEVYP PFSSFQEKDG LVQEMRKTKY EDLLSYYDVE LIQGEARFIS PHAVKVNGQV
IEAEKFVIAT GSSPLIPRIP GLDKVGFWTN REALSPDRRI DSLAVIGGRA LALEFAQMYS
RMKVEVAILQ RSPVLIPDWE PEASVEARRI MENDGVAVVT GVNVKEVRKG AGKIVITDKG
EVEADEILLA TGRKPNVDLG LENAGVRLNE RGGIKVDDEL RTDNPHIYAA GDVLGGKMLE
ALAGRQGSIA TENALTGSHK RVDENAVPQV IFTQPNLARV GLTEAEARAK EGEVEARVLP
MSSVAKAEII NSRLGFVKMV TMNGRIVGVH AVGENVAEMI GEAALAIRFG ATVHDLIDTV
HMFPTIAESL RLVALAFRSD VSRLSCCV
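
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G), whose amino acid assignments are the same for translation table 11 as for the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The `translate` helper is illustrative, and only the first and last display lines of the gene sequence are used as input (both start in frame, since every full line holds 60 bases).

```python
from itertools import product

# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


first_line = "ATGCATAAGC TGGCCATAAT CGGGTATGGG GCGGCCGGAT TTGCTGCAAT GATTAAGGCC"
last_line = "GTGAGTAGGT TAAGCTGTTG TGTGTGA"
print(translate(first_line))
print(translate(last_line))
```

The two outputs can be compared with the first 20 and the last 8 residues of the protein sequence listed above; the trailing `*` is the TGA stop codon, which is not part of the protein.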