Gene Msed_1429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1429 
Symbol 
ID5104799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1399327 
End bp1400433 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content47% 
IMG OID640507317 
ProductNADH-ubiquinone oxidoreductase, chain 49kDa 
Protein accessionYP_001191510 
Protein GI146304194 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.519726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTTA TAGGAGATAT AGGGCCTTAC TGCCTGACCT CAGACGGGCT CAGAGAAGGA 
CCGTGCATTA AGGACCAGAG CGAAGAGGAG AGTCTGGGCT ATGGTTCATT CAAGTTCGTT
TATGGCCCGT CAGCTGGGGG TCTACTGGAG TCAGTGGGAT TTGAAATTAC AACTTATGGA
GAAAGCATAG AGAAAATAAA TCATTTACCA TATAAGGGTA GACAAATAAC TCTCTCCGGG
CTGACAATTG GAGATGCCCT CCTTAGAGTT GAGAGAATTA ACGGAGCGTT CACCGCATCT
CATTCCATCT CTTTTCTTCA GGCAATTGAG AGTGCCTTAG AGATAGAGGT GCCTCACGAC
GTTGTTATTT CTAGGATGGC TCAGCTCGAG CTTGAAAGGA TAAGGAATAA TCTACTCGTG
ATTCAGAGGG TATTGGAATC GGCATCGTTC CTGGTTCCAT CATTTCTTCT TCTCCAGCAG
ATAGAGGAGG TTAACAGGGC CATAGCGAGG TCTTGCGGTC ATAGATATTT CTTTGGGGCT
AACTACCCAG GAGGAGTTAG ATGTGAGCTT AAGCTTCCGT CTCTTAAGAT TTCGGACATA
GAGAGAACGC TAGAGAACAG GATCTTCATT GACAGACTCC AAGGCAATGG AGTGGTTAAG
GATAGCTTCT CCATTGGGCC TGTCGCCAGG GCCTCTGGGT TCAAGTACGA CGCAAGGCTA
GACTCGGATT TTCTAGCTTA CAGGAACTTT GACCTGAGAA TCCCTACTCA GGATCAGGGA
GATGCCTTCT CCAGGATCCT AGTTCGTCTG GAGGAGATTA AGGAATCGCT TAGGCTACTC
CAGGAGCTCA AGGTAAAACC CTGTAGTTTC ACCATGAAGA TAAGGGATGG AGAGGGAATA
GGAAGAGTTG AGAGCCCATC TGGAGATCTG GCTTACCTCA CAAGGGTGAG GAGTGGCCAC
GTGGAGAGAG CATACCTTTT AGCTCCATCA AAGGTAAACA TGAGACTTTT CCTCAAGTCC
ATGCCTGGAA ATATCTTCAC TGACTTTCCC TTCAACTGGG AAAGTTTCGG GATCTGGATA
TCTGAGCTCG AGGTCGATCT GGAATGA
 
Protein sequence
MRVIGDIGPY CLTSDGLREG PCIKDQSEEE SLGYGSFKFV YGPSAGGLLE SVGFEITTYG 
ESIEKINHLP YKGRQITLSG LTIGDALLRV ERINGAFTAS HSISFLQAIE SALEIEVPHD
VVISRMAQLE LERIRNNLLV IQRVLESASF LVPSFLLLQQ IEEVNRAIAR SCGHRYFFGA
NYPGGVRCEL KLPSLKISDI ERTLENRIFI DRLQGNGVVK DSFSIGPVAR ASGFKYDARL
DSDFLAYRNF DLRIPTQDQG DAFSRILVRL EEIKESLRLL QELKVKPCSF TMKIRDGEGI
GRVESPSGDL AYLTRVRSGH VERAYLLAPS KVNMRLFLKS MPGNIFTDFP FNWESFGIWI
SELEVDLE