Gene Msed_1897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1897 
Symbol 
ID5103284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1840528 
End bp1841745 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content49% 
IMG OID640507784 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_001191961 
Protein GI146304645 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.412503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.283556 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAAGTC AAGTTGATGA TTTAATGAAA AGTGGCGGGA TGGAGCTGGA CATAGTCCCC 
GTCACTGGAG AACTCAATGT GGGGCCTCAA CATCCAGGAT CAGGGCACAT GAGAATTTTC
GTTAAGCTCA ATGGAGATAT TATAGAGGAC GCGGAAATTG ACCCTGGGTA CGTGCACAGG
GCTGTTGAGA AACTGGGAGA GAACAGGAAC TACATTCATT TGATCCCGCT CGTGGAAAGG
CCTGCAATCC TTGATTCAGT TAATATGAAC CTTGGATACA TCCTCGCGGT CGAGAAGATA
CTTAACGTTG ACGTTCCGGA GAGGGCACAA TACCTGAGGA GCTTTGCCGC AGAGATAAGC
AGGATAGCAA GTCACCTGTA CGGCATGGGT ATCCTGGCCA TCTTCATTGG GCATTCCACA
GGTTTCATGT GGGGGTTCGG CGATAGGGAA GTTTGGGTTC AGATTCTTGA GGCGCTGACG
GGTGCAAGGA TAACTAACTC CTACATCCTT CCAGGGGGAG TCAGAAGGGA TCTGACCCCA
GCCGTCATAG AGATGACCAA GAAGGCCATT GCATACCAAC GTAAGAAGAT GAAGGATTGG
GAGAGGATTT TCCTAAATAA CCCCAACATT AAGGCAAGGT TACAGGACGT TGGTGTTATG
ACACGGGAAC AGGCTATTGA GTGGGGAGCT GCTGGACCAA ACCTCAGAGG CTCTGGGGTT
TACTACGACG CTAGAAAGGC CGAGCCCTAT GGCGCTTATT CCAAGCTAGA CTTTGAAATA
CCGGTTTATA AGGAAGGAGA TGGTTACGCA AGGACCCTGG TCAGGTTTGA GGAAATTGAA
CAGAGCCTGA GAATACTGGA GCAGGTGATA AAGGATATTC CTGAAGGACA AATCCTGAGC
GATAGGTTCT TCAAGCAAAT TCCGCCCACC AGGTTGAAAA AGTGGTGGGA AGGACAAAAG
AGGATCGTTA TGCCAGGTTA TTACGCCTCC TTCAGGCCGC CTAAGGGTGA GGCGATTTCA
AGGGTTGAGG CAGGGAGAGG CGAACTCGTG TATTACGTGG TGAGCGACGG CTCAGCTAAA
CCCTATAGGT TGAGGATGAT AACTCCGTCA TACAGGTCAA TATACGTGAT GAAGAATCTC
CTGAAAGGGG CTAGATACGC GGATCTGGTC TCGATTTACG GTAGCCTAGA CTATTTCCCA
CCGGAGGCTG ATAGATAA
 
Protein sequence
MTSQVDDLMK SGGMELDIVP VTGELNVGPQ HPGSGHMRIF VKLNGDIIED AEIDPGYVHR 
AVEKLGENRN YIHLIPLVER PAILDSVNMN LGYILAVEKI LNVDVPERAQ YLRSFAAEIS
RIASHLYGMG ILAIFIGHST GFMWGFGDRE VWVQILEALT GARITNSYIL PGGVRRDLTP
AVIEMTKKAI AYQRKKMKDW ERIFLNNPNI KARLQDVGVM TREQAIEWGA AGPNLRGSGV
YYDARKAEPY GAYSKLDFEI PVYKEGDGYA RTLVRFEEIE QSLRILEQVI KDIPEGQILS
DRFFKQIPPT RLKKWWEGQK RIVMPGYYAS FRPPKGEAIS RVEAGRGELV YYVVSDGSAK
PYRLRMITPS YRSIYVMKNL LKGARYADLV SIYGSLDYFP PEADR