Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1897 |
Symbol | |
ID | 5103284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1840528 |
End bp | 1841745 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507784 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_001191961 |
Protein GI | 146304645 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.412503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.283556 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACAAGTC AAGTTGATGA TTTAATGAAA AGTGGCGGGA TGGAGCTGGA CATAGTCCCC GTCACTGGAG AACTCAATGT GGGGCCTCAA CATCCAGGAT CAGGGCACAT GAGAATTTTC GTTAAGCTCA ATGGAGATAT TATAGAGGAC GCGGAAATTG ACCCTGGGTA CGTGCACAGG GCTGTTGAGA AACTGGGAGA GAACAGGAAC TACATTCATT TGATCCCGCT CGTGGAAAGG CCTGCAATCC TTGATTCAGT TAATATGAAC CTTGGATACA TCCTCGCGGT CGAGAAGATA CTTAACGTTG ACGTTCCGGA GAGGGCACAA TACCTGAGGA GCTTTGCCGC AGAGATAAGC AGGATAGCAA GTCACCTGTA CGGCATGGGT ATCCTGGCCA TCTTCATTGG GCATTCCACA GGTTTCATGT GGGGGTTCGG CGATAGGGAA GTTTGGGTTC AGATTCTTGA GGCGCTGACG GGTGCAAGGA TAACTAACTC CTACATCCTT CCAGGGGGAG TCAGAAGGGA TCTGACCCCA GCCGTCATAG AGATGACCAA GAAGGCCATT GCATACCAAC GTAAGAAGAT GAAGGATTGG GAGAGGATTT TCCTAAATAA CCCCAACATT AAGGCAAGGT TACAGGACGT TGGTGTTATG ACACGGGAAC AGGCTATTGA GTGGGGAGCT GCTGGACCAA ACCTCAGAGG CTCTGGGGTT TACTACGACG CTAGAAAGGC CGAGCCCTAT GGCGCTTATT CCAAGCTAGA CTTTGAAATA CCGGTTTATA AGGAAGGAGA TGGTTACGCA AGGACCCTGG TCAGGTTTGA GGAAATTGAA CAGAGCCTGA GAATACTGGA GCAGGTGATA AAGGATATTC CTGAAGGACA AATCCTGAGC GATAGGTTCT TCAAGCAAAT TCCGCCCACC AGGTTGAAAA AGTGGTGGGA AGGACAAAAG AGGATCGTTA TGCCAGGTTA TTACGCCTCC TTCAGGCCGC CTAAGGGTGA GGCGATTTCA AGGGTTGAGG CAGGGAGAGG CGAACTCGTG TATTACGTGG TGAGCGACGG CTCAGCTAAA CCCTATAGGT TGAGGATGAT AACTCCGTCA TACAGGTCAA TATACGTGAT GAAGAATCTC CTGAAAGGGG CTAGATACGC GGATCTGGTC TCGATTTACG GTAGCCTAGA CTATTTCCCA CCGGAGGCTG ATAGATAA
|
Protein sequence | MTSQVDDLMK SGGMELDIVP VTGELNVGPQ HPGSGHMRIF VKLNGDIIED AEIDPGYVHR AVEKLGENRN YIHLIPLVER PAILDSVNMN LGYILAVEKI LNVDVPERAQ YLRSFAAEIS RIASHLYGMG ILAIFIGHST GFMWGFGDRE VWVQILEALT GARITNSYIL PGGVRRDLTP AVIEMTKKAI AYQRKKMKDW ERIFLNNPNI KARLQDVGVM TREQAIEWGA AGPNLRGSGV YYDARKAEPY GAYSKLDFEI PVYKEGDGYA RTLVRFEEIE QSLRILEQVI KDIPEGQILS DRFFKQIPPT RLKKWWEGQK RIVMPGYYAS FRPPKGEAIS RVEAGRGELV YYVVSDGSAK PYRLRMITPS YRSIYVMKNL LKGARYADLV SIYGSLDYFP PEADR
|
| |