Gene Information |
Locus tag | Msed_1898 |
Symbol | |
ID | 5103285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1841745 |
End bp | 1842803 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507785 |
Product | NADH dehydrogenase subunit H |
Protein accession | YP_001191962 |
Protein GI | 146304646 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
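The coordinate and length fields above are mutually consistent, and a short sketch can confirm that. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not count toward the protein length. The variable names are illustrative and are not part of the database.

```python
# Values copied from the "Start bp" and "End bp" rows above.
start_bp = 1841745
end_bp = 1842803

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1059 0 352
```

Under those assumptions the result matches the listed gene length of 1059 bp and protein length of 352 aa.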
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0180885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.286914 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTC TATTCGATAT CAGATATTAT ATTCTCTACC CATCATTTTT CGCTCCAATA ATTCTTCCAG GGTTAATTTT CACTGCTATA CTCCTCCTTA CAACTATCTG GTTTGAGAGA AAGGCTGCTG CGAGGGTTCA AATGAGGATT GGTCCCTATT ACGCTTCCAA GAGACTGGGA GGGTACCTTC AGTTGGTCGC TGATGCTCTG AAATTCGTGT TCTCAGAGGT TATAGTGCCC GAGGGAGTTA ACCCAACCCT GTTCGCGCTG ACGCCAGTGC TCGTTGTGGC CATGTCCTTT CTCCCCCTTG CAGTGATACC AGTGTCAGTG ATCCCACCTT CTGGTTCAAT CTTCTCGATC TATTTCCACG ACTTCTACGA TCCCAACGTG GGGTTAGGAG TTTTGGTGGG TCTATTTACC CAGTATAACA TGCTATTAAT CCTGGCAATA GAGTCCATTT ATCCAGCGAT GATCATCCTA ATGGCCTGGA GTACCAACAA TAGGTTTGCC ATAGTTGGGG CAGTCAGAGA ATCCTACCTT TCCGTGTCCT ATGACGTGCT TTTGCTCATG TCCACCATAT CCATGGCCCT GGAGTATCAT ACGCTGGATC TAGTGAAGAT AGTTCAGACG GGGGTACCCG GAATTCTTGC CAATCCTCTT GCAGCAGTGA CCTTCTTCAT TGCAATGATA ATCGGTAGTG CGAGGTTCCC CTTTGATATA GCTGAGGCCG ATACTGAGCT CGTTCTTGGA CCAGCGACGG AGTACAGCGG TCTCCTCTTT GTGTTAACCA TGGCAGGCTC CTACGTGGGG AACTTTGTGT ACGCCCTGGT GTTTACTGAC ATGTTCCTAT GGGGCTGGTA CCCGCTTTCA GGATTCCCAG GAGCCCTTCT CACGGTTATT AAGGCTTCAA TTCTCGTGTT TTTCTCGGTG TTCCTCAGGT CAGTCTACGG GAGGTATAGA TTGGATCAGG CCCTTAGGGG AAGCTGGAAA TATATATTCC CCTTGGCCAT AGCCTCCCTA TTTCTAGGTT TAGTGGTGGG TTACCTATGG ATTCAGTAA
Protein sequence | MNILFDIRYY ILYPSFFAPI ILPGLIFTAI LLLTTIWFER KAAARVQMRI GPYYASKRLG GYLQLVADAL KFVFSEVIVP EGVNPTLFAL TPVLVVAMSF LPLAVIPVSV IPPSGSIFSI YFHDFYDPNV GLGVLVGLFT QYNMLLILAI ESIYPAMIIL MAWSTNNRFA IVGAVRESYL SVSYDVLLLM STISMALEYH TLDLVKIVQT GVPGILANPL AAVTFFIAMI IGSARFPFDI AEADTELVLG PATEYSGLLF VLTMAGSYVG NFVYALVFTD MFLWGWYPLS GFPGALLTVI KASILVFFSV FLRSVYGRYR LDQALRGSWK YIFPLAIASL FLGLVVGYLW IQ
| |
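The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch for removing that grouping and computing a GC fraction. The function names `ungroup` and `gc_fraction` are hypothetical helpers, and only the first three blocks of the gene sequence are used as sample input. How the database itself derives its GC content figure is not stated in this record, so its rounding may differ from this sketch.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace used for block grouping and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three ten-base blocks of the gene sequence listed above.
sample = "ATGAATATTC TATTCGATAT CAGATATTAT"
print(len(ungroup(sample)), gc_fraction(sample))
```

Passing the full gene sequence string to `gc_fraction` in place of `sample` gives a value to compare with the listed GC content.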