Gene Msed_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1898 
Symbol 
ID5103285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1841745 
End bp1842803 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content48% 
IMG OID640507785 
ProductNADH dehydrogenase subunit H 
Protein accessionYP_001191962 
Protein GI146304646 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0180885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.286914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTC TATTCGATAT CAGATATTAT ATTCTCTACC CATCATTTTT CGCTCCAATA 
ATTCTTCCAG GGTTAATTTT CACTGCTATA CTCCTCCTTA CAACTATCTG GTTTGAGAGA
AAGGCTGCTG CGAGGGTTCA AATGAGGATT GGTCCCTATT ACGCTTCCAA GAGACTGGGA
GGGTACCTTC AGTTGGTCGC TGATGCTCTG AAATTCGTGT TCTCAGAGGT TATAGTGCCC
GAGGGAGTTA ACCCAACCCT GTTCGCGCTG ACGCCAGTGC TCGTTGTGGC CATGTCCTTT
CTCCCCCTTG CAGTGATACC AGTGTCAGTG ATCCCACCTT CTGGTTCAAT CTTCTCGATC
TATTTCCACG ACTTCTACGA TCCCAACGTG GGGTTAGGAG TTTTGGTGGG TCTATTTACC
CAGTATAACA TGCTATTAAT CCTGGCAATA GAGTCCATTT ATCCAGCGAT GATCATCCTA
ATGGCCTGGA GTACCAACAA TAGGTTTGCC ATAGTTGGGG CAGTCAGAGA ATCCTACCTT
TCCGTGTCCT ATGACGTGCT TTTGCTCATG TCCACCATAT CCATGGCCCT GGAGTATCAT
ACGCTGGATC TAGTGAAGAT AGTTCAGACG GGGGTACCCG GAATTCTTGC CAATCCTCTT
GCAGCAGTGA CCTTCTTCAT TGCAATGATA ATCGGTAGTG CGAGGTTCCC CTTTGATATA
GCTGAGGCCG ATACTGAGCT CGTTCTTGGA CCAGCGACGG AGTACAGCGG TCTCCTCTTT
GTGTTAACCA TGGCAGGCTC CTACGTGGGG AACTTTGTGT ACGCCCTGGT GTTTACTGAC
ATGTTCCTAT GGGGCTGGTA CCCGCTTTCA GGATTCCCAG GAGCCCTTCT CACGGTTATT
AAGGCTTCAA TTCTCGTGTT TTTCTCGGTG TTCCTCAGGT CAGTCTACGG GAGGTATAGA
TTGGATCAGG CCCTTAGGGG AAGCTGGAAA TATATATTCC CCTTGGCCAT AGCCTCCCTA
TTTCTAGGTT TAGTGGTGGG TTACCTATGG ATTCAGTAA
 
Protein sequence
MNILFDIRYY ILYPSFFAPI ILPGLIFTAI LLLTTIWFER KAAARVQMRI GPYYASKRLG 
GYLQLVADAL KFVFSEVIVP EGVNPTLFAL TPVLVVAMSF LPLAVIPVSV IPPSGSIFSI
YFHDFYDPNV GLGVLVGLFT QYNMLLILAI ESIYPAMIIL MAWSTNNRFA IVGAVRESYL
SVSYDVLLLM STISMALEYH TLDLVKIVQT GVPGILANPL AAVTFFIAMI IGSARFPFDI
AEADTELVLG PATEYSGLLF VLTMAGSYVG NFVYALVFTD MFLWGWYPLS GFPGALLTVI
KASILVFFSV FLRSVYGRYR LDQALRGSWK YIFPLAIASL FLGLVVGYLW IQ