Gene Tpen_0180 details

Gene Information

Locus tag: Tpen_0180
Symbol:
ID: 4600882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 154552
End bp: 155925
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 55%
IMG OID: 639772934
Product: hydrogenase 4 subunit D
Protein accession: YP_919593
Protein GI: 119719098
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit
TIGRFAM ID:
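
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive positions and that the CDS length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 154552
end_bp = 155925
gene_length_bp = 1374
protein_length_aa = 457


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, the 1374 bp span corresponds to 458 codons, one of which is the stop codon, which agrees with the listed protein length of 457 aa.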


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.210029
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTGCCT TCACGAGGAG AAAGGCCTTC GAAGTAGCGG TAGCAGGCTC CCTGGTCGCT 
CTAGCAGGCG TAGTGTGCAA CGCTGCTCTC TACCTCTCTA GCCAGATGAG ACCTTCCCAG
TACGTACTCG CATACTACAA CGGCTTAGCG CTTGGCTTCG TCAATGATTC CCTGAGTGTT
TTGGTAAGCC TGATGGTAGC TCTTAACGGT CTGGCCACGA TTGTTTTCTC GAAGCACTAC
ATGTCCCCGT CGAACAGAGA ACACCCGGTA AGCCTCGAGG AAACCCCGAG GTACTACGGT
TGGCTCCTTC TCTTCGAGGC ATCAGCTCTA GGCTTTGTCT ACTCGTCGAC CCTTTTGTCG
ATGGTGGCTT TCTTTGAGCT TACAAGCCTG TGCTCGTACG CACTGATAAG CTACTACGAG
GACCCAGAGT CCAGGAGATC AGGCCTGCAA GCCTTCATCG TAACACACAT CGCCGCGCTT
GGTCTATACC TCGCGGCCGG CTTGACCTAT GCGAGCACGG GGAGCGTCGA GGTGGTAGCA
CTCAAAGGGA TGCCCGAGGA GCTGAAGACC ATAGCCGAGC TCCTCCTGCT GGTAGCGGCG
GCCGGAAAGT CGGCCCAGAT ACCCTTCCAT AGGTGGTTGC CGGACGCCAT GGTTGCGCCT
ACACCTGTTT CCGCCTACCT ACATGCGGCG GCAATGGTCA AGCTAGGAGC CTACCTAATG
CTTAGAGTGC TCGTCGACGC GGGATACACG CATACAGTCG CGCTCGCATG CCTAGCTGTG
GGTGTGGTCT CCATGGCTTA CGGGTGTGCG ATGTACTTCC CGCAACTGGA CATGAAAAGA
TTACTGGCTT ACTCGACGAT AACCCAGCTC TCCTACATAT TCATGGGAGC AGGCTTGGCC
TCTCTTGGTT CAGCAATGGC CTTAAAGGGA GCAGAACTCC ACATATTTAC CCACGGCTTC
GCCAAGGAAC TGTTCTTCCT AGTAGCCGGG CTGATATCGT TCTCCGCAGG CACCAGGATG
CTCGACAAGA TAAGCGGGTT GAGAGCGGCA AGAACCGCAG CTGTAGGCTT TACCGTGGCG
GCGCTATCCG TGACGGGCGT ACCGCCCTTC GGACTTTTCT GGAGCAAGAT CCTGCTCATA
CTAGGAGGCT ACTCCGCCGG TAGCATTGTT GCAGCTATCA TCGCTACGGC GATGCTCTGC
GAGTCCATAG TGTGTTTCGC CTGGTTCCTC AGAGTATTCA CGAAATGCGT TGGAGGCGAG
CCCTCGGATG CCGTGAAAAG CATGCAAAGA GAGCCTACAA CAATGAAGAT GACAGTGTAC
TTCCTGGCCT TAATGTCGTT GCTCGCACCG TTCCTGGCTA CGCCATTCCT GTAA
 
Protein sequence
MFAFTRRKAF EVAVAGSLVA LAGVVCNAAL YLSSQMRPSQ YVLAYYNGLA LGFVNDSLSV 
LVSLMVALNG LATIVFSKHY MSPSNREHPV SLEETPRYYG WLLLFEASAL GFVYSSTLLS
MVAFFELTSL CSYALISYYE DPESRRSGLQ AFIVTHIAAL GLYLAAGLTY ASTGSVEVVA
LKGMPEELKT IAELLLLVAA AGKSAQIPFH RWLPDAMVAP TPVSAYLHAA AMVKLGAYLM
LRVLVDAGYT HTVALACLAV GVVSMAYGCA MYFPQLDMKR LLAYSTITQL SYIFMGAGLA
SLGSAMALKG AELHIFTHGF AKELFFLVAG LISFSAGTRM LDKISGLRAA RTAAVGFTVA
ALSVTGVPPF GLFWSKILLI LGGYSAGSIV AAIIATAMLC ESIVCFAWFL RVFTKCVGGE
PSDAVKSMQR EPTTMKMTVY FLALMSLLAP FLATPFL