Gene Tpen_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1081 
Symbol 
ID4601693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1017576 
End bp1018790 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content60% 
IMG OID639773858 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_920483 
Protein GI119719988 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.152309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAGTA CGAGTAGCGG TGTCGTATAC TCGAGGCTGT TACTCAGGGA GGGCGACGAG 
TCCGTCTACG AGCTCTTCAT AGGGCCGCAA CACCCATCCT CGGGCCACAT GAGGTTCATA
GTGAGGCTTC AGGGAGACGT AATAGTGTCC GTCGACCCGG ACATAGGCTA CGTGCACCGC
ACGATGGAGA AGCTCGCCGA GGGGCGGGAG GCCATAAAGG CGATACCGCT CCTCGAAAGG
CTGACGATAA TAGACTCGCA TAACGCCACG GTAGGCCTGG TAACGGCGAT GGAGAGGCTT
CTCGACGTGG AGCCTCCGCC GCGGGCCCTC TACCTCAGGA CTCTTCTCTC GGAGATAAAC
AGGATAGCGA GCCACCTGTA CGGGATGGGC ATAGCCGGGA TCATGCTGAA CCACTCGACG
ATGTTCATGT GGGCGTTCGG GGACCGCGAG GTGTGGCTCC AGCTCGCCGA GGAATTGACG
GGGGCCAGGC TCACACACAC CTACAGCGTG CCAGGGGGTG TGAGGAGGGA TCTTCCGCAG
GGCTTCGGGG AGAAGTTCGA GAAAGCAGCT AGGTACATGG AGAGGAGGTT GCAGGATTAC
ATGAGGATCT TCCTGGAGAA CCCCCAGGTG GTCGCGAGGT ACGAAGGCGT AGGGGTGTTG
AAGAAGTCCG AGGCCTCCAG GCTCGGGGTC GTGGGCCCGA ACCTACGCGC GAGCGGCGTG
AAATACGACG CTAGGCTCGC GGACGACTAC GGTGCCTACA AGGACCTCGA GTTCGAGGTT
CCAACCCGAG AGGAGGGCGA CTGTATGGCT AGGATGCTGG TTAGAGTGGA GGAGATAAAG
CAGAGCATCT CGATCATACG CCAAGTACTC CGGAAGATGC CGGATGGACC CATACTCTCC
GAGAAGTACC TCAAGCTCCT GCCGCCCAAG ACTCGCGAGA GGGTTTTGCA GGAGGGGAGG
GTCAAGTTCC CGGCGCTCTT CGCCTCCCTG AAGTTGCCCG CCGGCGAAGC CGTGGCTAGG
GCCGAGATGG GGCACGGCGA GATATTCTAC CACATCACGG GGGACGGGTC GGCGAAGCCG
TACAGGCTCC GGGTCGTAAC GCCCTCCTTC AGGAACGTCA TACTGTTCAG GTACCTGGCC
CCCGGTCACA GGTTTATGGA TTTCCCCGCG ATATACGGTT CTCTGGACTA CTTCCCTCCC
GAGGCGGATA GGTGA
 
Protein sequence
MLSTSSGVVY SRLLLREGDE SVYELFIGPQ HPSSGHMRFI VRLQGDVIVS VDPDIGYVHR 
TMEKLAEGRE AIKAIPLLER LTIIDSHNAT VGLVTAMERL LDVEPPPRAL YLRTLLSEIN
RIASHLYGMG IAGIMLNHST MFMWAFGDRE VWLQLAEELT GARLTHTYSV PGGVRRDLPQ
GFGEKFEKAA RYMERRLQDY MRIFLENPQV VARYEGVGVL KKSEASRLGV VGPNLRASGV
KYDARLADDY GAYKDLEFEV PTREEGDCMA RMLVRVEEIK QSISIIRQVL RKMPDGPILS
EKYLKLLPPK TRERVLQEGR VKFPALFASL KLPAGEAVAR AEMGHGEIFY HITGDGSAKP
YRLRVVTPSF RNVILFRYLA PGHRFMDFPA IYGSLDYFPP EADR