Gene Tpen_0091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0091 
Symbol 
ID4601387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp71159 
End bp72589 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content56% 
IMG OID639772845 
Product3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases-like 
Protein accessionYP_919504 
Protein GI119719009 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.28891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCCCTGA AGGAAACACT CAAGCTTTTA GAGTCCAGCG ATAAGCTGAT GCGGATCGGC 
GGGGAGCTAG CACCCTCCAT CGAGATACCC TGGCTTTCTA TCCAGCTCTC CCGGGAGACG
GATAAAGCCT TGCTGTACGA CAGGGTTAAA GGAGGTGCCC TCAGCCTTGC GACGAACCTC
TTCTATGGAA AGTTGCCGCT AGTTCTGAAC GCGCCGTCGG TCGAGCGTAT AATCGAAAGG
GCGGCCCGAG TGCTCCACCT GTTGAGCACC CCAGCCGCGC AACCAACGGA CAAGCTCGGG
CTACTGGCTT CGCTTGCGTG GATTGCCGAC TATTACCCGA CGGTCAAGGA GGCTCCTGGT
GGAGCCGTAC GCATTCTCCA GGGCTACGAC GTAAATCTTC TCGAAATGCC CTTCCTCAAG
CACAGCCGTG AGGAGGAATA CCCGGTCTTA GTGAACCCCA TCATCGTGGC GTCCAGCAGG
GCCACAGGTG ACAGGGAGAT AGCCTCGCAG AGAGTACAGG TGGTAGACGA GAAAACAATG
GTACTGCACG TCCCGAGGGG TTCGCCCCTT CAGTTCCTCA TCGAGAAATC CTCGCGTAGC
AGGTCTAGCC TGGATGTAGG AATACTCGCC GGGACGGTTC CAGCCCTTTA CCCAGCGTCG
CTTCTAACGT GGCTAACACC GGTGGACAAA ATCCTCTTAG CCGGAGTCCT CTCCGAGAGC
AAAATCTCCG TAGTGAAGAC TGAGGAGGGT CTGGTAATCC CGTATCCAAC GGAAGTAGCG
ATAATCGGCG AGCTGGAACC CGGCGACGAG AGGCCGGAGG GGAGAATGCT GTACGAAAAC
GGGGAGGTAT ACGGGGGATC ACCAATGCCC GTAGTACACG TAAAGAAGAT ACTTCTATCT
CCAAGCCCCG TGTTCTACTC TTCGATCATC CACCCGGAGA GGAGCGACGT AGCGCAGGTA
TACTCCTTGG TCGCCAGCCT CCTTGTGTTG CTTATGAAGT CTTTGGCCCC GGAGATCCAG
GAGCTAAGAT TCCTCGGCCA CGACGCTTTT AGAACAGTCA TAGCCAAAGT TGCGACGCAC
AGGAGAGAGC GCTTGCTAAG TATTGGAGGT GAAATACTGT TCCTGTCCGC TGCCGTGAAT
CCTTATGTCG ATACAGTAAT CCTTGTGGGG CCGGACGTAG ACGTGGAGGA TCCAACGATC
CTAGCGAAGG TTCTCCTAGA GAACGTCGAC CCGGACAGAG ACATTATCCA CGTGGACCAG
GCTGACGCAG AACTCTTACC GCGGAGTAGC AGGCGAGGGA AAGTGATTGT ACTGGCGAAC
GCACCGGGAG CAAGGGGGGC CTCGCGCGTA GAGTCTAAAT TAGAGAAAAC GCCCGAGTCT
GAACGATTGT TCTTGGAGCT TACCCGGAAA CTTCGCGGCT CATCATCCTG A
 
Protein sequence
MSLKETLKLL ESSDKLMRIG GELAPSIEIP WLSIQLSRET DKALLYDRVK GGALSLATNL 
FYGKLPLVLN APSVERIIER AARVLHLLST PAAQPTDKLG LLASLAWIAD YYPTVKEAPG
GAVRILQGYD VNLLEMPFLK HSREEEYPVL VNPIIVASSR ATGDREIASQ RVQVVDEKTM
VLHVPRGSPL QFLIEKSSRS RSSLDVGILA GTVPALYPAS LLTWLTPVDK ILLAGVLSES
KISVVKTEEG LVIPYPTEVA IIGELEPGDE RPEGRMLYEN GEVYGGSPMP VVHVKKILLS
PSPVFYSSII HPERSDVAQV YSLVASLLVL LMKSLAPEIQ ELRFLGHDAF RTVIAKVATH
RRERLLSIGG EILFLSAAVN PYVDTVILVG PDVDVEDPTI LAKVLLENVD PDRDIIHVDQ
ADAELLPRSS RRGKVIVLAN APGARGASRV ESKLEKTPES ERLFLELTRK LRGSSS