Gene Tpen_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0404 
Symbol 
ID4601498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp369207 
End bp370559 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content58% 
IMG OID639773169 
ProductUbiD family decarboxylase 
Protein accessionYP_919816 
Protein GI119719321 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGGTTTA CCGTATTCGT GGAGCCTGGA GCATCGGAGG TTGTGCCAGC GCTCGGGCTG 
GATTTCCGGG AACTAGTCGG ATACCACCAG AAGCTGGGGA GGTTGAAGCA GCTTCAGAAA
ATGGTGGAGT TGGAGTTTGA AGCGGCTTAC TACATGAAAA AGTACCAGCC CGATCCCGTG
CTGATGAATA CGCGTTATGG AAGGCTGATC TCCAACGTTC TTGCAAGACG CGAAACGGTA
TACGAGGTTG TAGGGGGGAA AAGCGACTCG GAGGTCTACG CCAAGTTCCT AAGGGCTATG
GAGAACCCCC AGCCCCTCTC AAGGATAGAG AGTAGCACTG GTTTAAAGGA GGTGGCTGTG
GACCTCTTCA GACTGCCTGT TCCCAAGTTT TTTGAGCGCG ACGGAGGTAG GTACATAACC
GCGGGGGTAT TCATCGCGAA GGATCCTCTT ACGGGCGCTG TCAACGCGAG CATTCACAGA
GCGATGATCC TTGACGAGGA GAGCCTCGCC GTGAGGCTTG TACCGAGGCA CCTGTACCAG
ATACACAGGA ACGCGGAGAA AGCTGGGAGG AACCTCCCCG CCGCCATACT GATAGGCGCG
CCCCCGCTGG TCTACCTCTG CGCGGCGTCG AGCCCACCCT TCGGCGTGTA CGAGGTGGAG
GTGGCTAACG CGCTGGCGGG GGGCAGGCTT ACGGGTACGG ACTCGCTGCT CGACGGAGTT
GTTCTACCGC TACCGGTGGA GGTGGTTCTT CTAGGGGAGT TCATAGCCGG GAGGAGGGCC
AAGGAGGGCC CCTTCGTGGA CATCCTGGGA ACCTACGACA TCGTCAGGGA GGAGCCGGTT
TTCCGCGTGG AGTCGATCCT TACGCGCCAG GACCCGCTCT TCTACTCGAT CCTGCCCTCG
GGCCTCGAAC ACATACTGTT GATGGGCTTT CCAAGGGAGG CCGCGATATG GAGCGTTGCG
TCGAGAACGG CTACGGGTGT AAGGAAGGTT AGGCTAACGC CTGGGGGAGG GGGGTGGCTT
CACGCGGTGA TATCTATGGA GAAAACCACG GAAGGCGACC CCAAGAATGT TATACTGGCC
GCTTTCGCCG CTCACCCCTC GCTTAAAACG GTCGTAGTCG TTGATGCCGA CGTAGACCCC
GACGACCCCC TAGACGTTGA ATGGGCGCTT GCAACACGCA TGCAACCAGA CGAAGACATC
GTGATCATAA AGGGAGCCAG GGGTAGTAGC CTTGACCCCT CCGCCGACCA GGTAACTCTT
CAGACGTCTA AGCTTGGGAT AGACGCTACC CGACCACTCT CGAAGGACAA AAGCCTCTTC
GAGAAGGCGC GGATACCCTT CACCGAACAC TAA
 
Protein sequence
MRFTVFVEPG ASEVVPALGL DFRELVGYHQ KLGRLKQLQK MVELEFEAAY YMKKYQPDPV 
LMNTRYGRLI SNVLARRETV YEVVGGKSDS EVYAKFLRAM ENPQPLSRIE SSTGLKEVAV
DLFRLPVPKF FERDGGRYIT AGVFIAKDPL TGAVNASIHR AMILDEESLA VRLVPRHLYQ
IHRNAEKAGR NLPAAILIGA PPLVYLCAAS SPPFGVYEVE VANALAGGRL TGTDSLLDGV
VLPLPVEVVL LGEFIAGRRA KEGPFVDILG TYDIVREEPV FRVESILTRQ DPLFYSILPS
GLEHILLMGF PREAAIWSVA SRTATGVRKV RLTPGGGGWL HAVISMEKTT EGDPKNVILA
AFAAHPSLKT VVVVDADVDP DDPLDVEWAL ATRMQPDEDI VIIKGARGSS LDPSADQVTL
QTSKLGIDAT RPLSKDKSLF EKARIPFTEH