Gene Information |
Locus tag | Tpen_0091 |
Symbol | |
ID | 4601387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 71159 |
End bp | 72589 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639772845 |
Product | 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases-like |
Protein accession | YP_919504 |
Protein GI | 119719009 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
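The length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table; the helper names are illustrative.
start_bp = 71159
end_bp = 72589
gene_length_bp = 1431
protein_length_aa = 476


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 72589 - 71159 + 1 = 1431 and 1431 / 3 - 1 = 476.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

The strand field does not enter this check, since the span length is the same whichever strand carries the gene.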
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.28891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCCCTGA AGGAAACACT CAAGCTTTTA GAGTCCAGCG ATAAGCTGAT GCGGATCGGC GGGGAGCTAG CACCCTCCAT CGAGATACCC TGGCTTTCTA TCCAGCTCTC CCGGGAGACG GATAAAGCCT TGCTGTACGA CAGGGTTAAA GGAGGTGCCC TCAGCCTTGC GACGAACCTC TTCTATGGAA AGTTGCCGCT AGTTCTGAAC GCGCCGTCGG TCGAGCGTAT AATCGAAAGG GCGGCCCGAG TGCTCCACCT GTTGAGCACC CCAGCCGCGC AACCAACGGA CAAGCTCGGG CTACTGGCTT CGCTTGCGTG GATTGCCGAC TATTACCCGA CGGTCAAGGA GGCTCCTGGT GGAGCCGTAC GCATTCTCCA GGGCTACGAC GTAAATCTTC TCGAAATGCC CTTCCTCAAG CACAGCCGTG AGGAGGAATA CCCGGTCTTA GTGAACCCCA TCATCGTGGC GTCCAGCAGG GCCACAGGTG ACAGGGAGAT AGCCTCGCAG AGAGTACAGG TGGTAGACGA GAAAACAATG GTACTGCACG TCCCGAGGGG TTCGCCCCTT CAGTTCCTCA TCGAGAAATC CTCGCGTAGC AGGTCTAGCC TGGATGTAGG AATACTCGCC GGGACGGTTC CAGCCCTTTA CCCAGCGTCG CTTCTAACGT GGCTAACACC GGTGGACAAA ATCCTCTTAG CCGGAGTCCT CTCCGAGAGC AAAATCTCCG TAGTGAAGAC TGAGGAGGGT CTGGTAATCC CGTATCCAAC GGAAGTAGCG ATAATCGGCG AGCTGGAACC CGGCGACGAG AGGCCGGAGG GGAGAATGCT GTACGAAAAC GGGGAGGTAT ACGGGGGATC ACCAATGCCC GTAGTACACG TAAAGAAGAT ACTTCTATCT CCAAGCCCCG TGTTCTACTC TTCGATCATC CACCCGGAGA GGAGCGACGT AGCGCAGGTA TACTCCTTGG TCGCCAGCCT CCTTGTGTTG CTTATGAAGT CTTTGGCCCC GGAGATCCAG GAGCTAAGAT TCCTCGGCCA CGACGCTTTT AGAACAGTCA TAGCCAAAGT TGCGACGCAC AGGAGAGAGC GCTTGCTAAG TATTGGAGGT GAAATACTGT TCCTGTCCGC TGCCGTGAAT CCTTATGTCG ATACAGTAAT CCTTGTGGGG CCGGACGTAG ACGTGGAGGA TCCAACGATC CTAGCGAAGG TTCTCCTAGA GAACGTCGAC CCGGACAGAG ACATTATCCA CGTGGACCAG GCTGACGCAG AACTCTTACC GCGGAGTAGC AGGCGAGGGA AAGTGATTGT ACTGGCGAAC GCACCGGGAG CAAGGGGGGC CTCGCGCGTA GAGTCTAAAT TAGAGAAAAC GCCCGAGTCT GAACGATTGT TCTTGGAGCT TACCCGGAAA CTTCGCGGCT CATCATCCTG A
|
Protein sequence | MSLKETLKLL ESSDKLMRIG GELAPSIEIP WLSIQLSRET DKALLYDRVK GGALSLATNL FYGKLPLVLN APSVERIIER AARVLHLLST PAAQPTDKLG LLASLAWIAD YYPTVKEAPG GAVRILQGYD VNLLEMPFLK HSREEEYPVL VNPIIVASSR ATGDREIASQ RVQVVDEKTM VLHVPRGSPL QFLIEKSSRS RSSLDVGILA GTVPALYPAS LLTWLTPVDK ILLAGVLSES KISVVKTEEG LVIPYPTEVA IIGELEPGDE RPEGRMLYEN GEVYGGSPMP VVHVKKILLS PSPVFYSSII HPERSDVAQV YSLVASLLVL LMKSLAPEIQ ELRFLGHDAF RTVIAKVATH RRERLLSIGG EILFLSAAVN PYVDTVILVG PDVDVEDPTI LAKVLLENVD PDRDIIHVDQ ADAELLPRSS RRGKVIVLAN APGARGASRV ESKLEKTPES ERLFLELTRK LRGSSS
|
| |
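The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. As a minimal sketch, the GC content field could be recomputed from the gene sequence as shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo input is only the first three blocks of the gene sequence, so its result is not expected to match the whole-gene value reported in the table.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for block display and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence from this record, used as a short demo input.
demo = "TTGTCCCTGA AGGAAACACT CAAGCTTTTA"
print(f"{len(clean_sequence(demo))} bases, GC = {gc_percent(demo):.1f}%")
```

Passing the full gene sequence string in place of `demo` gives the figure to compare against the GC content field.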