Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0404 |
Symbol | |
ID | 4601498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 369207 |
End bp | 370559 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773169 |
Product | UbiD family decarboxylase |
Protein accession | YP_919816 |
Protein GI | 119719321 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGGTTTA CCGTATTCGT GGAGCCTGGA GCATCGGAGG TTGTGCCAGC GCTCGGGCTG GATTTCCGGG AACTAGTCGG ATACCACCAG AAGCTGGGGA GGTTGAAGCA GCTTCAGAAA ATGGTGGAGT TGGAGTTTGA AGCGGCTTAC TACATGAAAA AGTACCAGCC CGATCCCGTG CTGATGAATA CGCGTTATGG AAGGCTGATC TCCAACGTTC TTGCAAGACG CGAAACGGTA TACGAGGTTG TAGGGGGGAA AAGCGACTCG GAGGTCTACG CCAAGTTCCT AAGGGCTATG GAGAACCCCC AGCCCCTCTC AAGGATAGAG AGTAGCACTG GTTTAAAGGA GGTGGCTGTG GACCTCTTCA GACTGCCTGT TCCCAAGTTT TTTGAGCGCG ACGGAGGTAG GTACATAACC GCGGGGGTAT TCATCGCGAA GGATCCTCTT ACGGGCGCTG TCAACGCGAG CATTCACAGA GCGATGATCC TTGACGAGGA GAGCCTCGCC GTGAGGCTTG TACCGAGGCA CCTGTACCAG ATACACAGGA ACGCGGAGAA AGCTGGGAGG AACCTCCCCG CCGCCATACT GATAGGCGCG CCCCCGCTGG TCTACCTCTG CGCGGCGTCG AGCCCACCCT TCGGCGTGTA CGAGGTGGAG GTGGCTAACG CGCTGGCGGG GGGCAGGCTT ACGGGTACGG ACTCGCTGCT CGACGGAGTT GTTCTACCGC TACCGGTGGA GGTGGTTCTT CTAGGGGAGT TCATAGCCGG GAGGAGGGCC AAGGAGGGCC CCTTCGTGGA CATCCTGGGA ACCTACGACA TCGTCAGGGA GGAGCCGGTT TTCCGCGTGG AGTCGATCCT TACGCGCCAG GACCCGCTCT TCTACTCGAT CCTGCCCTCG GGCCTCGAAC ACATACTGTT GATGGGCTTT CCAAGGGAGG CCGCGATATG GAGCGTTGCG TCGAGAACGG CTACGGGTGT AAGGAAGGTT AGGCTAACGC CTGGGGGAGG GGGGTGGCTT CACGCGGTGA TATCTATGGA GAAAACCACG GAAGGCGACC CCAAGAATGT TATACTGGCC GCTTTCGCCG CTCACCCCTC GCTTAAAACG GTCGTAGTCG TTGATGCCGA CGTAGACCCC GACGACCCCC TAGACGTTGA ATGGGCGCTT GCAACACGCA TGCAACCAGA CGAAGACATC GTGATCATAA AGGGAGCCAG GGGTAGTAGC CTTGACCCCT CCGCCGACCA GGTAACTCTT CAGACGTCTA AGCTTGGGAT AGACGCTACC CGACCACTCT CGAAGGACAA AAGCCTCTTC GAGAAGGCGC GGATACCCTT CACCGAACAC TAA
|
Protein sequence | MRFTVFVEPG ASEVVPALGL DFRELVGYHQ KLGRLKQLQK MVELEFEAAY YMKKYQPDPV LMNTRYGRLI SNVLARRETV YEVVGGKSDS EVYAKFLRAM ENPQPLSRIE SSTGLKEVAV DLFRLPVPKF FERDGGRYIT AGVFIAKDPL TGAVNASIHR AMILDEESLA VRLVPRHLYQ IHRNAEKAGR NLPAAILIGA PPLVYLCAAS SPPFGVYEVE VANALAGGRL TGTDSLLDGV VLPLPVEVVL LGEFIAGRRA KEGPFVDILG TYDIVREEPV FRVESILTRQ DPLFYSILPS GLEHILLMGF PREAAIWSVA SRTATGVRKV RLTPGGGGWL HAVISMEKTT EGDPKNVILA AFAAHPSLKT VVVVDADVDP DDPLDVEWAL ATRMQPDEDI VIIKGARGSS LDPSADQVTL QTSKLGIDAT RPLSKDKSLF EKARIPFTEH
|
| |