Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0263 |
Symbol | |
ID | 4601681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 231333 |
End bp | 232445 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773018 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_919676 |
Protein GI | 119719181 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01928] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.161308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGATCA GGAAGCTCGA GCTCTTCTAC CTCGAGATGA AGCTTAAAGA AGAGTTTAGG ACGAGCTTTG GAAGTGTTTC CACGAGGCCG GTTGTGCTAG TAAGGGTGGA GGAGAAGGGG GGCGAGGAGG GGTGGGGAGA ACTTGTAGCA GACCGGGGCC CCTGGTATAG CTACGAGACC TACGAGACGT CGAGCCTGGT GATCCGGAAG TTTATCGCGC CGAGCCTGGT GGGAAGAGAT ATTGAGGAGG TGGAAGATTT TCACAGACTG GTGGCTTCGA TACGTGGATA CCCCATGGCG AAGACTGCTG TCGAGGAGGC GCTGCTGGAC CTAGAGGCGA GAATGGAGGG GAGGAGCGTC TCGGAGGTTC TCGGAGGCTC CAGGCAGGAA ATAGAGGCGG GGGTTAGCAT AGGTATAAAG GCTAGCGTTG AGGAGCTTCT AAGAGAGGTA AGGAGGAGGC TGGAAGAGGG TTACCAGCGT ATAAAGGTGA AGATTAAGCC GGGCTACGAC GTCGAGGTCG TGAAGAGGAT AAGGGAGGAG TTCGGGGACA TCAGGCTCCA AGTAGATGCT AACGGAGCGT ACTCTTTGAG AGACATCGGA GTGTTCAGGG AGCTCGATAG GTTTAACCTG CTCATGCTAG AGCAACCCCT CGCCTACGAC GATCTCTACG AGCACAGCGT TTTGAGCAGG AAGATCTCTA CCCCTGTGTG CCTGGATGAG AGTATTAGGA GCTTGCACGA CCTGGTGGTT GCAAGCATCC TGGGGTCAGC GGAGGTCGTG AACGTTAAGC CTGCAAGGGT TGGGGGAGTG CTCAAGGCTA AAAGCATCTT AGAAGTCGCG GCGAAGCTCG GGTGGGGGGC GTGGGTTGGC GGAATGCTGG AAACAGGCAT AGGCAGGGCG TTCCTAGTCG CCCTCGCGTC CCTTCCGTTC GTTAACTACC CTAACGATAT CTCCGCGAGC AACCGCTACT GGGATGAGGA CATAGTCGAG CCTCCCTGGG AGATAACCCC TCGCGGCACG ATAGCTGTTC CCAGGAGGCG CGGGCTCGGA GTAGAGGTAA AGAGAGAGCT CGTAGACAGG CTTTCGCTGG AGAGGTGGAC TGCCACGTAC TGA
|
Protein sequence | MEIRKLELFY LEMKLKEEFR TSFGSVSTRP VVLVRVEEKG GEEGWGELVA DRGPWYSYET YETSSLVIRK FIAPSLVGRD IEEVEDFHRL VASIRGYPMA KTAVEEALLD LEARMEGRSV SEVLGGSRQE IEAGVSIGIK ASVEELLREV RRRLEEGYQR IKVKIKPGYD VEVVKRIREE FGDIRLQVDA NGAYSLRDIG VFRELDRFNL LMLEQPLAYD DLYEHSVLSR KISTPVCLDE SIRSLHDLVV ASILGSAEVV NVKPARVGGV LKAKSILEVA AKLGWGAWVG GMLETGIGRA FLVALASLPF VNYPNDISAS NRYWDEDIVE PPWEITPRGT IAVPRRRGLG VEVKRELVDR LSLERWTATY
|
| |