Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1837 |
Symbol | |
ID | 4602074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1779300 |
End bp | 1780370 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639774610 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_921235 |
Protein GI | 119720740 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.106418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACATAG AAGCTTGGGT GTGCGAAGTC CCGCTTAGGG AGCCCTTAAG GATTTCTGCC GCGACCTATC ACACCCAGCG CTCCATAGTT CTCCGCTTGA GGGACGGAGA TCTAGAGGGT TGGGGCGAGG CTTGCCAGTC GAGGAGGGTT CTCGGAGAGA GCTTCGAGGA CGCGCTGGAG TCTCTGAGAG CGAGCGTCGA CGCAATCAAG AGGGCGGAGT ACGACTCTTT GGAGAAGATT CACAGGTTCA CCGAGGAGCT CAACGCTACC CCGTCGATAA AGGCCGCTGT TAACATGGCT CTCCTGGACC TCTACGCGAA GTCGGAGGGC AGGCCTCTGT GGAGGCTTCT CGGGGGCTTC AGGGAGGAGG TGAGGACGGA CATCACGATA GGGATCATGC GGCCGGAGGA GATGGCGGAG AGGGCTCTGC GCTACGCTGA GAAGGGGTTC CGGATATTCA AGCTCAAGCT GGGCGAGAAC CCGGAGGAGG ACGTTCTAAG AGTTAAGGCT GTGCGGGACG TCGTTGGAGA CGCGACGATA AGAGTGGACG CGAACGAGGG GTGGACGCGC GAGGACGCCG TACGGGTGAT AGAGAGGATA GCCGACTACG GCGTTGAACT CGTGGAGCAA CCGCTCAGAC ACGACGACAT AGAAGGCCTT AGGGCCTTGA GGAGGGAAAG CCCGATACCC ATAGCTGTAG ACGAGTCCGT GAAGACGGCG CGCGACGCGC TGCTGGTAGC CAAGAAGGAG GCGGCCGACA TCATAAACAT CAAGCTGATG AAAAGCAGGG GGATTACGGG CGCAATCAGG ATAATCATGG TAAGCGAGGC GGCCGGGCTG AAGAACATGG TCGGATGCTT CTCCGAGTCT AGGCTTGGCA TCACGGCTAC CGCTTACCTC GCCCAGGCAT TCTCCAACGT GCTGTTCTAC GACTTGGACT GCGACATACT GAGCGCCGAC CCCGTGTTCA CGGGAGGCTC TGAGCCTATT GGTGACAGAA GAAGAGCTTC CCCGAAGCCT GGGCTGGGAA TCTCGCCGAG TAACTTCAAC GTATTGAAGA AGATCCTCTA G
|
Protein sequence | MDIEAWVCEV PLREPLRISA ATYHTQRSIV LRLRDGDLEG WGEACQSRRV LGESFEDALE SLRASVDAIK RAEYDSLEKI HRFTEELNAT PSIKAAVNMA LLDLYAKSEG RPLWRLLGGF REEVRTDITI GIMRPEEMAE RALRYAEKGF RIFKLKLGEN PEEDVLRVKA VRDVVGDATI RVDANEGWTR EDAVRVIERI ADYGVELVEQ PLRHDDIEGL RALRRESPIP IAVDESVKTA RDALLVAKKE AADIINIKLM KSRGITGAIR IIMVSEAAGL KNMVGCFSES RLGITATAYL AQAFSNVLFY DLDCDILSAD PVFTGGSEPI GDRRRASPKP GLGISPSNFN VLKKIL
|
| |