Gene Information |
Locus tag | Tpen_0888 |
Symbol | |
ID | 4600831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 835956 |
End bp | 836924 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639773666 |
Product | phosphoesterase, DHHA1 |
Protein accession | YP_920292 |
Protein GI | 119719797 |
COG category | [R] General function prediction only |
COG ID | [COG2404] Predicted phosphohydrolase (DHH superfamily) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.141577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGATCT ACGCCTTTGT ACTCCACGGG GACCTTGATG GGCTCTCCGC TACAGCCACG GTTGCCGCCG CGTTGAAGCA CGCGGAGAAG GACGCGGAGC TGAGGTTTTA CTTCTCACAG CCCTACGAGC TGGACAAAGA CCTAGCCCGC GTGGATCCCC GAGCGGAGGG CATCTACATA GTTGACCTAG CGATAGACGC GGACGTGTGG CCGAGGCTCA GCGCGGAGCT CGGCAAGCTT GTGGCGTCGA AGAGGGTGAC TTGGATCGAC CACCACCCTT CTACCATTGA GCGGGTCGAG GAGCTGAAGA GGATCGGGGT GGAAGCCATG CTGGCAGAGG CCGCGTCGGC GTCGACTATA GCTAGGAGCT TCCTGGACAG AGTCCCGGAC CCCGCCTTCT TCGAGAAGAT AATCACGATA GGAGAGGTTG CCGACAGGGC TGTGAGCGTA GCACGGGAGG ACCCGCTCTT CCACTACGTC GAGGTTCTAA GCCTGGTCCT CGGGTACCGC GTGCGGGACG AGCAGATAAG GAGAAGGATA CTCAGAGCCT GGATCACCGA GAGGGTAGTA GTCCCCGACG AAGTAGCAAA GGCCGCAAGT GAAGCGGAGA AGGTGTTTCA GGACCTCCTT AGGGAGGCGC GCCTCAGGGT AGTCTACCGC TCCGAGAAGG TTGTAGCCGT GGATATGAGG GACAAGAGGG TATACGGCTT CGCCGGCATG CTGGCATCCA TCATCGCCAG CGAGGAGGGG AGGATCGCGT TGATTCTCAG CAGGGTCGGC GAGGCGGCAC TCCTAACGCT GAGAGCACCT CCCGGGGCAA AGGGGAACCC GTCGAAGACT GCATGGGATG TGGCATCGCG ATACGGGGGG TCGGGCGGCG GCCACGCAGG CGCTGCATCC TTCAAGGTTC CCGGGACCTA CGCCGAGAAA GTTCTCGCGG AGATCGTACG TGCACTAGAA ACTGGCTAA
|
Protein sequence | MRIYAFVLHG DLDGLSATAT VAAALKHAEK DAELRFYFSQ PYELDKDLAR VDPRAEGIYI VDLAIDADVW PRLSAELGKL VASKRVTWID HHPSTIERVE ELKRIGVEAM LAEAASASTI ARSFLDRVPD PAFFEKIITI GEVADRAVSV AREDPLFHYV EVLSLVLGYR VRDEQIRRRI LRAWITERVV VPDEVAKAAS EAEKVFQDLL REARLRVVYR SEKVVAVDMR DKRVYGFAGM LASIIASEEG RIALILSRVG EAALLTLRAP PGAKGNPSKT AWDVASRYGG SGGGHAGAAS FKVPGTYAEK VLAEIVRALE TG
|
| |
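The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). The helper names `gene_length`, `protein_length`, and `gc_percent` are hypothetical.

```python
# Values copied from the record above.
START_BP = 835956
END_BP = 836924


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "969 322"
```

Under these assumptions the results agree with the listed Gene Length (969 bp) and Protein Length (322 aa). Passing the gene sequence string to `gc_percent` would allow a similar comparison against the listed GC content.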