Gene Information |
Locus tag | Tpen_0158 |
Symbol | |
ID | 4601267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 134798 |
End bp | 135958 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639772912 |
Product | glycosyl transferase family protein |
Protein accession | YP_919571 |
Protein GI | 119719076 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
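The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the terminal stop codon; neither assumption is stated in the record, but both agree with the values shown. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but contributes no amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from the record for Tpen_0158.
length_bp = gene_length(134798, 135958)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1161 386
```

Because the gene lies on the minus strand, the sequence shown in the record would be the reverse complement of the replicon between these coordinates; the length arithmetic is the same on either strand.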
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGCC GCGCTATCGC CGAGGTAGTC TTGGAGGGTA AGGCTTCGCA GGGCCGAAAG GTGTCGGTCA TCATACCGTC GTACAGGGGG TCTGAGAGGT TAGTCAGGCT TGTGAAGAGG GTGGCGAGCC TCCCCTACGA GGACAAGGAA GTAGTAGTCG TTGTGGATGA GCCTCTGAGG GAGGTGGCCG AGGAACTAAG GAGGATCGGC GGGGTGAAGC TCATCCTAAG GCCTAAGAGG GGCGGCAAGG TTAGCGCGCT GAACGAGGCG CTTAGAGAGT CTAGCGGCGA GGTCGTAATA TTCCTGGACG ACGACGTATA CGTGGAGGAC GACCGCTTCA TCGAGAAGGT TTTGAAGGCT ATGGAGGGCT ACGACATAGC CGACATCAAG AAAGTTATAG TCGACACGGG GGGTATTCTC TCGAAGCTCG TCTACATCGA GTACGCATCC TACAACTTTG CAAGTAAGCT GATGGCGAGG GCCGCTAGGA GAACAGTCGC CGTCAACGGG GCGGCGTTCG CCGTCAGGAG AAAAGCGCTG GACGAGATAG GGTACTTCCG CCCATCGATA TCCGAGGACT TCGACATAGC CCTGAGGTCG TTTAAAGCTA AGCATAACTT TACGTACATC GAGAACACCT ACGTACTCAA CTATCCTCCG AGCGATTTTA GGAAGTGGTT TAAGCAACGC AAAAGGTGGG CAATAGGCCT CGCCGCCTGG CTGGAAGAGA ACTTCGCGGA CGCGCTTAAA ACGCTTCTCA GAATGCCGCA CGCCGTGATC CCCGGGCTCC TGCTGGCTCT ACCGTCGCTT TCGAGCGCTT TGATAACGTT CGTCCTCAGC AACCACGTCT ACGAGAAGAC GGCTTACCTC TTCATGCTCA CGTTGTCGTC CCTAGTAGCC CAGGCGCTTC CATTCGCCTC GATCCTGCTT CTGAACATCC AGCTCATATA CCTCGTAAAG GCGGGAGCAA TCCTCACAGC GTTCTTCGTG TTCCTATTCT GGCAGTTCGC GGCATCACGC GCCGTGAAGA TGAAGTCATA CCTGTACCTA TACCCTGTCT ACTTCTTCGT TTACCAGCCA CTCTGGCTTA CGATACTCTT AGCCGGCTTC ATTCGAGTTA TAGTCCTTAG AAGGAAGAGC GTCGAAGACT GGGTTGTCTA A
|
Protein sequence | MSSRAIAEVV LEGKASQGRK VSVIIPSYRG SERLVRLVKR VASLPYEDKE VVVVVDEPLR EVAEELRRIG GVKLILRPKR GGKVSALNEA LRESSGEVVI FLDDDVYVED DRFIEKVLKA MEGYDIADIK KVIVDTGGIL SKLVYIEYAS YNFASKLMAR AARRTVAVNG AAFAVRRKAL DEIGYFRPSI SEDFDIALRS FKAKHNFTYI ENTYVLNYPP SDFRKWFKQR KRWAIGLAAW LEENFADALK TLLRMPHAVI PGLLLALPSL SSALITFVLS NHVYEKTAYL FMLTLSSLVA QALPFASILL LNIQLIYLVK AGAILTAFFV FLFWQFAASR AVKMKSYLYL YPVYFFVYQP LWLTILLAGF IRVIVLRRKS VEDWVV
|
| |
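The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The following is a minimal sketch in plain Python, not the method used to build this record. It assumes the sequence contains only A, C, G, and T (any other symbol raises a `KeyError`) and that the gene starts with ATG, so the alternative start codons allowed by translation table 11 do not matter; the amino acid assignments used here are those of the standard code, which table 11 shares. The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed in the record.
prefix = "ATGTCTAGCC GCGCTATCGC CGAGGTAGTC"
print(translate(prefix))  # MSSRAIAEVV
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.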