Gene Information |
Locus tag | Tpen_1778 |
Symbol | |
ID | 4601939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1720614 |
End bp | 1722164 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774551 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_921176 |
Protein GI | 119720681 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0058] Glucan phosphorylase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGACTCCG AGCCCAGGGT TGTGGTTAGC GTTACGCCCG AGCTGGCCCT TGACGACGGC TACACGTTCG CCGGAGGCCT AGGCGTGCTG GAAGGCGACA AGTTCTACGC GGCGGCGAAG CTGGGGCTGA AGTACTACGC CTTGACGCTG TTCTACAGGA ACGGCTACGT GGACTACGCG TTCGATGACT CGCTGAACCC GGTCGCCAAG CCGCAACCCC AGCCTGCGAG CTTCCTGGGG TCTCTCAAGG ACGGCGGCGA GCTGGAGGTC TTCCTGAAGG GGGAGAAGGT AGCCGTCAAG GCATGGGAGT ACGAGCATGG AAGCGCTAAG GCTGTGTTCT TCGAGCCCGT AAGCCCGGAT TGGGCGCGTA GCCTCGGCGA GAGGGTTTTC CTGGAGAGGG ACGCGGAGGA GAGGTTCTAC AAGTACATCC TCCTCGCGCG CGCCGCCGTA GCCTACATGA AGGACAGGAT AGGCCTGGAG AACATAGCGT ACATAGACCT CCAGGAGGCG TACACCGCCG TGATACCCCT AGTGTTCAAG ATACCCGGGA GGTACCGGCT GGTGATACAC ACTCCCGGGC CCTGGGGGCA CCCGTCCTTC CCGAGGGACC TCTTTGCCAA GGAGCTCGGC TACCGGTTCA TCGAGAACCC CGTTGTGCTG ACAAGTATCG GGGCAGCCAC CGCCTACGAG GTAGTAATGG TCAGCTCGAA GCACTTCGAC ATAATGAGGC GCGTGATACC CCAGTACTAC CACAAAGCGA GGTTCGTGAC TAACGGCGTA AACATAGATA GGTGGATGAA CCCCAAGCTG AGAAACCTGT TCGCGAGCGG GAGCCTCGAC GTCGCTACGC TGAGAGGCGT CAGGCTCGAG ATGCGGGACC AGCTCGTCAG GTTCCTGAAG TCCAGGAAAC AGGTCAACGT TGACCAGGAC ACCTTTATCT TCGCGTGGAC GCGCAGAGTG ACGAAGTACA AGAGGCCCTA CTTCCCCGTC AGGCTCATAG AGGAGCTCGG CGACAGGGAC ACGCTCTTCG TTCTCGGCGG GAAAGCGCAC CCGGAGGACA AGGAGGGGTT GCAGTACATG AGGAAGTTCA AGGAGCTGGA GAAAACGCGG CCCAACGTTG TCTACGTCCA CGACTACTCC GTGGAGAGCG CTAAGATCAT ACTCTCGGGG GCCGATGTGC TGGCATTTAC GCCTTTCCCC GGGTGGGAGG CTTCGGGGAC GAGCTTCATG AAGGCAGGGG TTAACGCTGT CCCGTCCATC GCTTCGCGCG ACGGCGCCGT AGTAGAACTC CTCACGGACG GGGTGAACGG GTGGCTGTTC GGGGAGGACA TAAGGGAACT GATAGACTTC GGGAAAGACC CCCGCGTGAG CGAGATCGAC GAGAAGGACT ACGAGGAGTT CAAGAGGAAG TACGCCCAGG CTAAGGATCT CTACGCGAAC GACAGGGAAG GCTTCCTCAA GGTCGCGCTG AGCGCTGTCC TCTCGCTGAC GATGCGCGTC GACATAGTGA GGGCACTGAG GGAGTACTAC CCGGACCTCG TACAAACCTA G
|
Protein sequence | MDSEPRVVVS VTPELALDDG YTFAGGLGVL EGDKFYAAAK LGLKYYALTL FYRNGYVDYA FDDSLNPVAK PQPQPASFLG SLKDGGELEV FLKGEKVAVK AWEYEHGSAK AVFFEPVSPD WARSLGERVF LERDAEERFY KYILLARAAV AYMKDRIGLE NIAYIDLQEA YTAVIPLVFK IPGRYRLVIH TPGPWGHPSF PRDLFAKELG YRFIENPVVL TSIGAATAYE VVMVSSKHFD IMRRVIPQYY HKARFVTNGV NIDRWMNPKL RNLFASGSLD VATLRGVRLE MRDQLVRFLK SRKQVNVDQD TFIFAWTRRV TKYKRPYFPV RLIEELGDRD TLFVLGGKAH PEDKEGLQYM RKFKELEKTR PNVVYVHDYS VESAKIILSG ADVLAFTPFP GWEASGTSFM KAGVNAVPSI ASRDGAVVEL LTDGVNGWLF GEDIRELIDF GKDPRVSEID EKDYEEFKRK YAQAKDLYAN DREGFLKVAL SAVLSLTMRV DIVRALREYY PDLVQT
|
| |
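The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is one way to do that, not part of the database itself. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence above ends in TAG). The helper names `gene_length`, `expected_protein_length`, and `gc_percent` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Values copied from the record for Tpen_1778.
    length = gene_length(1720614, 1722164)
    print(length)                           # compare with "Gene Length"
    print(expected_protein_length(length))  # compare with "Protein Length"
```

With the values from this record, the computed gene length is 1551 bp and the expected protein length is 516 aa, matching the Gene Length and Protein Length fields. Passing the full gene sequence string to `gc_percent` gives a figure to compare against the reported GC content, which the record appears to round to a whole percent.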