Gene Information |
Locus tag | Tpen_1114 |
Symbol | |
ID | 4600856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1051814 |
End bp | 1052962 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639773891 |
Product | GHMP kinase |
Protein accession | YP_920516 |
Protein GI | 119720021 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGGAG GGAGGTTCAC TGCATCGGCT CCGGGTAGGG TGGATTTCCT GAACACGCAC CAGGACTACA AGGGGCTCCC GGTCGTACCC GTAGCTATAA ACCTGCGTAC GTACGTGGAT GTTCTGGGAA GGAGCGAGCT GTTCGAGGTT AAAAGCGAGG CTCTGTGCGC GGAGGGCTTA GAGTGCGTAG ACAGGTTTCC GCCCACCAAT CCTCCTCTGG TCGAGGGGAG GTGGTGGGGG AACTACCTGC GCGCCGTTGT GAGGGCTGTC GAGGAGTACC TCGGGAAGCC CCTCCCGGAG GGCTTCAGGG CTGTCGTGAG GAGCGAGGTG CCCGTGGGTA GCGGTTTGTC GAGTAGCGCT GCGCTCGAAG TCTCGTTTCT AAAGGCTATC GACTACTACT TCAACCTCGG GCTCGGGAAA AAGGAGCTAG CGGAGCTGGC ATTCCAGGCT GAGAACAGGA TTGCGGGTAT ACCTTGCGGC AGGCTAGACC AGTACGCGTC CGCGTACGGC GGCGTGATAC TCCTCAAGCC CAGACCCCCG GTTGAAGTCG AGGAGCTAGA GCCGGGTAGC CTCAGGTTCG TAGTCGTAGA CTCCGGTATA CGCCACAGCG TGGCAGACAT CCACCCGAAG AGGCAGGAAG AGATAAACCG GGGGCTGAGA GCGCTCATGG AGGACCCCTC CGTTCCTCCT GGCCTTAAGA GGCTACTCGG GTACCGTTAC GACGAGCCCA GGTGGGAGGA GCTATCCCTG GAGGATCTCC AGCCGTACCT AGACAGGCTG GACGAGGCTT CGAGAAAGAG GATACTGTTC ACGTTGCTAA TGCAGGCATC CACCTCGAGG GCTGTCGGGA TTCTAAGGAG GAAAGGCTGG AAGCCGCGGG AACTGGCCCC CGAGGTGAAC TACCAGCACG AGCTCCTGAG AGACCTCTAC GAGGTTAGCC TCCCGGAGCT CGAGAGGATA CGTGACGCGA TGCTCCGCTC GGGTGCCCTA GCAGCCAAGA TAAGCGGGGC CGGGATGGGC GGAAGCCTTC TAGCCTTAAC CGAGGGCGGA GAAGAGGAGG TCGTGGGCTC GGCGCTAAGG GAGGGCGGCA AAAAAGCTTG GATTCTCGTA CCGGACGAGG GCGCTCGGAT CGACCGCGTA GACGGCTAG
|
Protein sequence | MAGGRFTASA PGRVDFLNTH QDYKGLPVVP VAINLRTYVD VLGRSELFEV KSEALCAEGL ECVDRFPPTN PPLVEGRWWG NYLRAVVRAV EEYLGKPLPE GFRAVVRSEV PVGSGLSSSA ALEVSFLKAI DYYFNLGLGK KELAELAFQA ENRIAGIPCG RLDQYASAYG GVILLKPRPP VEVEELEPGS LRFVVVDSGI RHSVADIHPK RQEEINRGLR ALMEDPSVPP GLKRLLGYRY DEPRWEELSL EDLQPYLDRL DEASRKRILF TLLMQASTSR AVGILRRKGW KPRELAPEVN YQHELLRDLY EVSLPELERI RDAMLRSGAL AAKISGAGMG GSLLALTEGG EEEVVGSALR EGGKKAWILV PDEGARIDRV DG
|
| |
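The length fields in this record follow from the coordinates by simple arithmetic, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Spaces and other separators (such as the 10-base grouping used
    in this record) are ignored.
    """
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return sum(c in "GC" for c in bases) / len(bases)


# Coordinates taken from the record above.
length = gene_length(1051814, 1052962)
print(length)                  # 1149
print(protein_length(length))  # 382

# GC fraction of the first two 10-base groups of the gene sequence only.
print(gc_fraction("GTGGCGGGAG GGAGGTTCAC"))  # 0.7
```

Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the GC content field in the record.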