Gene Tpen_1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1114 
Symbol 
ID4600856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1051814 
End bp1052962 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content61% 
IMG OID639773891 
ProductGHMP kinase 
Protein accessionYP_920516 
Protein GI119720021 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGGAG GGAGGTTCAC TGCATCGGCT CCGGGTAGGG TGGATTTCCT GAACACGCAC 
CAGGACTACA AGGGGCTCCC GGTCGTACCC GTAGCTATAA ACCTGCGTAC GTACGTGGAT
GTTCTGGGAA GGAGCGAGCT GTTCGAGGTT AAAAGCGAGG CTCTGTGCGC GGAGGGCTTA
GAGTGCGTAG ACAGGTTTCC GCCCACCAAT CCTCCTCTGG TCGAGGGGAG GTGGTGGGGG
AACTACCTGC GCGCCGTTGT GAGGGCTGTC GAGGAGTACC TCGGGAAGCC CCTCCCGGAG
GGCTTCAGGG CTGTCGTGAG GAGCGAGGTG CCCGTGGGTA GCGGTTTGTC GAGTAGCGCT
GCGCTCGAAG TCTCGTTTCT AAAGGCTATC GACTACTACT TCAACCTCGG GCTCGGGAAA
AAGGAGCTAG CGGAGCTGGC ATTCCAGGCT GAGAACAGGA TTGCGGGTAT ACCTTGCGGC
AGGCTAGACC AGTACGCGTC CGCGTACGGC GGCGTGATAC TCCTCAAGCC CAGACCCCCG
GTTGAAGTCG AGGAGCTAGA GCCGGGTAGC CTCAGGTTCG TAGTCGTAGA CTCCGGTATA
CGCCACAGCG TGGCAGACAT CCACCCGAAG AGGCAGGAAG AGATAAACCG GGGGCTGAGA
GCGCTCATGG AGGACCCCTC CGTTCCTCCT GGCCTTAAGA GGCTACTCGG GTACCGTTAC
GACGAGCCCA GGTGGGAGGA GCTATCCCTG GAGGATCTCC AGCCGTACCT AGACAGGCTG
GACGAGGCTT CGAGAAAGAG GATACTGTTC ACGTTGCTAA TGCAGGCATC CACCTCGAGG
GCTGTCGGGA TTCTAAGGAG GAAAGGCTGG AAGCCGCGGG AACTGGCCCC CGAGGTGAAC
TACCAGCACG AGCTCCTGAG AGACCTCTAC GAGGTTAGCC TCCCGGAGCT CGAGAGGATA
CGTGACGCGA TGCTCCGCTC GGGTGCCCTA GCAGCCAAGA TAAGCGGGGC CGGGATGGGC
GGAAGCCTTC TAGCCTTAAC CGAGGGCGGA GAAGAGGAGG TCGTGGGCTC GGCGCTAAGG
GAGGGCGGCA AAAAAGCTTG GATTCTCGTA CCGGACGAGG GCGCTCGGAT CGACCGCGTA
GACGGCTAG
 
Protein sequence
MAGGRFTASA PGRVDFLNTH QDYKGLPVVP VAINLRTYVD VLGRSELFEV KSEALCAEGL 
ECVDRFPPTN PPLVEGRWWG NYLRAVVRAV EEYLGKPLPE GFRAVVRSEV PVGSGLSSSA
ALEVSFLKAI DYYFNLGLGK KELAELAFQA ENRIAGIPCG RLDQYASAYG GVILLKPRPP
VEVEELEPGS LRFVVVDSGI RHSVADIHPK RQEEINRGLR ALMEDPSVPP GLKRLLGYRY
DEPRWEELSL EDLQPYLDRL DEASRKRILF TLLMQASTSR AVGILRRKGW KPRELAPEVN
YQHELLRDLY EVSLPELERI RDAMLRSGAL AAKISGAGMG GSLLALTEGG EEEVVGSALR
EGGKKAWILV PDEGARIDRV DG