Gene Tpen_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1647 
Symbol 
ID4601241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1594696 
End bp1596270 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content65% 
IMG OID639774420 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_921045 
Protein GI119720550 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0299074 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGGTAG CCAGGGTAAG CGAGATAAAG CTTCTCGACA GGGAGGCCGC GGAGAAGTAC 
GGCGTAAAGG AGGAGATCCT GATGGAGAAT GCCGGCGCCA GCGTAGCCAG GCTTGCCGTG
TCGCTCATAG GGCTCCCCAT GAGCGCGGCA GTTGTCTGCG GGCCGGGGAA CAACGGGGGA
GACGGGCTCG TAGCGGCTAG GCACCTTTCA AGCATGGGCG CGGACGTCAA GGTCTTCCTG
GTGGCGGCCC CGGACAAGCT GGCAGGCATA GTCAAGGAGA ACTACGAGCG CGTAGTCAAG
GCCGGGATAG CCGTGGAAGT AGTGGATGAG GAGAGGGCGG AGGGGCTCTC CGAGGAGCTC
TCCCTCTTCG ACGTCGTCGT AGACGCGCTG TTCGGGACGG GCCTCTCCAG GCCCCTGGAG
GGTGTCTACA GGAAGGTTGT AGAGGCGATA AACGGTAGCG GTTCCCTAGT GATAAGCGTC
GACATCCCCT CGGGGGTCCA CGGGGACACG GGGCAGGTTC TAGGCGTAGC TGTGAGAGCT
GACTACACCG TGACGTTCGG GCTCCCGAAG CTCGGGAACC TCATGTACCC CGGCGCCGAG
CTAGGAGGCG AGCTGTACGT ACACCACATC TCCTACCCCA GGGCGTTGCT CGAGGACAGC
CGGTTGAAGG TGGAGACGAA CGACCCCGTA CCCCTGCCGC CGAGGAGGCC GGACACGCAC
AAGGGGGACT ACGGGAAGGC GCTGTTCGTC GCGGGCTCGC GCAGGTACAT GGGGGCACCC
CTGCTCTGCT CAAAGTCCTT CCTGAAGGCG GGCGGCGGGT ACTCTAGGCT CGCGACGATT
AAGTCCATCG TGCCGTTCCT AGGCGTGAGG GCCCCTGAGG TGGTCTACGA GGCGCTCGAG
GAGACAGCCT CGGGCACGGT CGCCTACGGC AACCTCGAGA GGATACTGGA GCTCTCGAAG
TCCTCGGACA TAGTGGCCGT GGGGCCGGGG CTCGGGCTCG AGGAGGAGAC GCTGAGGCTT
GTCTGCGACC TAGCTAGGAG CGTCGAGAAG CCGCTCATAG TGGACGGCGA CGGCCTCACC
GCGGTTGCCC GATGCGGCGA GTACATCTCG GAGAGAAGGG CTCCCACAGT GCTCACCCCG
CACGCCGGCG AGATGTCGAG GCTGACGGGG AAGAGCGTGG AGGAGGTAAG GGCGTCCCGG
GTCGACGCGG CGCTGGAGCT CGCCGGGAAG CTTAAAGCCT ACGTGGTGCT CAAAGGGGCT
CACACGGTCA TAGCAACCCC GGATGGGAGG GCGTACATCA ACCTGTCGGG CAACCCCGGC
ATGGCGACCG CGGGCTCCGG GGACGTGCTG GTAGGGGCCA TAGCGGCGCT CTACGGGCTC
GGCCTGGGCT TCGAGGAGGC TGTGAGGATG GGCGTCTTCG TGCACGGGCT CGCAGGGGAC
ATAGCGGCGG AGGAGCGCGG GCAGGACGGC TTAACTTCAG TGACTCTGAT GAACTACCTC
CCCAAGGCTC TGAGGGCGCT GAGGGAAGAC TTCGAGAGCG TGCTCGAAAG GTATACCATT
AAGGTCCTGC CGTAG
 
Protein sequence
MKVARVSEIK LLDREAAEKY GVKEEILMEN AGASVARLAV SLIGLPMSAA VVCGPGNNGG 
DGLVAARHLS SMGADVKVFL VAAPDKLAGI VKENYERVVK AGIAVEVVDE ERAEGLSEEL
SLFDVVVDAL FGTGLSRPLE GVYRKVVEAI NGSGSLVISV DIPSGVHGDT GQVLGVAVRA
DYTVTFGLPK LGNLMYPGAE LGGELYVHHI SYPRALLEDS RLKVETNDPV PLPPRRPDTH
KGDYGKALFV AGSRRYMGAP LLCSKSFLKA GGGYSRLATI KSIVPFLGVR APEVVYEALE
ETASGTVAYG NLERILELSK SSDIVAVGPG LGLEEETLRL VCDLARSVEK PLIVDGDGLT
AVARCGEYIS ERRAPTVLTP HAGEMSRLTG KSVEEVRASR VDAALELAGK LKAYVVLKGA
HTVIATPDGR AYINLSGNPG MATAGSGDVL VGAIAALYGL GLGFEEAVRM GVFVHGLAGD
IAAEERGQDG LTSVTLMNYL PKALRALRED FESVLERYTI KVLP