Gene Tpen_0901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0901 
Symbol 
ID4602224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp849510 
End bp850469 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content61% 
IMG OID639773680 
ProductROK family protein 
Protein accessionYP_920305 
Protein GI119719810 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGTACG CCGTAGCAGT GGATATAGGC GCTACGCAGA CCCGCGTAGC CCTCGGGAAC 
GACGAGGGGG AGATTCTAGA GCTCCACGTC TTCAAGACGT CTTCCTTCCC CGGGCCCGAC
GAGTACCTGC GCCACATAGC CGGCTTGGCT TTAAGCCTAG AGAAAAAGCA CGGCGTGGAA
GTAGAGGGTA TAGGGGTGGG ATCTCCGGGC CCCCTCGACA TGAAGAAGGG GGAGGTACTC
AAATCCGTGA ACATGCCTTT CGATAGGCTA CCCGTAGTCT CCGCCCTGAA GTCCCTGACG
GGGAAGAAAG TGGCTTTTGC GAACGACGCG GTGACGGCGG CGGTTGGGGA GAAGTACTGG
GGCGCGGGGA GGGGGTTGGA GAACCTGGTC TACGTCACTA TAAGCACGGG GATAGGCGCT
GGGATCTACG TCGACGGGGA GCTCCTCCTC GGCAAGCACG GGAACGCCCA CGAAGTAGGC
CACGTCGTCG TCGACTCGGG GGAAGAGATG ACGTGTGGTT GCGGCAAGAA GGGACACTGG
GAGGCCTACT GCTCCGGGTC CGGCATACCC AGGTACGCGA AGTTCCTGGC GGCCAGAAAC
CCCGAGCTCT GGGAGAAGAG CCCCCTCAAG TCCAGGGAGC CCCTCACGGC TAAAGACGTA
TTCGACGCGT TCCGGGAGGG CGACGCGCTG GCAAGGCTTG TTATGGAGAG AGTTAGGAAG
TTCAACGCGT ACGGGTTCGC GGTGCTCGTC AACGTGTACG ACCCCGAGAT AATCACAGTG
GGTGGATCGG TCGCCCTGAA CAACCCGGAC GTCCTGCTCG CCGGGCTGAA GGAGGAGGTC
GAGAAGTACG CTTTAAACGT GGTTCCTGAG ATCAGGCTGA CCCCGCTCGG AGACAAGATA
GGGGTTCTGG GAGCGCTGGC GCTGGGCCTC GGCCTCGAAA AGAAGGTCCC CCTCGTTTAA
 
Protein sequence
MKYAVAVDIG ATQTRVALGN DEGEILELHV FKTSSFPGPD EYLRHIAGLA LSLEKKHGVE 
VEGIGVGSPG PLDMKKGEVL KSVNMPFDRL PVVSALKSLT GKKVAFANDA VTAAVGEKYW
GAGRGLENLV YVTISTGIGA GIYVDGELLL GKHGNAHEVG HVVVDSGEEM TCGCGKKGHW
EAYCSGSGIP RYAKFLAARN PELWEKSPLK SREPLTAKDV FDAFREGDAL ARLVMERVRK
FNAYGFAVLV NVYDPEIITV GGSVALNNPD VLLAGLKEEV EKYALNVVPE IRLTPLGDKI
GVLGALALGL GLEKKVPLV