Gene Teth514_0049 details


Gene Information

Locus tag: Teth514_0049
Symbol:
ID: 5877701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 53601
End bp: 54548
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 39%
IMG OID: 641540396
Product: ROK family glucokinase
Protein accession: YP_001661708
Protein GI: 167038723
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
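
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the protein length excludes the stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the final codon is a stop and is not counted."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(53601, 54548)
    print(n_bp, protein_length(n_bp))
```

With the values from this record, the sketch gives 948 bp and 315 aa, matching the Gene Length and Protein Length fields.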


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00642511
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAGAA TCCTATGCGG TGTTGACTTA GGAGGGACAA AGATAAGCAC AGGACTCGTG 
GATGAGAAGG GCAATATCAT AAGAAGCACA AAAATACCTA CTATGGCTGA AAAAGGGCCA
GAAGAAGTGA TTAAGCGAAT AGAACAAAGT GTTTATGACG TTTTAAAAGA AGCAGGATTA
AAATTATCTG ATTTAAAAGG AGTTGGGATT GGATCTCCAG GGCCCCTTGA TGCCAAAAGG
GGGGTAGTTA TAAGCCCACC TAATCTTCCG GGTTGGGACA ATGTCCCTAT TGTGGATATT
TTATCTCATA AGCTGGGAGT TAAAGTAAAA TTAGAGAATG ATGCAAATGC TGCAGCTATT
GGAGAACATT TATTTGGCGC AGGAAAAGGT ATAGACAATT TTGTTTATAT AACTGTAAGT
ACTGGGATTG GCGGTGGCGT GATAATAGAA GGGAAACTTT ATAGCGGAGA AAATTCTAAT
GCAGCTGAGA TTGGGCACCA TACTATAAAT TTTAATGGAC CTCGGTGTAA TTGCGGAAAT
TATGGATGTT TTGAGGCTTT TGCTTCTGGT ACAGCTATTG CGAGATTTGC TCAAGAGGGA
ATTCAAAATG GGAAAGATAC AATGATAAGG GATTTAGCTA AGGATGGAGT AGTAAAATCA
GAACATGTAT TTGAAGCGGC AAAATTGGGA GATGAGTTTG CGAAAGAATT GGTTGACAAT
GAAGCTTTTT ATCTTGGAGT GGGAATTTCA AATATCATGG CTTTTTATAA TCCAAAAAAA
ATTGCCATAG GTGGTGGCGT ATCCACTCAA TGGGATATGC TTTATGATAA AATGATGGAA
ACAATAAAGA AAAAAGCATT GAAGCCTAAT GCCGAGGTTT GTGAGGTAGT AAGAGCTGAA
TTAGGAGAAA ACGTGGGAGT TTTAGGAGCG GCAGCATTAC TCTTATAA
 
Protein sequence
MGRILCGVDL GGTKISTGLV DEKGNIIRST KIPTMAEKGP EEVIKRIEQS VYDVLKEAGL 
KLSDLKGVGI GSPGPLDAKR GVVISPPNLP GWDNVPIVDI LSHKLGVKVK LENDANAAAI
GEHLFGAGKG IDNFVYITVS TGIGGGVIIE GKLYSGENSN AAEIGHHTIN FNGPRCNCGN
YGCFEAFASG TAIARFAQEG IQNGKDTMIR DLAKDGVVKS EHVFEAAKLG DEFAKELVDN
EAFYLGVGIS NIMAFYNPKK IAIGGGVSTQ WDMLYDKMME TIKKKALKPN AEVCEVVRAE
LGENVGVLGA AALLL
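
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names (`clean`, `translate`, `gc_percent`) are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    first_blocks = "ATGGGGAGAA TCCTATGCGG TGTTGACTTA"
    last_line = "TTAGGAGAAA ACGTGGGAGT TTTAGGAGCG GCAGCATTAC TCTTATAA"
    print(translate(first_blocks))
    print(translate(last_line))
```

Applied to the first three blocks and to the final line of the gene sequence, the translation reproduces the start (MGRILCGVDL) and end (LGENVGVLGAAALLL, followed by a stop) of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.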