Gene Teth514_1061 details


Gene Information

Locus tag: Teth514_1061
Symbol:
ID: 5876908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 1098694
End bp: 1099632
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 39%
IMG OID: 641541416
Product: ROK family glucokinase
Protein accession: YP_001662696
Protein GI: 167039711
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
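
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1098694
end_bp = 1099632

# Inclusive coordinates: both the first and the last base are counted.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the last codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 939
print(protein_length)  # 312
```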


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.204957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATTG GTGTTGACCT TGGAGGCACT AATATTGCTG TTGGATTGGT GGATGAAGAG 
GGACGTATAG TAGCTACAGG TAGCAGGCCG ACAAAACCGG AAAGGGGTTA TGAAGCAGTT
GCCAAAGACA TAGCGGATAT AGCTATAGAA CTTATTAAGA GGAGTAATCT AAGTGGAAGT
GATATTAAGT CAATGGGAAT AGGAGTCCCG GGTGTAGCAG ATAGCGAAAG AGGCATAGTA
ATACGAGCGG TGAATCTCTT TTGGACAAAA GTTCCTCTTG CAAAAGAAAT AAGAAAATAC
ATTGATTTGC CAATATATAT GGACAATGAT GCTAATGTAG CAGCTTTGGC AGAAGCAGCA
TATGGGGCTG GCAAAGGTTC TAAGTCCTCA GTGACAATTA CCTTAGGAAC AGGAGTAGGT
TCTGGTTTTA TCCTTGATGG TAAAATATAC AATGGTGCAC ATCATTTTGC ACCTGAGCTT
GGACACATAG TGATAGGAGA CAATGGTATA AGATGTAACT GTGGTAAAAT TGGATGTTTA
GAGACATATG CTTCAGCAAC TGCTTTAATA AGAGAAGGGA AAAAGGCAGT AGAGAAGAAT
CCCAATTCTC TTATTTTAAA ATTTGCAAAT GGAGATATAA ACAGCATAAC GGCTAAAAAC
GTAATTGATG CTGCGAAGCA GTATGATGAA GACGCCATGA GGATTTTTAA TGACTATGTC
AAATACTTAG CTATTGGGAT TGTAAATGTG ATAAACATGT TTGACCCTGA AGTGATAATA
TTGGGTGGCG GAGTTGCAAA TGCAGGGGAT TTTCTCATAA AACCTCTTAA AAAAGAAGTG
GCAGAGAATA TTTTATTCAA AGACTTACCG TATGCTGACA TAAGAAAAGC AGAGCTTGGA
AATGATGCAG GGATCATCGG TGCTGCCATA TTAAGTTAA
 
Protein sequence
MRIGVDLGGT NIAVGLVDEE GRIVATGSRP TKPERGYEAV AKDIADIAIE LIKRSNLSGS 
DIKSMGIGVP GVADSERGIV IRAVNLFWTK VPLAKEIRKY IDLPIYMDND ANVAALAEAA
YGAGKGSKSS VTITLGTGVG SGFILDGKIY NGAHHFAPEL GHIVIGDNGI RCNCGKIGCL
ETYASATALI REGKKAVEKN PNSLILKFAN GDINSITAKN VIDAAKQYDE DAMRIFNDYV
KYLAIGIVNV INMFDPEVII LGGGVANAGD FLIKPLKKEV AENILFKDLP YADIRKAELG
NDAGIIGAAI LS
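
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own procedure. It builds the codon table from the NCBI layout (bases in T, C, A, G order); translation table 11 uses the same amino-acid assignments as the standard code and differs only in the start codons it permits. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first row of the gene sequence is used to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence from the record (60 bases).
first_row = "ATGAGAATTG GTGTTGACCT TGGAGGCACT AATATTGCTG TTGGATTGGT GGATGAAGAG"

print(translate(first_row))  # MRIGVDLGGTNIAVGLVDEE
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.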