Gene Athe_0552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0552 
Symbol 
ID7408678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp621716 
End bp622666 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content42% 
IMG OID643714935 
Productglucokinase, ROK family 
Protein accessionYP_002572451 
Protein GI222528569 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTATTACA TAGGAATTGA CTTGGGAGGA ACAAACATTG CAGCAGGAAT TGTGGATGAA 
GAAGGGAAGA TTATAAAAAA AGGTTCTGTG CCAACAGGAG CGCACAGACA TTATACAGAG
ATTATGAAGG ATATGGCAGA GCTTTCTTTA AATCTTGTAA AGGAATGTGG ACTTACACTT
GATGATATTC ATTCTGTTGG AATTGGAAGT CCAGGTGCAC CTGACAATGA AAAGGGCATG
ATTCTTTACA GCAACAATAT TGCGTTTTTG AATGTCCCTA TGAGAGAAGA GATACAAAAA
TACATTCCAA AGCCTGTTAA CATAGAAAAC GATGCAAACT GTGCAGCATA TGGCGAGTAT
ATAGCAGGCG GTGCAAAAGG CACAAGGATT TCGGTAACAA TAACTCTTGG CACGGGAATT
GGTGGTGGAA TTATAATTGA TGGTAAGATT TACACAGGTT CGCACCATGC AGGTGCTGAG
CTTGGGCATA TGGTGATTTG TGTTGACGGT GAGCAGTGCA CATGTGGCCG AAAAGGGTGC
TGGGAAGCGT ATGCGTCTGC AACAGCTCTT ATTCGTATGA CAAGAGAGGC TGCAGCAAGA
AATATCAATG GTACTATCAT GAAACTTGTA AATGGCGATA TTTCAAAGAT TGATGCAAAA
ACAGCTTTTG ACGCAAAGCG AATGGGAGAC AGCACTGGTA CAGCAATTGT TGACAGGTAT
GTAAAATACC TTGCTGAAGG CCTTGCAAAC ATCTGCAATA TATTTGAACC CGAAGTTATA
TGTATTGGCG GTGGAGTTAG CAAAGAAGGA GAGTATCTTT TAGAGCCTGT GAGAAGGCTT
GTATATGAAA AATTCTACTG CAAACAGGTT CCAATGCCCA AAATCATTCC TGCCGTTTTG
GGCAATGATG CTGGTATAAT TGGGGCTGCG CTTTTAGCAA AGCAGCTGTG A
 
Protein sequence
MYYIGIDLGG TNIAAGIVDE EGKIIKKGSV PTGAHRHYTE IMKDMAELSL NLVKECGLTL 
DDIHSVGIGS PGAPDNEKGM ILYSNNIAFL NVPMREEIQK YIPKPVNIEN DANCAAYGEY
IAGGAKGTRI SVTITLGTGI GGGIIIDGKI YTGSHHAGAE LGHMVICVDG EQCTCGRKGC
WEAYASATAL IRMTREAAAR NINGTIMKLV NGDISKIDAK TAFDAKRMGD STGTAIVDRY
VKYLAEGLAN ICNIFEPEVI CIGGGVSKEG EYLLEPVRRL VYEKFYCKQV PMPKIIPAVL
GNDAGIIGAA LLAKQL