Gene Athe_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0939 
Symbol 
ID7407840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1040504 
End bp1041673 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content39% 
IMG OID643715308 
Productgalactokinase 
Protein accessionYP_002572817 
Protein GI222528935 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATTG AAGAACTTTT GAAGATGTTC AAAGATGTAT ATGGCGAAAA TAAAGAGCCT 
ATCAGGTGCT TTTTCTCAAG TGGGAGAGTA AACCTAATCG GTGAGCATAC AGACTACAAC
GGTGGGTATG TTTTCCCTGC GGCACTCAAT GTTGGTACCA CCGTTCTTGC TCGAAAAAGA
AACGACAAAA AAATTTGCTT GTATGCAACA GACCTAAAAG AGCTTGTTGA AGCAGACATC
GATAAGATTG ATGAGTATAA AAATATCAGA TGGGGAAATT ACCAGCTTGG TGTTGTAAAA
GAGCTAAAAG AGGCAGGATA TGAAGTTGGA GGTCTTGACA TGCTCTTTCA CGACACTGTA
CCGCACGGAG CAGGGCTTTC ATCATCTGCT GCAATTGAGT GTGCAACTGG AATTGCAGTT
TACTCCCTAT TCAATAACAA GCCAATTGAC AGGCTCAAGC TTAGCTTCAT ATGCCAAAGA
GCTGAGAACA GGTTTGTTGG GGTAAACTGT GGTATTATGG ATCAGTTTGC TTCAAGTCTT
GGGAAAAAAG ACCATGCCAT TTTTCTCAAC ACACGCACTA TGGAGTACAG GTATGTACCT
TTAAAACTTG GTGATTACAA GATTGTCATT AGCAACACTA ACAAGAAAAG AAGTCTTGCA
GACTCAAAAT ACAATGAAAG AAGGTCTCAG TGCGAAAAAG GGCTTGAACT TTTGAAAAAG
GAACTTAACA TTTCGTGCCT TGGCGAGCTT GACGTTGAGA CGTTTGAAAA GTACAAAGAT
TTAATTGATG ACGAAATAAT CCTAAAAAGA GTGAGACATG TTGTGTACGA AGACGATAGA
GTTTTAAAAT CCATTGAAGT TTTGCAAAAA GGTAACCTTG AAGCTTTTGG CAAGCTTATG
ATACAATCTC ATATATCGCT AAGAGACGAC TATGAGGTAA CAGGACTTGA GCTTGATACA
CTTTTTGAAG AAGCTCTGAA AATAGAGGGT GTAATTGGTA CACGAATGAC TGGTGCTGGT
TTTGGTGGTT GCACAGTTTC AATTGTTCAT AAGGATGCAA TAGAAGAGTT TATAAGAAAA
GTAGGGGAAA GCTATTATGC AAAGACAGGT CTTAAAGCAG ATTTTTACAC ATTTGAGATT
GACGATGGTG CAAGGGAGAT AGAAATTTAA
 
Protein sequence
MKIEELLKMF KDVYGENKEP IRCFFSSGRV NLIGEHTDYN GGYVFPAALN VGTTVLARKR 
NDKKICLYAT DLKELVEADI DKIDEYKNIR WGNYQLGVVK ELKEAGYEVG GLDMLFHDTV
PHGAGLSSSA AIECATGIAV YSLFNNKPID RLKLSFICQR AENRFVGVNC GIMDQFASSL
GKKDHAIFLN TRTMEYRYVP LKLGDYKIVI SNTNKKRSLA DSKYNERRSQ CEKGLELLKK
ELNISCLGEL DVETFEKYKD LIDDEIILKR VRHVVYEDDR VLKSIEVLQK GNLEAFGKLM
IQSHISLRDD YEVTGLELDT LFEEALKIEG VIGTRMTGAG FGGCTVSIVH KDAIEEFIRK
VGESYYAKTG LKADFYTFEI DDGAREIEI