Gene Information |
Locus tag | Athe_0939 |
Symbol | |
ID | 7407840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1040504 |
End bp | 1041673 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643715308 |
Product | galactokinase |
Protein accession | YP_002572817 |
Protein GI | 222528935 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
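
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1040504, 1041673)
print(length, protein_length_aa(length))
```

Under these assumptions the results agree with the Gene Length (1170 bp) and Protein Length (389 aa) rows.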
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATTG AAGAACTTTT GAAGATGTTC AAAGATGTAT ATGGCGAAAA TAAAGAGCCT ATCAGGTGCT TTTTCTCAAG TGGGAGAGTA AACCTAATCG GTGAGCATAC AGACTACAAC GGTGGGTATG TTTTCCCTGC GGCACTCAAT GTTGGTACCA CCGTTCTTGC TCGAAAAAGA AACGACAAAA AAATTTGCTT GTATGCAACA GACCTAAAAG AGCTTGTTGA AGCAGACATC GATAAGATTG ATGAGTATAA AAATATCAGA TGGGGAAATT ACCAGCTTGG TGTTGTAAAA GAGCTAAAAG AGGCAGGATA TGAAGTTGGA GGTCTTGACA TGCTCTTTCA CGACACTGTA CCGCACGGAG CAGGGCTTTC ATCATCTGCT GCAATTGAGT GTGCAACTGG AATTGCAGTT TACTCCCTAT TCAATAACAA GCCAATTGAC AGGCTCAAGC TTAGCTTCAT ATGCCAAAGA GCTGAGAACA GGTTTGTTGG GGTAAACTGT GGTATTATGG ATCAGTTTGC TTCAAGTCTT GGGAAAAAAG ACCATGCCAT TTTTCTCAAC ACACGCACTA TGGAGTACAG GTATGTACCT TTAAAACTTG GTGATTACAA GATTGTCATT AGCAACACTA ACAAGAAAAG AAGTCTTGCA GACTCAAAAT ACAATGAAAG AAGGTCTCAG TGCGAAAAAG GGCTTGAACT TTTGAAAAAG GAACTTAACA TTTCGTGCCT TGGCGAGCTT GACGTTGAGA CGTTTGAAAA GTACAAAGAT TTAATTGATG ACGAAATAAT CCTAAAAAGA GTGAGACATG TTGTGTACGA AGACGATAGA GTTTTAAAAT CCATTGAAGT TTTGCAAAAA GGTAACCTTG AAGCTTTTGG CAAGCTTATG ATACAATCTC ATATATCGCT AAGAGACGAC TATGAGGTAA CAGGACTTGA GCTTGATACA CTTTTTGAAG AAGCTCTGAA AATAGAGGGT GTAATTGGTA CACGAATGAC TGGTGCTGGT TTTGGTGGTT GCACAGTTTC AATTGTTCAT AAGGATGCAA TAGAAGAGTT TATAAGAAAA GTAGGGGAAA GCTATTATGC AAAGACAGGT CTTAAAGCAG ATTTTTACAC ATTTGAGATT GACGATGGTG CAAGGGAGAT AGAAATTTAA
|
Protein sequence | MKIEELLKMF KDVYGENKEP IRCFFSSGRV NLIGEHTDYN GGYVFPAALN VGTTVLARKR NDKKICLYAT DLKELVEADI DKIDEYKNIR WGNYQLGVVK ELKEAGYEVG GLDMLFHDTV PHGAGLSSSA AIECATGIAV YSLFNNKPID RLKLSFICQR AENRFVGVNC GIMDQFASSL GKKDHAIFLN TRTMEYRYVP LKLGDYKIVI SNTNKKRSLA DSKYNERRSQ CEKGLELLKK ELNISCLGEL DVETFEKYKD LIDDEIILKR VRHVVYEDDR VLKSIEVLQK GNLEAFGKLM IQSHISLRDD YEVTGLELDT LFEEALKIEG VIGTRMTGAG FGGCTVSIVH KDAIEEFIRK VGESYYAKTG LKADFYTFEI DDGAREIEI