Gene Information |
Locus tag | Athe_1905 |
Symbol | |
ID | 7407318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2010712 |
End bp | 2012247 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716277 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_002573766 |
Protein GI | 222529884 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
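The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above (Athe_1905).
length = gene_length_bp(2010712, 2012247)
print(length)                              # 1536
print(expected_protein_length_aa(length))  # 511
```

Both results agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields of the record.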
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0197843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTATTT TGAACTCATC CCAGATGAAG GAAATTGACA GGAAAGCTTC GCAAGAAATA GGGATACCTG AAGTTGTGTT GATGGAGAAT GCTGGTTTTT GTGTTTTTGA AGAGATAAGA AAGGATTTTG AAACTCTTGA TGACAAAAAC ATTGCAGTTT TTTGCGGAAA AGGCAACAAT GGCGGCGATG GATTTGTTGT TGCAAGGTAT CTTGCCCAGG TTTGTCCAAA TGTAAAGGTA TTCTTATTCG ATGAGAATGT GACTTTGACA TCCAAAGTTT TCCTTGATAT ATTGAAAAGG CTGGAAGTCG ATGTTAGCAT TTTGTCAGAA GAACTATTAT TATCTCTCAA AACCCAGAGA TTTGACATAA TAGTTGACGC AATCTTTGGA ATAGGTCTTT CAAAGGAGAT TGATGGACTT TATAAAGAGG CAATTGAGTA TATAAATGGT AGCGGCGCTT ATGTATATTC GGTTGACATT CCAAGTGGAA TTTGCTCTGA TTCAGCTCAA GTTAAAGGCT GTGCTGTTAA GGCTAACAAG ACAGTAACGT TTGTTTACCC TAAGATAGGG AATATTTTGT ACCCAGGTAG CTATTATTGT GGAAAGGTAA TTGTTAAAGA TATAGGTATC CCTGAAAAGA TTATCAAAGA TATCAAGGTA AAAATTCTAA CAGCTGAGGA TTTAGATATA TCCAGATTTT ACAGGTTTCC CGACTCTCAC AAAGGCGATT ATGGCAAGGT GGGAATAGTT GCCGGGTCAA AGTATTATCC TGGTGCGGCG GTTCTGTGCA GCAATGCTGC AGTACAAAGC GGCTGTGGAC TTTGCTACTT AATAACACCC CAAGAAGCGC TTTATTTTCA GAATTTAAGA AAACCGGAGA TAATAGTGCT GCCTCTTGAG GGCAAAGAAG GTGTTATATC TTTTGATAGT TTTGTAAAAT TTAACGAATA CCTTGCCAAG CTTGATGTTT TGGGGTTTGG CTGTGGGCTT ACAAGGGATT TAGAAGTTGA AAAGATATTG ATTCATATTT TAGAAAATTT CCAAATACCT ATTGTAATAG ATGCAGATGG ACTAAATACT CTTTCATCAA GCCCAAAAGC AAGAGAACTT TTGGCAAGCT ATAAATCTCA AAAAGTTTTA ACTCCGCATT ATATGGAAGC TGCAAGAGTA CTTGATGTTG ATGTAAAAGA TGTTGCTAAA AGTCCCATTG ATGCTGCAAA GAAGATTGCA AGTGAATTTA GAGCTATATG CGTCCTAAAA GGTTCAAGGA CAATAATTAC AGATGGTGAT ATGGTTTTTA TAAACGTCCT TGGCAATCCT GGCATGGCAA AAGGCGGAAG CGGAGATGTT CTCACGGGTA TTATTTTGTC TATGATTGCT CAAGGATATT CTGCTTTTGA GGCGGCAAAA CTGTCGGTAT ATCTTCATTC TCTTTCGGCA GATATCTTGC TTGAAAAAAA GACAATGCAG ACAATTTTGC CCTCAGATAT TATAGAGGGG CTCAATAGTG CTATTAGGAG ACTAATTGAA GGTTAA
|
Protein sequence | MFILNSSQMK EIDRKASQEI GIPEVVLMEN AGFCVFEEIR KDFETLDDKN IAVFCGKGNN GGDGFVVARY LAQVCPNVKV FLFDENVTLT SKVFLDILKR LEVDVSILSE ELLLSLKTQR FDIIVDAIFG IGLSKEIDGL YKEAIEYING SGAYVYSVDI PSGICSDSAQ VKGCAVKANK TVTFVYPKIG NILYPGSYYC GKVIVKDIGI PEKIIKDIKV KILTAEDLDI SRFYRFPDSH KGDYGKVGIV AGSKYYPGAA VLCSNAAVQS GCGLCYLITP QEALYFQNLR KPEIIVLPLE GKEGVISFDS FVKFNEYLAK LDVLGFGCGL TRDLEVEKIL IHILENFQIP IVIDADGLNT LSSSPKAREL LASYKSQKVL TPHYMEAARV LDVDVKDVAK SPIDAAKKIA SEFRAICVLK GSRTIITDGD MVFINVLGNP GMAKGGSGDV LTGIILSMIA QGYSAFEAAK LSVYLHSLSA DILLEKKTMQ TILPSDIIEG LNSAIRRLIE G
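The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the database's own method: the helper names `clean`, `gc_percent` and `translate` are hypothetical, and it assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so a standard codon table is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order for first, second and third codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGTTTATTT TGAACTCATC CCAGATGAAG"))  # MFILNSSQMK
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein sequence and the GC content field to be checked in the same way.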