Gene Anae109_2331 details


Gene Information

Locus tag: Anae109_2331
Symbol:
ID: 5374037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 2695686
End bp: 2697221
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 79%
IMG OID: 640843850
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_001379517
Protein GI: 153005192
COG category: [G] Carbohydrate transport and metabolism; [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region
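
The length fields in this record can be cross-checked from the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (an assumption that matches the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2695686, 2697221)
print(length)                     # 1536
print(protein_length_aa(length))  # 511
```

Both values agree with the Gene Length and Protein Length fields listed in the record.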


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTCG TCGGCTCGGC TGAGATGCGC GCCATCGATC GCGCCGCCAT CGACGCCTTC 
GGCGTGCCCT CGCTCGCCCT CATGGATCGG GCGGGGCGCG CCGTCGCGGA GGCCGTCCGA
TCGCTCTGCG CGCCTGGCGG GCGGATCGTC GTCGTCTGCG GCGGCGGTAA CAACGGCGGC
GACGGCTACG TGGCGGCGCG GGTCCTGCGG GCGGAGGGCT GGGACGCGCG CGTCGTCTCG
ATCGTGCCGG CGGCGCGGCT TTCCGGCGAT GCGCGCGTGA CGCGCGAGGA GGCGGAGCGC
GCGGGCGTGC CGATCGACGA GGCGGGCGAG CTGCTCACCG TGGACGCCGG GCCCGGGGAC
GTGGTGGTGG ACGGCGTGTT CGGCACCGGC CTCACCCGCG CGCCGGAGGG CGCGTTCGCG
CGAGCCATCG AGCGGATCGA CGCCGCCCGC GCGGCGGGGG CGCGCGTCGT GGCGGTGGAC
GTCCCGTCCG GCCTGTCGGC GGACACCGGC CGCCCGCTCG GCGCGGCCTG CGTCCGGGCG
GACCGGACCG TCACCTTCGC CTTCCAGAAG CGCGGGCTCG TCCTCCACCC GGGGCCGTCC
GTCGCCGGCG AGGTGATCGT CGCCGACATC GGCATCCCGC TCGAGGCCGC CGCGCGGGTG
CCGCTCACCT GCGAGCTGCT CGAGGCGGCT CAGGCGCAGG CGCTCCTCCC GGCGCGCTCG
CCCGACGCCC ACAAGGGCGA CGCCGGCCGG CTGCTCGTGG TGGCCGGGTC GCCCGGCAAG
ACGGGCGCGG CCCACCTCGC GCTCACCGGC GCGCTGCGCG GCGGCGCCGG CCTCGTCACG
CTCGCCGCCC GCGCCGAGGC GCTGCCGCTC GCGCTGTTCG GCCGCCCGGA GGCGATGAGC
GTGGCGCTTC CCGGCGCGGG TCCGCTCGGC CGGGCAGATC TCCAGGCGCT CCTCGCGGCG
GCGAAGGGCG TGGACGCGCT GGCCATCGGC CCGGGCATCC CGCGCGGCGA GGAGACGGGT
GAGCTGCTGC GGGCGCTGCT CGAGCGGGCG CGGCTGCCGG CGGTGCTCGA CGCGGACGCG
CTGAACGCGC TCGCCGACGA GCCCGGCCGG CTCGCCGCGC TGGGCGAACC GCTCGTGCTC
ACGCCGCACC CCGGCGAGAT GGCGCGCCTG TGCGGGACCC CGATCGACGA GGTGCAGGCG
GACCGCATCG AGGTCGCGCG CGCGAAGGCG CGAGAGTGGG GCGTGACGGT GGTGCTGAAG
GGGGCGCGCA CGGTGGTCGC CGATCCGCAC GGGCCCGCGG CGGTGATCCC GACCGGCAAC
GCCGGGATGG CGACGGGCGG CACCGGCGAC GTGCTCGCGG GGCTCATCGG CGCGCTCCTC
GCCGGCGGCC TTCCGCCCGG GGCGGCGGCG CGCGTCGGCG CGTGGGTTCA CGGCCGGGCA
GGGGACCGCG TCGCGGCCCG GCTCGGCGAG CGTGGGCTCC TCGCCGGTGA TCTGGGCGAG
GCCATCGGCG AGGTGTGGGC GGAGTGGCGG CGATGA
 
Protein sequence
MRLVGSAEMR AIDRAAIDAF GVPSLALMDR AGRAVAEAVR SLCAPGGRIV VVCGGGNNGG 
DGYVAARVLR AEGWDARVVS IVPAARLSGD ARVTREEAER AGVPIDEAGE LLTVDAGPGD
VVVDGVFGTG LTRAPEGAFA RAIERIDAAR AAGARVVAVD VPSGLSADTG RPLGAACVRA
DRTVTFAFQK RGLVLHPGPS VAGEVIVADI GIPLEAAARV PLTCELLEAA QAQALLPARS
PDAHKGDAGR LLVVAGSPGK TGAAHLALTG ALRGGAGLVT LAARAEALPL ALFGRPEAMS
VALPGAGPLG RADLQALLAA AKGVDALAIG PGIPRGEETG ELLRALLERA RLPAVLDADA
LNALADEPGR LAALGEPLVL TPHPGEMARL CGTPIDEVQA DRIEVARAKA REWGVTVVLK
GARTVVADPH GPAAVIPTGN AGMATGGTGD VLAGLIGALL AGGLPPGAAA RVGAWVHGRA
GDRVAARLGE RGLLAGDLGE AIGEVWAEWR R
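
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be joined into a contiguous string and how a GC fraction could be computed for the nucleotide sequence. The helper names are hypothetical, and only the first line of the gene sequence is used here for illustration.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listed above (six blocks of ten bases).
first_line = join_blocks(
    "ATGAGGCTCG TCGGCTCGGC TGAGATGCGC GCCATCGATC GCGCCGCCAT CGACGCCTTC"
)
print(len(first_line))               # 60
print(first_line.startswith("ATG"))  # True
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence through `join_blocks` and `gc_fraction` would allow the GC content field of the record to be checked in the same way.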