Gene Mlg_0568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0568 
Symbol 
ID4270898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp615878 
End bp617356 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content75% 
IMG OID638125310 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_741412 
Protein GI114319729 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0000000379756 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATACAC TACCGGAATA CCTCTACACG CCGGCCCAGG TCCAGGAGCT GGACCGCCGG 
GCCATTCAGG ATCACGGCCT ACCCGGGTTG TCGCTGATGG AGCGCGCCGG CCGCCGGGGC
TGGGAAGTGC TGCTCAAGCA CTGGCCCCAC GTGCGCCGCC TGCGGGTGCT CTGCGGCGGG
GGCAATAACG GTGGCGACGG CTACGTGGTG GCGCGGCTGG CCCGGCGCGC CGGCCTCGGT
GTGCGCCTGC AGGCCCTGTC CGACCCCGAC CGGCTCAACG GCGATGCCGC TACCGTGGCG
CGCCGCTTCC AGGAGGAGGG GGGGCAGATC GAGTCCTGGG ATCCGACCGG CCTGGCGGAC
GAGGACGTGG TGGTGGACGC CCTGCTGGGC ACCGGCCTGG ACCGGCCGGT GGAGGGGCGC
TACCGGGAGG CGTTGCAGGC GCTGAAGGCG GCCGGCGTGC CGGTGCTGGC CATCGACGTG
CCCTCGGGGC TCAACGCCGG CACCGGTGCC GTGATGGGGG AGGCGGTGGA GGCGCACTGC
ACGGTGACCT TCATCGGTCT CAAGCCGGGG CTGCTCACCG GCGCCGGCCC GCAGTGTGCC
GGCACCCTCT ACTTCGACGA CCTGGGTGTG CCGCCGGAGA TCTACCAGGA TATGGCCCCG
GTGGCGGGTC TCTGCCGCGA TGAGCTGTTG CGCCGGTGCC TCGGCCCCCG CCCGGCCCAT
GCCCACAAGG GCCAGTTCGG CCACGCCTTG GTGATCGGCG GTGATCTGGG CATGGGTGGT
GCGGCGCGGA TGGCCGGCGA GGCGGCGGGC CGCACCGGGG CGGGGCTGGT CAGCGTGGCC
ACCCGCCCGG CGCACGTCGC CGCCCTGCTG GCCGCGCGCC CGGAGCTGAT GGTCCACGGC
CTGGACAGCG CCGAGGGCCT GGCGCCGCTG CTGGAGAAGG CCACTGCCTG GGCCCTGGGG
CCGGGGCTGG GGCAGGGGCC GTGGGGGCGC GCGCTCTGGG AGGCGGCGCT GCGGACTGAG
CACCCCTGCG TGCTCGATGC CGATGCCCTC AACCTGCTGG CCGCCGACCC GCGTCCCTGC
CCCAACGCCC TGCTCACCCC CCACCCGGGC GAGGCGGCCC GGCTGCTGGG TGTGACCCCT
GCCGAGGTGC AGGCGGATCG GCTGGCCGCG GCCGACGCGC TGGTGGAACG CTACCGCGGG
GCAGTGGTGC TCAAGGGCGC CGGCAGTGTG ATCGCCGCCC CGGGGGCCCT GCCGCGCTTG
GTCACCGCCG GCAATCCGGG GATGGCCAGC GGCGGCATGG GCGATGTCCT CACCGGGGTG
GTGCTCGGGT TGCTGGCACA AGGCCTGTCC GCCGTGGAGG CGGCCGAATT GGGGGCGCTG
GTGCATGCCC GCGCCGCTGA CCGGGCCGCC CGGGCCGGGG AGCGCGGGCT GCTGGCGGGC
GATGTGCTGA TGGCCCTGCG TGCCGAGGTC AACCCGTGA
 
Protein sequence
MNTLPEYLYT PAQVQELDRR AIQDHGLPGL SLMERAGRRG WEVLLKHWPH VRRLRVLCGG 
GNNGGDGYVV ARLARRAGLG VRLQALSDPD RLNGDAATVA RRFQEEGGQI ESWDPTGLAD
EDVVVDALLG TGLDRPVEGR YREALQALKA AGVPVLAIDV PSGLNAGTGA VMGEAVEAHC
TVTFIGLKPG LLTGAGPQCA GTLYFDDLGV PPEIYQDMAP VAGLCRDELL RRCLGPRPAH
AHKGQFGHAL VIGGDLGMGG AARMAGEAAG RTGAGLVSVA TRPAHVAALL AARPELMVHG
LDSAEGLAPL LEKATAWALG PGLGQGPWGR ALWEAALRTE HPCVLDADAL NLLAADPRPC
PNALLTPHPG EAARLLGVTP AEVQADRLAA ADALVERYRG AVVLKGAGSV IAAPGALPRL
VTAGNPGMAS GGMGDVLTGV VLGLLAQGLS AVEAAELGAL VHARAADRAA RAGERGLLAG
DVLMALRAEV NP