Gene Mmar10_1332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1332 
Symbol 
ID4286901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1457273 
End bp1458703 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content66% 
IMG OID638140811 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_756562 
Protein GI114569882 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.977936 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.520853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGAG CGGGGCGCGC GGTTGCGAAT GCGATTCGGG CCCGCTGGGC GCCTGTCCCT 
GTCTTGGTGC TGGCCGGGCC GGGCAATAAT GGTGGCGATG GGCTGGTCGC CGCGCGGGTC
CTGCGCGAAG CGGGCTGGCC TGTACGCGTG ATGTTGCTGG CAGCGCGGGA AGGTCTTTCG
ACAGACGCAG CCAGAGCGTT GGAGGGATGG GGCGGCCTGG TTGAGTCCCC AGCCTCGGGA
TTTTTGGGGG ATGCAGCGCT CGTGATCGAC GCCCTGTTCG GCGCAGGTTT GTCGAGACCT
GTCGAGGGAA GCGCGGCAGA TTGGCTGGTC CGAATTCGTG ATAGTGGCCT GCCGGTGGTC
TCAGTCGACT TGCCCAGCGG GATAAACGGG GATGCGCGCC CCTTGGCAGC GCCCCACCTG
ACGGCCGATC TGACAATCAC CTTTCATTGC AGAAAGCTGG CGCACCTGGT TGAGCCTTTT
GCGGAAATGT GTGGTGAAAC TGTCTGTGTC GATATTGGTA TCCCGGACGG TTGGGAAGCC
CGGGTAACGC CAGTGGCGCG AGAGGTCGAC AATCCCGACT GGAAACCGGT CGAGCCCTCC
GGATATCTCA GCTTGCACAA ACATGCTCGC GGCCGCGTGG CGGTGTTCAG TGGGGGCAGC
CAGTCCTCAG GCGCCGGGCG TCTGGCAGCG GCCGGCGCGC TACGCGCCGG CGCCGGCGTG
GTGACACTTT GTACGCCGCC CTCGGCCAGC CTGGTCAATG CGGCGCATCT GACCGCCATC
ATGCTGGCCA GGTGGGGTAG TGACGATCAG ATTGGTGAAG TACTGACCGG CTTGCGTGCC
GAGGCTGCCG TCCTGGGGCC GGCCCTGGGT GTTGGTCCGG CCACGCGCAA AGCCGTGCTG
GGTGCCCTGG AGGCCAATGT CCCTCTGGTC CTCGATGCGG ATGCTTTGAC CAGTTTTGCC
GACGATGCGG ATAGTCTTGT CGACCAATTG CATGATCGGT GCGTCCTGAC ACCGCATCAT
GGCGAGTTTT CCAGGTTGTT CGGGGAGGGC AGACCGAACT GGAACAAGCT CGAGCAAGCC
CAGTCTGCGG CGGACCGGTG TGGCTGCACG GTGTTGCTCA AAGGTCCGGC GACCATCATC
GCAACGCCGG GGCTGACACC ATGGATAAAC CGACATGCCT GCCGCTGGCT GGCCACGGCG
GGGAGCGGGG ATGTCCTGGC CGGCATCATT GCGGCAGGAC TGGCGGCGGG CATGTCGCCG
CATGACGCGG CGGCCGGCGC GACCTGGCTG CACGGAGATG CCGCCTTGCG CCAGGGGGCA
GGAATGACCG CCGAGGACCT TCCCGACGCC TTGCCAGGCG CGCTGCAAGC CCTGCAACGA
CGAAGGCGCC AAACGGCAGC GCTGAGCCAT CTCCTGTCGC ATGGAAGCTA G
 
Protein sequence
MDRAGRAVAN AIRARWAPVP VLVLAGPGNN GGDGLVAARV LREAGWPVRV MLLAAREGLS 
TDAARALEGW GGLVESPASG FLGDAALVID ALFGAGLSRP VEGSAADWLV RIRDSGLPVV
SVDLPSGING DARPLAAPHL TADLTITFHC RKLAHLVEPF AEMCGETVCV DIGIPDGWEA
RVTPVAREVD NPDWKPVEPS GYLSLHKHAR GRVAVFSGGS QSSGAGRLAA AGALRAGAGV
VTLCTPPSAS LVNAAHLTAI MLARWGSDDQ IGEVLTGLRA EAAVLGPALG VGPATRKAVL
GALEANVPLV LDADALTSFA DDADSLVDQL HDRCVLTPHH GEFSRLFGEG RPNWNKLEQA
QSAADRCGCT VLLKGPATII ATPGLTPWIN RHACRWLATA GSGDVLAGII AAGLAAGMSP
HDAAAGATWL HGDAALRQGA GMTAEDLPDA LPGALQALQR RRRQTAALSH LLSHGS