Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1332 |
Symbol | |
ID | 4286901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1457273 |
End bp | 1458703 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638140811 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_756562 |
Protein GI | 114569882 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.977936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.520853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGAG CGGGGCGCGC GGTTGCGAAT GCGATTCGGG CCCGCTGGGC GCCTGTCCCT GTCTTGGTGC TGGCCGGGCC GGGCAATAAT GGTGGCGATG GGCTGGTCGC CGCGCGGGTC CTGCGCGAAG CGGGCTGGCC TGTACGCGTG ATGTTGCTGG CAGCGCGGGA AGGTCTTTCG ACAGACGCAG CCAGAGCGTT GGAGGGATGG GGCGGCCTGG TTGAGTCCCC AGCCTCGGGA TTTTTGGGGG ATGCAGCGCT CGTGATCGAC GCCCTGTTCG GCGCAGGTTT GTCGAGACCT GTCGAGGGAA GCGCGGCAGA TTGGCTGGTC CGAATTCGTG ATAGTGGCCT GCCGGTGGTC TCAGTCGACT TGCCCAGCGG GATAAACGGG GATGCGCGCC CCTTGGCAGC GCCCCACCTG ACGGCCGATC TGACAATCAC CTTTCATTGC AGAAAGCTGG CGCACCTGGT TGAGCCTTTT GCGGAAATGT GTGGTGAAAC TGTCTGTGTC GATATTGGTA TCCCGGACGG TTGGGAAGCC CGGGTAACGC CAGTGGCGCG AGAGGTCGAC AATCCCGACT GGAAACCGGT CGAGCCCTCC GGATATCTCA GCTTGCACAA ACATGCTCGC GGCCGCGTGG CGGTGTTCAG TGGGGGCAGC CAGTCCTCAG GCGCCGGGCG TCTGGCAGCG GCCGGCGCGC TACGCGCCGG CGCCGGCGTG GTGACACTTT GTACGCCGCC CTCGGCCAGC CTGGTCAATG CGGCGCATCT GACCGCCATC ATGCTGGCCA GGTGGGGTAG TGACGATCAG ATTGGTGAAG TACTGACCGG CTTGCGTGCC GAGGCTGCCG TCCTGGGGCC GGCCCTGGGT GTTGGTCCGG CCACGCGCAA AGCCGTGCTG GGTGCCCTGG AGGCCAATGT CCCTCTGGTC CTCGATGCGG ATGCTTTGAC CAGTTTTGCC GACGATGCGG ATAGTCTTGT CGACCAATTG CATGATCGGT GCGTCCTGAC ACCGCATCAT GGCGAGTTTT CCAGGTTGTT CGGGGAGGGC AGACCGAACT GGAACAAGCT CGAGCAAGCC CAGTCTGCGG CGGACCGGTG TGGCTGCACG GTGTTGCTCA AAGGTCCGGC GACCATCATC GCAACGCCGG GGCTGACACC ATGGATAAAC CGACATGCCT GCCGCTGGCT GGCCACGGCG GGGAGCGGGG ATGTCCTGGC CGGCATCATT GCGGCAGGAC TGGCGGCGGG CATGTCGCCG CATGACGCGG CGGCCGGCGC GACCTGGCTG CACGGAGATG CCGCCTTGCG CCAGGGGGCA GGAATGACCG CCGAGGACCT TCCCGACGCC TTGCCAGGCG CGCTGCAAGC CCTGCAACGA CGAAGGCGCC AAACGGCAGC GCTGAGCCAT CTCCTGTCGC ATGGAAGCTA G
|
Protein sequence | MDRAGRAVAN AIRARWAPVP VLVLAGPGNN GGDGLVAARV LREAGWPVRV MLLAAREGLS TDAARALEGW GGLVESPASG FLGDAALVID ALFGAGLSRP VEGSAADWLV RIRDSGLPVV SVDLPSGING DARPLAAPHL TADLTITFHC RKLAHLVEPF AEMCGETVCV DIGIPDGWEA RVTPVAREVD NPDWKPVEPS GYLSLHKHAR GRVAVFSGGS QSSGAGRLAA AGALRAGAGV VTLCTPPSAS LVNAAHLTAI MLARWGSDDQ IGEVLTGLRA EAAVLGPALG VGPATRKAVL GALEANVPLV LDADALTSFA DDADSLVDQL HDRCVLTPHH GEFSRLFGEG RPNWNKLEQA QSAADRCGCT VLLKGPATII ATPGLTPWIN RHACRWLATA GSGDVLAGII AAGLAAGMSP HDAAAGATWL HGDAALRQGA GMTAEDLPDA LPGALQALQR RRRQTAALSH LLSHGS
|
| |