Gene Mext_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1031 
Symbol 
ID5832371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1123966 
End bp1125603 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content76% 
IMG OID641366826 
ProductGlycerone kinase 
Protein accessionYP_001638507 
Protein GI163850464 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2376] Dihydroxyacetone kinase 
TIGRFAM ID[TIGR02361] dihydroxyacetone kinase, ATP-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0593318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.109318 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTCACT TCATCAACGA CCGCGCCGGT CTCGTCACCG ACGCCATCGA CGGGCTCGTC 
GCGGGCAGCG GCGGGACGCT GGCACGGCTC GACGGCTACC CCGAGATCCG CGTCGTGCTG
CGGGCCGAGC CCGAGCCGGA CAAGGTCGCC GTCGTCTCCG GCGGCGGCTC GGGGCACGAG
CCGGCCCATG CCGGCTTCGT CGGGCCGGGG CTGCTCACGG CGGCGGTGTG CGGCGACGTG
TTCGCCTCGC CCTCCGTCGA TGCCGTGCTC GCCGGCATCC TGGCGGTGAC GGGCGAGGCC
GGCTGCGTCC TCATCGTGAA GAACTATGCC GGCGACCACC TGAATTTCGG CCTCGCCGCC
GAGCGCGCCC GGGCGCTCGG CCGCCGGGTC GAGACCGTGC TGGTCGCCGA CGACATCGCC
CTGCCCGACG CGGCCCGGCC GCGCGGGCTC GCTGGCACGC TCTTCGTCCA TAAGGCCGCG
GGCCACGCCG CCGCCTCCGG CGCGCCGCTC GCCGAGGTGG CGGCACTGGC CCGGCGCACG
GCCGCCGCGG TGCGCACCCT CGGCATCGCC GTCTCCACCG CGACGATCCC CGGCTCGAAA
CCGGAGCCGC GCCTGCACGA AGGCGAGGCG GAACTCGGTC TCGGCATCCA CGGCGAACCC
GGCATCGAAC GCATCGACCT GCCCCGCGCC GACGCGCTCG CCGCGCGCAT GGCCGCGCGC
TTCCCCGCGC CCATCGCGGG GGCCGACAGG CTCGCGCTCC TCGTCAACAG TCTCGGCTCG
ACCACGGCCC TAGAGATGGC GGTGCTGACG AAGGCCGTGC TCGCGACCGA CCTTGGGCGG
CGCGTGCGGC TTCTGCTCGG CCCCTCCCCG GTCATGACCG CGCTGGACAT GCACGGCGCC
TCCCTCAGCT TCCTGGCACT GGACGAGGTC CTCGAGGCCG CGCTTCTCTC CGAGACACCG
GTCACCGCCT GGCCGCGCGC GCGGATCCTG CGCGAATCGA TCGTCCGACC GCTGCCGGAG
GGGGTCTCAG GCGGGCCGGC ACCGGCACCC TCGCGGGATG CGGTCGTCGC CGCACGAATC
GAGGCGGTGG GCCGGGCGCT GATCGCGGCG GAAGCCTCGC TCAACGCCCT CGACGCGCGG
GTCGGCGACG GCGACACCGG CTCGACCTTC GCGGAGGGTG CGCGCGCCGT GCTGGCCGAT
CTCGGCCGAC TGCCCCAGGC CGATCCCGCC GCCCTGTGCC GTGCGCTGGG CGAGCGCCTC
GAGCGCGCCA CGGGCGGATC GAGCGGCGTG CTGCTCTCGA TCTTCTTCGC CGCCACAGGC
TCGGCACTCG CCGGGGGTGC GGACTGGCCG GCCGCCTGCG CCGCCGGTGT CGCACGGGTG
CGCGAAATCG GCGGCGCGGG CCCCGGCGAC CGCACCATGC TCGACGCGGC GATTCCGGCG
ATCGCGGCGC TGGCGGACTC CGGCCTCGGC GCGGCGGCAC AGGCCGCCCG CGCAGGCGCC
GAGGCCACCG CCGGGATGGA GCGGGCCGGG ACCGGCCGCT CCAGCTATCT CGCCGGCAAA
GACCTGAAGG GCCATCCCGA TCCCGGCGCG GTTGCGGTGG CGACCGCCTT TGAGGCACTG
GCCTCGGGAT CGAAGTGA
 
Protein sequence
MAHFINDRAG LVTDAIDGLV AGSGGTLARL DGYPEIRVVL RAEPEPDKVA VVSGGGSGHE 
PAHAGFVGPG LLTAAVCGDV FASPSVDAVL AGILAVTGEA GCVLIVKNYA GDHLNFGLAA
ERARALGRRV ETVLVADDIA LPDAARPRGL AGTLFVHKAA GHAAASGAPL AEVAALARRT
AAAVRTLGIA VSTATIPGSK PEPRLHEGEA ELGLGIHGEP GIERIDLPRA DALAARMAAR
FPAPIAGADR LALLVNSLGS TTALEMAVLT KAVLATDLGR RVRLLLGPSP VMTALDMHGA
SLSFLALDEV LEAALLSETP VTAWPRARIL RESIVRPLPE GVSGGPAPAP SRDAVVAARI
EAVGRALIAA EASLNALDAR VGDGDTGSTF AEGARAVLAD LGRLPQADPA ALCRALGERL
ERATGGSSGV LLSIFFAATG SALAGGADWP AACAAGVARV REIGGAGPGD RTMLDAAIPA
IAALADSGLG AAAQAARAGA EATAGMERAG TGRSSYLAGK DLKGHPDPGA VAVATAFEAL
ASGSK