Gene Mext_0847 details

Gene Information

Locus tag: Mext_0847
Symbol:
ID: 5833337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 923526
End bp: 924632
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 71%
IMG OID: 641366629
Product: CDP-glucose 4,6-dehydratase
Protein accession: YP_001638323
Protein GI: 163850280
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02622] CDP-glucose 4,6-dehydratase
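
The length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not statements made by the record, and the variable names are illustrative only. A minimal Python sketch of the arithmetic:

```python
# Values copied from the record above.
start_bp = 923526
end_bp = 924632

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1107
print(protein_length_aa)  # 368
```

Under these assumptions the results match the Gene Length and Protein Length fields.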


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.307113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.210188
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCGC TCAACCCCGA TCCTGCCTTC TGGGCGGGCA AGCGCGTGCT GCTCACCGGG 
CATACCGGCT TCAAGGGCGC GTGGCTGAGC CTGTGGCTCG CCCGGCTCGG CGCCCGCGTC
ACCGGCTTCG CCCTTCCCCC TGAGACGCGG CCGAACCTGT TCGAGGCGAT CGCATTCCCG
TCCGAGGACT CGCGCATCGG CGACATCCGC GATTTGCCGG CGCTCGCCGC GGCCGTGGCG
GCTGCCGAGC CGGAGATCGT GATCCACATG GCGGCGCAGG CCCTGGTGCG GCCCTCCTAT
ACCGATCCGG TGGGGACGTT CGCGATCAAC ACCATGGGCA GTGTTCACCT GCTGGAAGCG
GTGCGGTTGG CCCCGAGCGT GCGCGCCGTC GTCGTCGTGA CGAGCGACAA GGCCTACGAG
AACCGCGAAT GGCCCTATGC CTATCGCGAG ACCGAGGCGA TGGGCGGGCG CGATCCCTAC
AGCGCCTCGA AGGGCTGTGC CGAACTCGTA ACGAGTGCCT ATCGCGCCTC GTTCTTCGGC
GCGGGCGGCC ATCCGGCCCG GATCGCCAGC GCGCGGGCCG GCAACGTCAT CGGCGGCGGC
GATTGGTCCC TCGACCGGCT GATCCCCGAT ATCGTGCGCG CCTTCGAGGC CGGGGACTCG
GTCGAGATCC GCGCGCCGCA CGCGATCCGC CCGTGGCAGC ACGTGCTGGA ACCGCTGGCC
GGCTACCTCA GGCTCGCCGA ATGCCTCGCG GGCGCCGACG GCGCCGCCTT CGCGGAGGGC
TGGAATCTCG GGCCGGCGGA CGAGGATTGC CGGCCGGTCT CGTACCTCGT GGAGCGGCTG
GCGCAGGGCT GGGGCGGGGG AGCCGGCTGG CACCTCTCGC AGAAGACCCA TCCTCACGAG
GCGACATATC TCAAGGTCGA TGCCTCCAAG GCCCGCGCCC GCCTCGGCTG GGACCGGCGG
CTGACCCTCG ACACGGCGCT CGACTGGACC GCCGCGTGGT ATCGCGCGGC CGCTTCCGGT
GCCGATCCCC GCGCTCTGGC CGAGGCTGAG ATCGCGCGCT ACGAGGCGCT GGGCCAGCCT
GGAGCAAAAG CCGGAGTCCA AGCGTGA
 
Protein sequence
MAALNPDPAF WAGKRVLLTG HTGFKGAWLS LWLARLGARV TGFALPPETR PNLFEAIAFP 
SEDSRIGDIR DLPALAAAVA AAEPEIVIHM AAQALVRPSY TDPVGTFAIN TMGSVHLLEA
VRLAPSVRAV VVVTSDKAYE NREWPYAYRE TEAMGGRDPY SASKGCAELV TSAYRASFFG
AGGHPARIAS ARAGNVIGGG DWSLDRLIPD IVRAFEAGDS VEIRAPHAIR PWQHVLEPLA
GYLRLAECLA GADGAAFAEG WNLGPADEDC RPVSYLVERL AQGWGGGAGW HLSQKTHPHE
ATYLKVDASK ARARLGWDRR LTLDTALDWT AAWYRAAASG ADPRALAEAE IARYEALGQP
GAKAGVQA
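
The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch, assuming plain Python with no sequence library, of how such blocks could be joined and a GC percentage computed. `join_blocks` and `gc_percent` are hypothetical helper names, and only the first two blocks of the gene sequence are used as input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, copied from the record.
first_two_blocks = "ATGGCTGCGC TCAACCCCGA"
seq = join_blocks(first_two_blocks)

print(seq)              # ATGGCTGCGCTCAACCCCGA
print(gc_percent(seq))  # 65.0
```

Applied to the full gene sequence, the same function gives a value that can be compared with the GC content field in the record.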