Gene Mext_0850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0850 
Symbol 
ID5831710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp926573 
End bp927613 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content71% 
IMG OID641366632 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001638326 
Protein GI163850283 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.552823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.388725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCGGG TTCTCGTGAC CGGCGGGGCC GGCTTCGTCG GCCGCCACGC GGTCGCCGCT 
TTGGCCGCTC GCGGCTTCGA GGTTCACGCC ATCGGCCGAA CCGCGCCGGA GGGTGCCCAT
GCCTTTCACG CGGCCGACCT GCTCGATCCG GTGCAACGCC GGGCCGTCGT GCAGGCGGCC
TCGGCGAGCC ACCTGCTCCA CCTCGCCTGG ATCACCACAC CCGGCCGCTA CTGGCAGGCA
CCGGACAACC TCGACTGGAC GGCTGCGAGC CTCGACCTCG TGCGGACGTT CCGCGAGGCG
GGGGGCACCC GCGCCGTGGT GGCCGGGACC TGTGCCGAGT ACGATTGGAC GGGGATCAAC
CTTCTGCCAC GTGCAGAATT GGAATCCCCC TCTCCCCGCA CGCGGGGAAA GGGCTTCGGC
GAGCCTGTCG TCGCGACCCT ATCCCCGCAA GCGGGGCGCG GGGATGCAGC GGCGATCCAA
GAGGGTCATT TGGCGGAAGC GGCCCCCTGC CGCCCGGCGA CGCTCTATGG CGCCGCCAAG
GACGGTCTTC GCCTCATTCT GCAAGCCTAT GCGGCGACCG CCGGCCTCTC CCTCGGCTGG
GGGCGATTGT TCTACCTCTA CGGTCCCGGC GAGACGCCGG GCCGACTCGT CGGCGATGCG
GCGCGGGCGC TGCTCACGGG CCAGCGTCTC GCCACCAGCG AGGGCCGGCA GCGGCGCGAT
TTCCTGCATG CCGCCGATGT GGGAGCGGCC TTCGCGGCCC TGCTCGACTC GGGGGTGGAG
GGGCCCGTCA ATATCGGCTC GGGCGAAGCG GTGCCGGTGC GCAGAATCCT GGAAACGATC
GGTGCGCTGA CCGGACGCCC CGATCTGATC GATTTCGGCG CCCGCCCCCT CGGCCCGGCG
GAGCCGGCCC GCATCGAGGC CGACATCCGG CGCCTGACGG ACGAGGTTGG CTTTTCGGCC
CGCTACGGCC TCGAACAGGG CCTAGAGCAA ACCGTCGCGG CTTGGCGCGC CGCGCTCAGC
AATGCGGCAT CAATCCCTTG A
 
Protein sequence
MKRVLVTGGA GFVGRHAVAA LAARGFEVHA IGRTAPEGAH AFHAADLLDP VQRRAVVQAA 
SASHLLHLAW ITTPGRYWQA PDNLDWTAAS LDLVRTFREA GGTRAVVAGT CAEYDWTGIN
LLPRAELESP SPRTRGKGFG EPVVATLSPQ AGRGDAAAIQ EGHLAEAAPC RPATLYGAAK
DGLRLILQAY AATAGLSLGW GRLFYLYGPG ETPGRLVGDA ARALLTGQRL ATSEGRQRRD
FLHAADVGAA FAALLDSGVE GPVNIGSGEA VPVRRILETI GALTGRPDLI DFGARPLGPA
EPARIEADIR RLTDEVGFSA RYGLEQGLEQ TVAAWRAALS NAASIP