Gene Mext_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2037 
Symbol 
ID5834748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2273126 
End bp2275153 
Gene Length2028 bp 
Protein Length675 aa 
Translation table11 
GC content71% 
IMG OID641367835 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001639504 
Protein GI163851461 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTGCCGAA AACCGGTCTC CATCGTTCGG GCCGATGCTC TGACACGAAG CACCATGCAG 
AAGCGCACCC TCAAGACGGC GGTGATGGTC GTCCACGATC TCGCCGCGAC CGCGGCGGCC
GTGGTGCTGA CCTTCGTCTT CCGCTTTCAG GACGGGGCGC TCGTCGAGCG CCTGCACGCG
CTGCCGCTGC TGCTGCCGCC GTTCCTCGCC TATGCCGGGC TGATCTACGC GTGGTTCCGG
CTCTACCGCA CCAAGTGGCG CTTCGCCTCG CTGCCCGACC TCGCCGGCAT CGTGCGCGCC
GCGAGCGTCA CCGCGCTGAC GCTGCTCGTG CTCGACTACG TGCTGGTCTC CTCGAATCTC
TACGGGTTCT ACTTCTTCGG CAAGGTCGCC ATCGTCCTCT ACTGGGTCTT GCAGATCTTC
CTGCTCGGCG GGCCGCGGCT GGCGTTCCGC TACCTGAAAT ACGCCCGCTC CCGGCAGAGT
CAGGCCCGCA CCGCCACCAC GCCGACGCTG CTGCTGGGCC GCGGCGCCGA TATCGAGATC
GTGCTGCGGG CGATCGAATC CGGCTCGGTC AAGCGTCTCT CCCCCAAGGG CATCCTCTCG
CCGCGGGCGG ACGAGGCGGG CCAGATGATG CGCGGCGTGC CCGTACTCGG CGGGTTTCGC
GACCTCGAAC AGGTGGTTGC GGACCTCGCC AACCGTGGCC TGCCGGTGCG CCGGCTCGTG
GCGACGCCGA GCGCGCTGGC GCCCGAATCC GAGCCCGACG ATCTCATCGC CCGCGCGCGC
CGTCTCGGCT TGCCGCTCGC CCGCGTCACC AGCCTCGGCG AGGGGATGCG CGACGCCGAA
CTCGCACCGC TGGAGATCGA GGATCTGCTC CTGCGCCCCA CCGTCGCCAT CGACCGGCCG
CGGCTCGAGC GCTTCCTCAC CGGTGCCCGC GTCGTCGTCA CGGGCGGCGG CGGCTCGATC
GGCGCCGAGA TCTGCGCCCG GGCGGTCGCC TTCGGCGCCT CGGCGCTGCT CGTCATCGAG
AACTCGGAGC CGGCGCTGCA TGCCGTCCTC AACGGACCGG CCCTGCTCCA CGCCGAGGCC
GCGGTCGAGG GGGCGCTCGC CGACATCCGC GACCGCGAGC GGCTGCACGC GATCCTCCGC
GATTTCCGGC CCACTTACGT CTTCCACGCC GCCGCGCTGA AGCAGGTGCC CTATCTCGAG
CGCGACTGGG CCGAGGGCAT CAAGACCAAC GTGTTCGGTT CGATCAACGT CGCCGAGGCG
ACGGTGGCCG CGGGCGCCCG TGCCCTGGTG ATGATCTCCA CCGACAAGGC GATCGAGCCG
GTCTCGCAGC TCGGCGTGAC CAAGCGCTTC GCCGAGATGG TGGCCCAATC CCTCGACGCG
GAGCGCTCGG GCCCCGAGGC GACCCGGCTC ATCGCCGTGC GCTTCGGCAA CGTGCTGGGC
TCGGCCGGCT CGGTGGTGCC GGTGTTCAAG GCGCAGATCG CCCGCGGCGG GCCGGTCACG
GTGACCCATG CCGAGATGGT GCGCTACTTC ATGACCGTGC GCGAGGCCTC GGACCTCGTG
CTCACCGCCG CATCCCACGC CGACGCGGAA GGGCGCGGCG ACACCGGCGA CCAGCGCGCG
GCGGTCTACG TGCTGAAGAT GGGCCAGCCC GTGCGCATCC GGGATCTGGC CGAGCGGATG
ATCCGCCTCG CCGGCTTCGA GCCCGGCGAG GACATCGAGA TTCAGGTCAC CGGCGCACGG
CCGGGCGAGC GGCTCAACGA GATCCTGTTT GCCAAGGAAG AGCCGCGTGT GACGCTCGCC
GGCATCGAAG GCGTCATGGC GGCCAAGCCC GTCTTCGCCG ACCGCGCGGT GCTGGAAGGA
TGGATTCTTC GTCTGTGCGA GGCCGTCACG GCGGGCGACC GGGCGGCGGC GGAAGCCGTG
TTCGAGGAGG CGATCCCGGA TTTCCGCAAT CGCGCCGGGG CCGTTCCGAA CACCAAGCCC
GAGACCATCG CGCCGCCGGC GGCAGCCACC GAGGCGGCGA CCGCCTGA
 
Protein sequence
MCRKPVSIVR ADALTRSTMQ KRTLKTAVMV VHDLAATAAA VVLTFVFRFQ DGALVERLHA 
LPLLLPPFLA YAGLIYAWFR LYRTKWRFAS LPDLAGIVRA ASVTALTLLV LDYVLVSSNL
YGFYFFGKVA IVLYWVLQIF LLGGPRLAFR YLKYARSRQS QARTATTPTL LLGRGADIEI
VLRAIESGSV KRLSPKGILS PRADEAGQMM RGVPVLGGFR DLEQVVADLA NRGLPVRRLV
ATPSALAPES EPDDLIARAR RLGLPLARVT SLGEGMRDAE LAPLEIEDLL LRPTVAIDRP
RLERFLTGAR VVVTGGGGSI GAEICARAVA FGASALLVIE NSEPALHAVL NGPALLHAEA
AVEGALADIR DRERLHAILR DFRPTYVFHA AALKQVPYLE RDWAEGIKTN VFGSINVAEA
TVAAGARALV MISTDKAIEP VSQLGVTKRF AEMVAQSLDA ERSGPEATRL IAVRFGNVLG
SAGSVVPVFK AQIARGGPVT VTHAEMVRYF MTVREASDLV LTAASHADAE GRGDTGDQRA
AVYVLKMGQP VRIRDLAERM IRLAGFEPGE DIEIQVTGAR PGERLNEILF AKEEPRVTLA
GIEGVMAAKP VFADRAVLEG WILRLCEAVT AGDRAAAEAV FEEAIPDFRN RAGAVPNTKP
ETIAPPAAAT EAATA