Gene Mext_2537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2537 
Symbol 
ID5833221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2847943 
End bp2849892 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content73% 
IMG OID641368338 
Productcapsule polysaccharide biosynthesis protein 
Protein accessionYP_001640002 
Protein GI163851959 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC TGCGCCCCAT CGAGCCGCGA GGCGATACCG GGCTGCCCCG CACCTACCTG 
CCGCTCGGCC GGACGGTCCG GCTTCTGCGC GCGCAGATCG AGGCGGCAAC GGGCGCGTCG
CTGACCTTCC GCTCCCCCGG CCCGCCGGCC CTGTGGTCCG CGACGGCAGC CGACGGCAAG
TCGGTTCTGC GGGTCGCGCC GGGCCCGCTC CTCAGCCCCT ACGATTCGCC GGAGACCGGC
CTTTCGAGCC TTAGCCTCAG CTATGCCGGC CGGCCCCTCG GAGGCGAGCC CCCTTCCCCG
GATTTCGGCG ACCTTCTGCG CGCACGCGGC CTCGCCGGCT ACAACCGCTT CGCCCGCTCG
CTTCCGGATG GAGCGGCCGA ACTCTTCTCA GGACAGACGC GGCCTGCCCT CGCCCTGATC
GATCCGCAAA CCGCCGCGTC CGGCGCGGTC CTGCGAAGCT TCCTGACGCG GCTTCTCGCC
GAGAATCCGG GGACGCGCTT CGTCGCCGCC GGCCCCGACG GATGCCCGCG TGATGGCGCG
CCGACCGATC CGCGGCTCAC CCTGATCGCG GGCCCGGCCG ATCCGTGGCT GCTGTTTCCC
CTCGCGGCGC GCATCCATGT GGCGCGATGG GACGCGGCCT GCGAGGCGGC GCTGGCCAGC
TTCGAGACCT ATTGCCCCGA CCCTGCAGCG GGTCCGCGCC CGGCGGATGC GGCGGCGTTC
CTGGCGCTCC GTTACGGCAT CGGCGTGCAC GGCTTCGATC CGTGGACGCG CCGGCCGATC
CCGCTCGCGG ATGCGGTGGA GCGCGTGGCG TGGCTGCGCG ACCGCTTCCT CGGCAACGAC
CGCCGCGTCG TCCTCGTCGG CGTGTCCGGC TGGAAGCGCG CCGCGCTCGA TGTCTTCGCC
ACCGGGCCGG CGGGACCGCC ACTCCACACC ATGATCGCGG ACGAGGCGGT GGCGCTCGCT
CGCGCGCAGG GCGGCCGGGT TCAGGCCTGG GCGACCCGCT GCCCGGAGCA GCTCCCGCGT
CTCTGCGCGG AGGCCGGCTT GCCCTTCGCG CGGATCGAGG ACGGATTCCT GCGCTCGGTC
GGGCTCGGCG CTAGCCTGGA GCCGGGCGCC TCCATCGTCG TGGACGATCT CGGCATCTAC
TACGACCCGC GGGTGGAGAG CCGGCTGGCC CGCACGCTGA AGGAAGCCGA GTTCCCGCCC
GGACTGACCG CCCGCGCCGC CGCCTTGCGC GAATCGATCG TGGCGCGGCG CCTGAGCAAG
TACAATGTCG GCCTCGAAGG CGTCGGCGAG GACTGGCCGA CGGACCGGCG GATCGTGCTG
GTGCCGGGCC AAGTCGAGGA CGACGCCTCG GTGCTGACCG GATCGCCGCA GGTGCGCGGC
AACCTCGCCC TGCTGCGGGC CGCCCGCGCC CGCAACCCGG ACGCCTTCCT GCTCTACAAG
CCGCATCCCG ACGTCGAGGC CGGGTTCCGT CCCGGTGCGA TCCCGGAGGA GGAGGTGCGG
CGGCTCGCCG ACCGTGTCGT CGGCGGCCTC TCCATCGTCG ACCTGCTCGA CCGTTGCCAC
CATGTCGAGA CCATGACCTC GCTGGCAGGC TTCGAGGCGC TGATCCGGGG GCTGAGCGTC
GCCGTCCACG GCCGCCCCTT TTATGCCGGC TGGGGGCTGA CCGAGGATCT GGCACCGGGG
GCCGACCGCG GTCGCACGCT GTCCCTCGAC GCCCTGGTGG CGGGTGCGCT GATCCTCTAC
CCGCTCTATC TCGATCCGGT GGCGATGAAG CCGTGCACGC CCGAGCAACT GCTCGACCGG
CTCAGCGAGG CCCGTGCGGC CGCGCCGCCC TCGCGTCTCG CCCTCGGCGC GGTCCGTCAC
GCGGCGATGC GGCTGCGCTA CGCCCTGATC AATCCGGTCA TCCGCCGCCT ACGCGCTCGC
CGCGGCGTGC GCAGTGAATC CGGCCGCTGA
 
Protein sequence
MTDLRPIEPR GDTGLPRTYL PLGRTVRLLR AQIEAATGAS LTFRSPGPPA LWSATAADGK 
SVLRVAPGPL LSPYDSPETG LSSLSLSYAG RPLGGEPPSP DFGDLLRARG LAGYNRFARS
LPDGAAELFS GQTRPALALI DPQTAASGAV LRSFLTRLLA ENPGTRFVAA GPDGCPRDGA
PTDPRLTLIA GPADPWLLFP LAARIHVARW DAACEAALAS FETYCPDPAA GPRPADAAAF
LALRYGIGVH GFDPWTRRPI PLADAVERVA WLRDRFLGND RRVVLVGVSG WKRAALDVFA
TGPAGPPLHT MIADEAVALA RAQGGRVQAW ATRCPEQLPR LCAEAGLPFA RIEDGFLRSV
GLGASLEPGA SIVVDDLGIY YDPRVESRLA RTLKEAEFPP GLTARAAALR ESIVARRLSK
YNVGLEGVGE DWPTDRRIVL VPGQVEDDAS VLTGSPQVRG NLALLRAARA RNPDAFLLYK
PHPDVEAGFR PGAIPEEEVR RLADRVVGGL SIVDLLDRCH HVETMTSLAG FEALIRGLSV
AVHGRPFYAG WGLTEDLAPG ADRGRTLSLD ALVAGALILY PLYLDPVAMK PCTPEQLLDR
LSEARAAAPP SRLALGAVRH AAMRLRYALI NPVIRRLRAR RGVRSESGR