Gene Mext_2538 details


Gene Information

Locus tag: Mext_2538
Symbol:
ID: 5833222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2850255
End bp: 2851562
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 69%
IMG OID: 641368339
Product: capsule polysaccharide biosynthesis protein
Protein accession: YP_001640003
Protein GI: 163851960
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3562] Capsule polysaccharide export protein
TIGRFAM ID:
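
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2850255
END_BP = 2851562
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons hold for the values listed in this record.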


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.157988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAGC CTCGTGCCCA TGCCATAGGC CCCCGCGCCG CCACCTTCCT GTTCCTGCAG 
GGGATCGCCT CCCCGTTCTT CTCGGTCCTC GGCAGCGCCC TGCGCGAGCG CGGCCACGGG
GTTCGCCGCA TCAATTTCTC GGCCGGCGAC TGGTTGTTCT GGCCCTTTCC CGCCGACCAC
TACCGCGGCA AGCGCGACGG CTGGGACGCC TATCTCGAAG CCTATATCCG CACGCATGGC
GTCACCGACA TCGTGCTGTT CGGCGATTGC CGGCCCTACC ATCAGGCGGC GGTGCTCGTC
GCCAAGCCGA TGGGCGTGCG CATCCACGTG TTCGAGGAGG GCTACATCCG GCCCTATTGG
ATCACGTGCG AGGCGGGCGG GGTGAACGGC AACTCGACCC TGCCGAAGCG GGCGGAGGAG
ATCCGGGAAC TGGCGCGCAA GCTGCCGCAG CCCGGACGGG CCATGCCGCT CACCGGAGAT
ATGGGCCGGC GCAGCCTGTG GGATATCAGC TTCAACATCG CCAATATCGG CTTTCCGTAC
CTTTATCCGG GTTTCCGCAC CCACCGGCCG AACCACATCG CCGCCGAATA TGCCGGCTGG
ATCCGGAAGT TCGTGCGCCG CCGCCGCACC CGCCGCGAGG CGGCGCGGGT GAACGAGATC
TACCACGCGA TCAACGCCGA CTACTTCCTG CTGCCGCTCC AGCTCGACAG CGACTACCAG
ATCCGCGTCC ACTCGCCGTT CCTCGGCGTC GAGGGCTTCA TGGACCGGGT GATCGCCTCC
TTCGCCAAGC ATTCGCAGGC GCCGACGCGG CTGCTGGTGA AGCTGCACCC GCTCGACAGC
GGCATCATGA ACTGGCGCAA GCGCGCCCGC CAATCGGCCA AGCGCCACGG CTGCAACGAC
CGGCTCGACT TCATCGACGG CGGCGACCTG CCCAAGCTCA TCGACGGCAG CCGCGGCGTC
GTGCTGGTGA ACTCCACCGT CGGGATGCTC GCCCTTGAGC GCGGGCGGCC GACGCTGGCC
TTCGGCTCTG CGGTCTACAA CATGCCGGGC CTGACCCATC AGGGCGACAT CGACACGTTC
TGGGGGGCCC CGCAGGCACC CGACGCGGCA CTGATGCAGG ATTTCTTCCG GGTCGTCATG
CACCGCACCC AGATCAACGG TGGCTACTTC TCCCGCTCGG CGATCGAGCG GGCGGTGGCC
GGCGCCGTGC CGCGTCTCGA GGCGGCACTT CCGCCTGCCG CGCTGGCTGC CGCCCGCGAC
ACGCTGGAGC AGGCCGGCCG CGACGGAAAC CTCTCCCCCG CCTATTGA
 
Protein sequence
MTQPRAHAIG PRAATFLFLQ GIASPFFSVL GSALRERGHG VRRINFSAGD WLFWPFPADH 
YRGKRDGWDA YLEAYIRTHG VTDIVLFGDC RPYHQAAVLV AKPMGVRIHV FEEGYIRPYW
ITCEAGGVNG NSTLPKRAEE IRELARKLPQ PGRAMPLTGD MGRRSLWDIS FNIANIGFPY
LYPGFRTHRP NHIAAEYAGW IRKFVRRRRT RREAARVNEI YHAINADYFL LPLQLDSDYQ
IRVHSPFLGV EGFMDRVIAS FAKHSQAPTR LLVKLHPLDS GIMNWRKRAR QSAKRHGCND
RLDFIDGGDL PKLIDGSRGV VLVNSTVGML ALERGRPTLA FGSAVYNMPG LTHQGDIDTF
WGAPQAPDAA LMQDFFRVVM HRTQINGGYF SRSAIERAVA GAVPRLEAAL PPAALAAARD
TLEQAGRDGN LSPAY
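
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example and not the method used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical. Only the first three 10-base blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as displayed in the record.
    prefix = "ATGACGCAGC CTCGTGCCCA TGCCATAGGC"
    print(translate(prefix))  # MTQPRAHAIG, the first block of the protein sequence
    print(gc_content(prefix))
```

Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be compared against the nucleotide data in the same way.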