Gene Mext_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1859 
Symbol 
ID5831627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2079066 
End bp2081480 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content70% 
IMG OID641367658 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_001639329 
Protein GI163851286 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.708627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCC TCAGGCTGTC GACCGATCCG TCTCTTCCGC CCTCCGGCGG GCCCGGTGGG 
CCCGGCGATG GATTGGCGGT GGCGCAGATT GGCGGCGTCC TGCGCCGCTC CTGGGCCTGG
ATCGCCGTGC CGACGCTCGT GGCCGCACTC GGGGCGGGCG TCTTCGTGCA GGTGGTGACC
CCGCGCTACA CCGGCGAAGC CAAGGTTCTG CTGGAAAGCC GCGATCCCGC CTTCGCTCGC
ACCGCCGCCG AACGGACGGA CCAGTCCCAG CCGATCGACG AGCAGGCCGT CGCGAGCCAA
GTACAGGTGG CGATGTCGCG CGACATCGCC CGCGAAGCGA TTCGCAGCCT GAAGCTGGTC
GGAAACCCCG AATTCGATCC GGAGGCCGAG GGCAGCTCGG CGATCCGGCG CACGCTGATG
ATGCTCGGGC TCGTCTCGGC GCCGATGGAT CGTCCGTCCG AGGATCGCAT CCTCGAAAGC
TACCTCGACC ACCTGCTGGT CTATCCGGTC GGCAAGTCGC GCATCCTCGC CGTCGAGTTC
CGCTCGCGCG ATCCCGAACT CGCCGCGCGC GGCGCCAACA CGGTGGCGGA CCTCTATCTC
GCCTCGCTGG AGGCGGCCTC CGTCGATACC GCCCGTTACG CCTCGACCTG GCTCGGCAAC
AACATCGCGA ACCTGCGTGC CCGCGTGGCC GAAGCCGAGG CGAAGGTCGA AGCGTTCCGC
GCCAAGCACG GCCTGATCGG CACCGGCAGC AGCGCGGCGG CTCAACCGCT CTCGTCCCAG
CAGCTCTCCG AATTGTCGAG CCAGCTCTCG CAGGCCCGCA CGATCCAGGC GGACCTGACT
GCCCGCGCCA AGCTCCTCAA GGACATGATC AAGGAGGGCC GTGCCTTCGA GATCCCCGAT
GTCGCCAACA ACGAGCTGAT TCGGCGCACC GTCGAGAGCC GCATGGCGAT GCGTGCGCAG
CTCGCCCTCG AGTCGCGCAC GCTGCTGCCG GCCCACCCGC GCATCAAGGA GCTGACGGCC
CAGGTCCAGG ATCTCGAGAA TCAGATCAAG GCAGCCGCCG AGCGGGTGGT GCGCACCCTC
GAGAACGACG CCAAGATCGC GGGCGCTCGG GTCGAGAGCC TGCGCGCGGC GGTCGAGGGA
CAGCAGGATG TGGTCGCCAA GGGCAACACC AGCGAGGTGG AGCTGCGCGC CCTGGAGCGC
GAGGCGAAAT CCCAGCGCGA ACAGCTCGAA TCCTATCTCG CGCGCTACCG CGAGGCCGCC
GCGCGTGACG CCGAGAATGC CAGCCCGGCC AATGCCCGCG TGGTGTCCCG TGCGATCGTG
CCCGATCTGC CCTCCTTCCC CAAGAAGCTG CCGATCGTCG CCTTCGCCAC GATGCTGGCC
TTCTTGCTCG CGAGCGCCGG GGTCATCGGC CGCCATCTCC TCGTCACGCC GGCCGGGCCG
GGCGGAGACC GCACCGGGGA TGAGGGAGAG CCCCTCGTCC AAAGCACCCG GACCGATCGC
CCGCGCGACT TCTATCCCGA GCCGGAGCCG CCCCCGTCGC GCCGCCCTGT CTACGAGCCG
GTCTATGGCG GCGCCTATCC GGCCTCGGCG GCGGAACGCT TCGCGCCGGC TCTCGCCTTT
GCCCATTCCC TGAGGGCGAC GGCGACGGTG CATCACCCGG TCTTCGCCAG CACGGCGCCG
ATCGGGGCCG AGGCGGCCGC GCAAGCGGGT GGGGCGAAAA CGGACAAGGA AGGGCTCGGC
GTCCCCGCGA AATCCTCCGC CTCCTCTGCC GATCTCGACG GGCTGATCGC CCGACTGGCG
AACGGAACGG GCAAGTCGCA GGGCGAAGGG CAGGGCAACG GGCCGTTGGC CGGGCCGTCC
AAGGGGGGCT GTGTGCTCGT GGTCGAAACC CCGCGCGCCG ATGGCACGCC CGGTCTTGCC
TCCAATCTCG CCCGGGTGCT CGGTCCGCGC TACCAGACGC TGCTGGTCGA TGTGAACGGT
GTGGTCTCCG GGCCGTCCGA GCCGGGACTG ACCGATCTAG TGGCCGGTGC CGCCGATTTC
CTCGACGTGA TCCAGCCGCT GCGGGGTTCG CGTCTCCACG CGGTGAAGCG CGGCGCCGCG
CCTCTCGATG TGCTGGTGGA GGAGCCGCAA GGCCTCGCCA TCGGGCTCAA CGCCCTGTCG
CAGAGTTACG ATTGGGTGCT GTGCCGCCTC GATGCGCGCA ATACCGAAGA CGGTGCCGAA
CTCATTCCCG CGGTCGGACC CTGCATGGAT TCGATCGTGA TCGCCTCGGA TGCGGCGTCC
GACGACCCGG CTCTGGTCTC GCTCTATCGC CTCGCCAAGG AAACCGGTGT GGCCCGGGTG
GTGGTTGCCC GCCACGGCGA GGACGCGGAC CTCACGCCGA GCCTGGAAGG GACGCCTCTG
CGGCTCTCGG CCTGA
 
Protein sequence
MPRLRLSTDP SLPPSGGPGG PGDGLAVAQI GGVLRRSWAW IAVPTLVAAL GAGVFVQVVT 
PRYTGEAKVL LESRDPAFAR TAAERTDQSQ PIDEQAVASQ VQVAMSRDIA REAIRSLKLV
GNPEFDPEAE GSSAIRRTLM MLGLVSAPMD RPSEDRILES YLDHLLVYPV GKSRILAVEF
RSRDPELAAR GANTVADLYL ASLEAASVDT ARYASTWLGN NIANLRARVA EAEAKVEAFR
AKHGLIGTGS SAAAQPLSSQ QLSELSSQLS QARTIQADLT ARAKLLKDMI KEGRAFEIPD
VANNELIRRT VESRMAMRAQ LALESRTLLP AHPRIKELTA QVQDLENQIK AAAERVVRTL
ENDAKIAGAR VESLRAAVEG QQDVVAKGNT SEVELRALER EAKSQREQLE SYLARYREAA
ARDAENASPA NARVVSRAIV PDLPSFPKKL PIVAFATMLA FLLASAGVIG RHLLVTPAGP
GGDRTGDEGE PLVQSTRTDR PRDFYPEPEP PPSRRPVYEP VYGGAYPASA AERFAPALAF
AHSLRATATV HHPVFASTAP IGAEAAAQAG GAKTDKEGLG VPAKSSASSA DLDGLIARLA
NGTGKSQGEG QGNGPLAGPS KGGCVLVVET PRADGTPGLA SNLARVLGPR YQTLLVDVNG
VVSGPSEPGL TDLVAGAADF LDVIQPLRGS RLHAVKRGAA PLDVLVEEPQ GLAIGLNALS
QSYDWVLCRL DARNTEDGAE LIPAVGPCMD SIVIASDAAS DDPALVSLYR LAKETGVARV
VVARHGEDAD LTPSLEGTPL RLSA