Gene Mext_3972 details

Gene Information

Locus tag: Mext_3972
Symbol:
ID: 5835677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4412671
End bp: 4416777
Gene Length: 4107 bp
Protein Length: 1368 aa
Translation table: 11
GC content: 71%
IMG OID: 641369763
Product: non-ribosomal peptide synthetase
Protein accession: YP_001641414
Protein GI: 163853371
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
TIGRFAM ID: [TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function
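
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a final stop codon that is not counted in the protein length. Both are assumptions that happen to agree with the listed values, not conventions documented in this record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4412671, 4416777)
print(length, protein_length(length))  # prints: 4107 1368
```

Under these assumptions the computed values match the Gene Length (4107 bp) and Protein Length (1368 aa) fields.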


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.564437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAATCGG CAGCAGAGCA TCGGATGAGC GGTCTCGCCG AGGTGGGACG GCACGACGAA 
GGACCGGGCG AGCGCCTCGG CGGCCGCGCC GTGCTGCGGG GGGCCGCGCG GCCCGATCTG
ATCCGCCGGG AGACCCTGGC CGACCTGTTC CGGGCGAGCG CCGCCGCGCG CGCCGCCGCG
CCCTGCCTGA TCGACGCCGC CGAGCCCGGC TCCGGCGGCC GCCGTCCGGT CCTGACCTAT
GCGGAGGTCG ATGCCCGCTC CGACGCCATC GCCGCCGGGC TTTTCGCCCG CGGCATCGGT
CCGGGGGACG TCGTGGGGCT GTGGATGGCC CGCGGCACCG AACTGCTGAT CGCACAGATC
GGCATCACGA AATCCGGCGC CGCCTGGCTG CCCTTCGACG CCGAGGCGCC CGCCGATCGG
GTCGCGGTCT GCCTGAACGA TGCCGAGGCC AAGGCGCTCC TCGTCTCGGA GGCCCTGCGG
CCTCAGGCGC CGGAGGGCAC GCCCGCCGTC ACCACCGAAG CGCTTCTGCG GGCCGGGCAG
GGGGCGTCAC CGCCCGATCT CGACGCCGCT GGCCTCGGTC CGGAGCACCC GGCCTACCTG
ATCTACACCT CGGGCTCGAC CGGCGTGCCG AAGGGCATCG TCATCAGCCA CGCCAACATC
TGCCACTTCC TGCGCTCGGG CAACGCGGTC TACGGCCTCG GCGCCGACGA CGTGGTGTTC
CAGGGCGCCT CCGTGGCCTT CGATCTTTCG ATGGAGGAGA TCTGGGTTCC CTATCTCGTC
GGCGCCTGCC TGTTCGTGGC GAGCCCGGCC ATGATGGGCG ACGTCGAATC CCTGCCCGCG
ATCATCGCGG AGGCCCGGAT CACCGTGCTC GACACGGTCC CGACCCTGCT CGCCATGATC
CCCGGCGACC TGCCGAGTGT CCGCCTCGTG CTGCTCGGCG GCGAGGCCCT GCCCGAGCCG
CTGGTGGCGC GCTGGGCGAC TGATCGGCGG CGCCTGTTCA ACACCTACGG CCCGACCGAA
GCCACCGTGG TGGCGACCGC CGCCGAGATG CGGCCGGGCC GGCCGGTCAC CATCGGCGGC
CCGATCCCGA ATTACTCCGT CTACGTTGCC GACGAGGCGC TAAACCTCCT CGCACCGGGC
GAGCAAGGCG AGCTGCTGAT CGGCGGCCCC GGCGTCGCGG CGGGCTACCT CAAGCGGCCC
GAACTGACGG CGGAAAAGTT CGTCGCCAAC CCCTACCCGT CCGACGGGAC CGATCCGGTG
CTCTACCGCT CGGGCGATGC CGTCTCGATG ACGCCGGAGG GCGACATCGT CTTCCACGGC
CGCATCGACG ATCAGGTCAA GATCCGCGGG TTCCGCGTGG AGCTCGGCGA GATCGAGGCA
CGCATCCGGG CGCAGGACGG GATCAATCAG GCGGCGGTAG TCTTGCGCCG TGACGACGAG
GTTGACCGCC TCGTCGCCTT CCTCGTGCCT GAGCGGAATG CCGCCCTCGA CCGCGCGGCG
CTGCGAAAGA ATCTCGTGGC GCAGATGCCG CCCTACATGG TGCCCGGCCA TTTCGAGGAG
GTCGAGACCC TGCCGCGGCT TACCTCCGGC AAGGTCGATC GCAAGGCGCT GCGGATCGCG
CCGCTGACCG TGGCGGTGGC CGACGGCGAG CAGGAGGCCC CCGACAACGA GACCGAAGCC
GCTTTGCTGG CGGCGGCGAA AAAGGTGTTC GGCGACGGTC CGATCGGGCT CGCGGCCGAC
TTCTTCTCCG AACTCGGCGG CCACTCGTTG CTCGCCGCCC GCTTCGTCGG CGCGGTGCGC
GAGACCCCGG CGCTCGCCGG CATCACCCTG CAGGACGTCT ATAACGGCCG CACCCTGCGG
GTGATGGCCG CGAGTCTGAT CGAGCGCACG GGCGGGGCGG GGGCGCAGAC CGCGATCCGC
GACCTGAGCT TTGCTCCGCC GCCGCTGCTG CGCCGGGCGC TGTGCGGGCT GGCGCAGGCG
ATCACGCTGC CCTTCGTCAT CGCGCTGGCC ACCGCGCAAT GGCTCGGCAT CTTCGTCACC
TACCTGCTGC TCACCGGCGG CGGGCTCGGC TTCTGGGGCG AACTCGGCGT TCTCCTGCTG
GTCTATATCG GCATCAACGC GGTCACTGCG ACGATCGCGA TCGCCGCCAA ATGGCTGATC
CTCGGGCGGA CGAAGCCCGG CCGCTACCCG CTCTGGGGCG TCTATTATTA CCGCTGGTGG
CTGGCGCAGC GCCTGACGCC GCTGGTGCAC ATCAAGTGGC TCCAGGGCTC GCCCGTCATC
GTCACCTATC TGCGCCTGCT CGGCGCGCGG ATCGGCGACG ACGTGCTGAT CTCCGACCTC
GATGTCGGCG CGCCCGACCT GCTCACCATC GGCCGCGGCG CGTCGCTCGG CGGGCGGCTG
GTGATCGCCA ACGCGGAGGT CGTCGGCAAC GAACTCGTCA TCGGCTCCGT CGAGATCGGC
GAGGATGCCG CCATCGGCAC CTCCTGCGTG ATCGGCCCCG GCACGGTGAT CGGCGACCAT
GCCGAGATCG CCGACCTCAC CACCGTGCCG GCCGGCACCG AGGTCGGCTC GGCCGAGGCC
TGGGACGGCT CGCCAGGCCG CCGGGTCGGC ACCGTCGATT TTTCCGCGCT GCCCGAACCG
GCCACCGCCT CGCCCGCCCG CCGGGCGGCC TTCGGCGCGG TCTACGCCTT CCTGCTCGCG
GCGGTGCCGG CGGTCGGCCT GCTGCCGATC TTCCCGGCCT TCTACCTCTT CGACCAGATC
TCCGATAACC TCTCGGAACT CACCGACGTC GATTACCACG TCTACCTGCC GCTGCTGACC
TGGCCCACCG CCATGCTGAT GACGGCGGGC ACGGTGCTGC TCATCGCCGC CATCCGCTGG
GCGGTGCTGC CGCGGGTCAC GCACGGCACC TTCTCGATCT GGTCGGGCTT CTACCTGCGC
AAATGGCTGG TGGCGCTCGC CTCCGAAGTG ACGCTGGAGA CGCTCTCCTC GCTGTTCGCC
ACCGTCTACA TGCGGGCGTG GTACCGGCTG ATGGGCGCGC GGATGGGAAG GGGCGCCGAG
ATCTCGACCA ATCTCGGCTC GCGCCACGAC CTCGTCGCGG TCGGCGCCAA CAACTTCATC
GCCGACGAGG TGGTGGTCGG CGAGGAGGAG ATCCGCCGCG GCTGGATGCA TCTCCATCCG
GTCGAGACCG GCGCTCGCGT CTTCGTCGGC AATGACGGCG TGCTGCCGCC GGGCGCGCAC
ATCCCCGACG ACGTGCTGAT CGGCATCAAG TCGAAGCCGC CGGCCAACGA CAAGATGGGG
CCAGGCGAGA CGTGGTTCGG CTCGCCGCCG ATCCGCCTGC CGGTGCGCCA GAAGGTCGAT
CTCGGCTCCA ACGCCCAGAC CTTCGAGCCG AGCGTATGGG CCAAGCTGCG GCGCGGCCTG
TTCGAGGCCT TCGCCACCTC GTTCTCGCCG ATGCTCTACA TTTCGCTCGC CATCTGCGCG
ATCGACTGGG TGTTCTACCC GGCCATCCTC GCCGAGGATT GGGGTGGGCT GGCGCTCGCC
TTCGTGCTCG TGAGCCTCGC CATCGCGCTG ATCCAGACGA GCAGCGTCAT CGCCCTCAAA
TGGCTCTTGA TGGGACGCTA CCGCCCCGGC ATGCGGCCGA TGTGGTCGTG GTGGGCGATG
CGGACCGAGG CCATCGCGGT GGCCTATTGG GGGCTCGCCG GCAAGGTGCT GCTGGAGCAC
CTGATGGGCA CGCCGTTCCT GCCCTGGATG CTGCGGCTGT TCGGGGTGAA GGTGGGCAAG
GGCGCCTGCC TCCTTATGAC CGACATCACC GAGTTCGACT GCGTCGAGAT CGGCGACTTC
GCCGCGATCA ACCGCTCGGC GGCCCTCCAG ACCCACCTCT ACGAGGACCG CATCATGAAG
GTCGGCCGCG TCGTGGTCGG GCGCGGCGTC ACGGTGGGCG CCTTCTCCAC CGTCCTCTAC
GACAGCCATG TCGGCGACTA CGCGCGGCTG CGCCCGCTCA CCATCGTGAT GAAGGGCGAG
TCGATCCCGG CACATTCGGA ATGGGAGGGC GCGCCTGCCG TGCCCGTGGT GCACGCGGCG
GGCGAGGCTG TGGCGCGGGC GGCCTGA
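
A minimal sketch for recomputing the GC fraction of a sequence printed in space-separated blocks of ten, as the gene sequence is here. The helper name `gc_content` is hypothetical, and the example runs only on the first two blocks of the sequence, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First two blocks of the gene sequence above (20 bases, 12 of them G or C).
first_blocks = "GTGGAATCGG CAGCAGAGCA"
print(f"{gc_content(first_blocks):.0%}")  # prints: 60%
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field in the Gene Information section.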
 
Protein sequence
MESAAEHRMS GLAEVGRHDE GPGERLGGRA VLRGAARPDL IRRETLADLF RASAAARAAA 
PCLIDAAEPG SGGRRPVLTY AEVDARSDAI AAGLFARGIG PGDVVGLWMA RGTELLIAQI
GITKSGAAWL PFDAEAPADR VAVCLNDAEA KALLVSEALR PQAPEGTPAV TTEALLRAGQ
GASPPDLDAA GLGPEHPAYL IYTSGSTGVP KGIVISHANI CHFLRSGNAV YGLGADDVVF
QGASVAFDLS MEEIWVPYLV GACLFVASPA MMGDVESLPA IIAEARITVL DTVPTLLAMI
PGDLPSVRLV LLGGEALPEP LVARWATDRR RLFNTYGPTE ATVVATAAEM RPGRPVTIGG
PIPNYSVYVA DEALNLLAPG EQGELLIGGP GVAAGYLKRP ELTAEKFVAN PYPSDGTDPV
LYRSGDAVSM TPEGDIVFHG RIDDQVKIRG FRVELGEIEA RIRAQDGINQ AAVVLRRDDE
VDRLVAFLVP ERNAALDRAA LRKNLVAQMP PYMVPGHFEE VETLPRLTSG KVDRKALRIA
PLTVAVADGE QEAPDNETEA ALLAAAKKVF GDGPIGLAAD FFSELGGHSL LAARFVGAVR
ETPALAGITL QDVYNGRTLR VMAASLIERT GGAGAQTAIR DLSFAPPPLL RRALCGLAQA
ITLPFVIALA TAQWLGIFVT YLLLTGGGLG FWGELGVLLL VYIGINAVTA TIAIAAKWLI
LGRTKPGRYP LWGVYYYRWW LAQRLTPLVH IKWLQGSPVI VTYLRLLGAR IGDDVLISDL
DVGAPDLLTI GRGASLGGRL VIANAEVVGN ELVIGSVEIG EDAAIGTSCV IGPGTVIGDH
AEIADLTTVP AGTEVGSAEA WDGSPGRRVG TVDFSALPEP ATASPARRAA FGAVYAFLLA
AVPAVGLLPI FPAFYLFDQI SDNLSELTDV DYHVYLPLLT WPTAMLMTAG TVLLIAAIRW
AVLPRVTHGT FSIWSGFYLR KWLVALASEV TLETLSSLFA TVYMRAWYRL MGARMGRGAE
ISTNLGSRHD LVAVGANNFI ADEVVVGEEE IRRGWMHLHP VETGARVFVG NDGVLPPGAH
IPDDVLIGIK SKPPANDKMG PGETWFGSPP IRLPVRQKVD LGSNAQTFEP SVWAKLRRGL
FEAFATSFSP MLYISLAICA IDWVFYPAIL AEDWGGLALA FVLVSLAIAL IQTSSVIALK
WLLMGRYRPG MRPMWSWWAM RTEAIAVAYW GLAGKVLLEH LMGTPFLPWM LRLFGVKVGK
GACLLMTDIT EFDCVEIGDF AAINRSAALQ THLYEDRIMK VGRVVVGRGV TVGAFSTVLY
DSHVGDYARL RPLTIVMKGE SIPAHSEWEG APAVPVVHAA GEAVARAA