Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3972 |
Symbol | |
ID | 5835677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4412671 |
End bp | 4416777 |
Gene Length | 4107 bp |
Protein Length | 1368 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641369763 |
Product | non-ribosomal peptide synthetase |
Protein accession | YP_001641414 |
Protein GI | 163853371 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain [TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.564437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAATCGG CAGCAGAGCA TCGGATGAGC GGTCTCGCCG AGGTGGGACG GCACGACGAA GGACCGGGCG AGCGCCTCGG CGGCCGCGCC GTGCTGCGGG GGGCCGCGCG GCCCGATCTG ATCCGCCGGG AGACCCTGGC CGACCTGTTC CGGGCGAGCG CCGCCGCGCG CGCCGCCGCG CCCTGCCTGA TCGACGCCGC CGAGCCCGGC TCCGGCGGCC GCCGTCCGGT CCTGACCTAT GCGGAGGTCG ATGCCCGCTC CGACGCCATC GCCGCCGGGC TTTTCGCCCG CGGCATCGGT CCGGGGGACG TCGTGGGGCT GTGGATGGCC CGCGGCACCG AACTGCTGAT CGCACAGATC GGCATCACGA AATCCGGCGC CGCCTGGCTG CCCTTCGACG CCGAGGCGCC CGCCGATCGG GTCGCGGTCT GCCTGAACGA TGCCGAGGCC AAGGCGCTCC TCGTCTCGGA GGCCCTGCGG CCTCAGGCGC CGGAGGGCAC GCCCGCCGTC ACCACCGAAG CGCTTCTGCG GGCCGGGCAG GGGGCGTCAC CGCCCGATCT CGACGCCGCT GGCCTCGGTC CGGAGCACCC GGCCTACCTG ATCTACACCT CGGGCTCGAC CGGCGTGCCG AAGGGCATCG TCATCAGCCA CGCCAACATC TGCCACTTCC TGCGCTCGGG CAACGCGGTC TACGGCCTCG GCGCCGACGA CGTGGTGTTC CAGGGCGCCT CCGTGGCCTT CGATCTTTCG ATGGAGGAGA TCTGGGTTCC CTATCTCGTC GGCGCCTGCC TGTTCGTGGC GAGCCCGGCC ATGATGGGCG ACGTCGAATC CCTGCCCGCG ATCATCGCGG AGGCCCGGAT CACCGTGCTC GACACGGTCC CGACCCTGCT CGCCATGATC CCCGGCGACC TGCCGAGTGT CCGCCTCGTG CTGCTCGGCG GCGAGGCCCT GCCCGAGCCG CTGGTGGCGC GCTGGGCGAC TGATCGGCGG CGCCTGTTCA ACACCTACGG CCCGACCGAA GCCACCGTGG TGGCGACCGC CGCCGAGATG CGGCCGGGCC GGCCGGTCAC CATCGGCGGC CCGATCCCGA ATTACTCCGT CTACGTTGCC GACGAGGCGC TAAACCTCCT CGCACCGGGC GAGCAAGGCG AGCTGCTGAT CGGCGGCCCC GGCGTCGCGG CGGGCTACCT CAAGCGGCCC GAACTGACGG CGGAAAAGTT CGTCGCCAAC CCCTACCCGT CCGACGGGAC CGATCCGGTG CTCTACCGCT CGGGCGATGC CGTCTCGATG ACGCCGGAGG GCGACATCGT CTTCCACGGC CGCATCGACG ATCAGGTCAA GATCCGCGGG TTCCGCGTGG AGCTCGGCGA GATCGAGGCA CGCATCCGGG CGCAGGACGG GATCAATCAG GCGGCGGTAG TCTTGCGCCG TGACGACGAG GTTGACCGCC TCGTCGCCTT CCTCGTGCCT GAGCGGAATG CCGCCCTCGA CCGCGCGGCG CTGCGAAAGA ATCTCGTGGC GCAGATGCCG CCCTACATGG TGCCCGGCCA TTTCGAGGAG GTCGAGACCC TGCCGCGGCT TACCTCCGGC AAGGTCGATC GCAAGGCGCT GCGGATCGCG CCGCTGACCG TGGCGGTGGC CGACGGCGAG CAGGAGGCCC CCGACAACGA GACCGAAGCC GCTTTGCTGG CGGCGGCGAA AAAGGTGTTC GGCGACGGTC CGATCGGGCT CGCGGCCGAC TTCTTCTCCG AACTCGGCGG CCACTCGTTG CTCGCCGCCC GCTTCGTCGG CGCGGTGCGC GAGACCCCGG CGCTCGCCGG CATCACCCTG CAGGACGTCT ATAACGGCCG CACCCTGCGG GTGATGGCCG CGAGTCTGAT CGAGCGCACG GGCGGGGCGG GGGCGCAGAC CGCGATCCGC GACCTGAGCT TTGCTCCGCC GCCGCTGCTG CGCCGGGCGC TGTGCGGGCT GGCGCAGGCG ATCACGCTGC CCTTCGTCAT CGCGCTGGCC ACCGCGCAAT GGCTCGGCAT CTTCGTCACC TACCTGCTGC TCACCGGCGG CGGGCTCGGC TTCTGGGGCG AACTCGGCGT TCTCCTGCTG GTCTATATCG GCATCAACGC GGTCACTGCG ACGATCGCGA TCGCCGCCAA ATGGCTGATC CTCGGGCGGA CGAAGCCCGG CCGCTACCCG CTCTGGGGCG TCTATTATTA CCGCTGGTGG CTGGCGCAGC GCCTGACGCC GCTGGTGCAC ATCAAGTGGC TCCAGGGCTC GCCCGTCATC GTCACCTATC TGCGCCTGCT CGGCGCGCGG ATCGGCGACG ACGTGCTGAT CTCCGACCTC GATGTCGGCG CGCCCGACCT GCTCACCATC GGCCGCGGCG CGTCGCTCGG CGGGCGGCTG GTGATCGCCA ACGCGGAGGT CGTCGGCAAC GAACTCGTCA TCGGCTCCGT CGAGATCGGC GAGGATGCCG CCATCGGCAC CTCCTGCGTG ATCGGCCCCG GCACGGTGAT CGGCGACCAT GCCGAGATCG CCGACCTCAC CACCGTGCCG GCCGGCACCG AGGTCGGCTC GGCCGAGGCC TGGGACGGCT CGCCAGGCCG CCGGGTCGGC ACCGTCGATT TTTCCGCGCT GCCCGAACCG GCCACCGCCT CGCCCGCCCG CCGGGCGGCC TTCGGCGCGG TCTACGCCTT CCTGCTCGCG GCGGTGCCGG CGGTCGGCCT GCTGCCGATC TTCCCGGCCT TCTACCTCTT CGACCAGATC TCCGATAACC TCTCGGAACT CACCGACGTC GATTACCACG TCTACCTGCC GCTGCTGACC TGGCCCACCG CCATGCTGAT GACGGCGGGC ACGGTGCTGC TCATCGCCGC CATCCGCTGG GCGGTGCTGC CGCGGGTCAC GCACGGCACC TTCTCGATCT GGTCGGGCTT CTACCTGCGC AAATGGCTGG TGGCGCTCGC CTCCGAAGTG ACGCTGGAGA CGCTCTCCTC GCTGTTCGCC ACCGTCTACA TGCGGGCGTG GTACCGGCTG ATGGGCGCGC GGATGGGAAG GGGCGCCGAG ATCTCGACCA ATCTCGGCTC GCGCCACGAC CTCGTCGCGG TCGGCGCCAA CAACTTCATC GCCGACGAGG TGGTGGTCGG CGAGGAGGAG ATCCGCCGCG GCTGGATGCA TCTCCATCCG GTCGAGACCG GCGCTCGCGT CTTCGTCGGC AATGACGGCG TGCTGCCGCC GGGCGCGCAC ATCCCCGACG ACGTGCTGAT CGGCATCAAG TCGAAGCCGC CGGCCAACGA CAAGATGGGG CCAGGCGAGA CGTGGTTCGG CTCGCCGCCG ATCCGCCTGC CGGTGCGCCA GAAGGTCGAT CTCGGCTCCA ACGCCCAGAC CTTCGAGCCG AGCGTATGGG CCAAGCTGCG GCGCGGCCTG TTCGAGGCCT TCGCCACCTC GTTCTCGCCG ATGCTCTACA TTTCGCTCGC CATCTGCGCG ATCGACTGGG TGTTCTACCC GGCCATCCTC GCCGAGGATT GGGGTGGGCT GGCGCTCGCC TTCGTGCTCG TGAGCCTCGC CATCGCGCTG ATCCAGACGA GCAGCGTCAT CGCCCTCAAA TGGCTCTTGA TGGGACGCTA CCGCCCCGGC ATGCGGCCGA TGTGGTCGTG GTGGGCGATG CGGACCGAGG CCATCGCGGT GGCCTATTGG GGGCTCGCCG GCAAGGTGCT GCTGGAGCAC CTGATGGGCA CGCCGTTCCT GCCCTGGATG CTGCGGCTGT TCGGGGTGAA GGTGGGCAAG GGCGCCTGCC TCCTTATGAC CGACATCACC GAGTTCGACT GCGTCGAGAT CGGCGACTTC GCCGCGATCA ACCGCTCGGC GGCCCTCCAG ACCCACCTCT ACGAGGACCG CATCATGAAG GTCGGCCGCG TCGTGGTCGG GCGCGGCGTC ACGGTGGGCG CCTTCTCCAC CGTCCTCTAC GACAGCCATG TCGGCGACTA CGCGCGGCTG CGCCCGCTCA CCATCGTGAT GAAGGGCGAG TCGATCCCGG CACATTCGGA ATGGGAGGGC GCGCCTGCCG TGCCCGTGGT GCACGCGGCG GGCGAGGCTG TGGCGCGGGC GGCCTGA
|
Protein sequence | MESAAEHRMS GLAEVGRHDE GPGERLGGRA VLRGAARPDL IRRETLADLF RASAAARAAA PCLIDAAEPG SGGRRPVLTY AEVDARSDAI AAGLFARGIG PGDVVGLWMA RGTELLIAQI GITKSGAAWL PFDAEAPADR VAVCLNDAEA KALLVSEALR PQAPEGTPAV TTEALLRAGQ GASPPDLDAA GLGPEHPAYL IYTSGSTGVP KGIVISHANI CHFLRSGNAV YGLGADDVVF QGASVAFDLS MEEIWVPYLV GACLFVASPA MMGDVESLPA IIAEARITVL DTVPTLLAMI PGDLPSVRLV LLGGEALPEP LVARWATDRR RLFNTYGPTE ATVVATAAEM RPGRPVTIGG PIPNYSVYVA DEALNLLAPG EQGELLIGGP GVAAGYLKRP ELTAEKFVAN PYPSDGTDPV LYRSGDAVSM TPEGDIVFHG RIDDQVKIRG FRVELGEIEA RIRAQDGINQ AAVVLRRDDE VDRLVAFLVP ERNAALDRAA LRKNLVAQMP PYMVPGHFEE VETLPRLTSG KVDRKALRIA PLTVAVADGE QEAPDNETEA ALLAAAKKVF GDGPIGLAAD FFSELGGHSL LAARFVGAVR ETPALAGITL QDVYNGRTLR VMAASLIERT GGAGAQTAIR DLSFAPPPLL RRALCGLAQA ITLPFVIALA TAQWLGIFVT YLLLTGGGLG FWGELGVLLL VYIGINAVTA TIAIAAKWLI LGRTKPGRYP LWGVYYYRWW LAQRLTPLVH IKWLQGSPVI VTYLRLLGAR IGDDVLISDL DVGAPDLLTI GRGASLGGRL VIANAEVVGN ELVIGSVEIG EDAAIGTSCV IGPGTVIGDH AEIADLTTVP AGTEVGSAEA WDGSPGRRVG TVDFSALPEP ATASPARRAA FGAVYAFLLA AVPAVGLLPI FPAFYLFDQI SDNLSELTDV DYHVYLPLLT WPTAMLMTAG TVLLIAAIRW AVLPRVTHGT FSIWSGFYLR KWLVALASEV TLETLSSLFA TVYMRAWYRL MGARMGRGAE ISTNLGSRHD LVAVGANNFI ADEVVVGEEE IRRGWMHLHP VETGARVFVG NDGVLPPGAH IPDDVLIGIK SKPPANDKMG PGETWFGSPP IRLPVRQKVD LGSNAQTFEP SVWAKLRRGL FEAFATSFSP MLYISLAICA IDWVFYPAIL AEDWGGLALA FVLVSLAIAL IQTSSVIALK WLLMGRYRPG MRPMWSWWAM RTEAIAVAYW GLAGKVLLEH LMGTPFLPWM LRLFGVKVGK GACLLMTDIT EFDCVEIGDF AAINRSAALQ THLYEDRIMK VGRVVVGRGV TVGAFSTVLY DSHVGDYARL RPLTIVMKGE SIPAHSEWEG APAVPVVHAA GEAVARAA
|
| |