Gene Msil_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1002 
Symbol 
ID7093681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1084092 
End bp1088168 
Gene Length4077 bp 
Protein Length1358 aa 
Translation table11 
GC content64% 
IMG OID643464341 
Productnon-ribosomal peptide synthetase 
Protein accessionYP_002361333 
Protein GI217977186 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain
[TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0203382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAGA ACGGCGCGGC GACTGCCGAT GCGCAAGAGG CTACGGGGAC AGCGCAGTGC 
GTATTGCGCG GCCTGCGCTT GCCGAACCTG CGGCGCGACG AGCTCCTGTG CGAGATTTTT
GCCGCCACAG TCGCCGCCGA TCCGGACGCG CTGGCGATGG TGACGCGCGA CGGCGCGCTG
ACCTACGCTG AAGTCGATGA GAGAGCAGAG GCGATCGCCC GCGGCCTTCT GCGCGCGGGC
CTGCGTCCGG GAGACATCGC CGGCCTGTGG ATGCCGCGCG GCCACGAGCT TTTGATCGGA
CAGATCGCCA TCGCCAAGAT CGGCGCCGCC TGGCTCCCCT TCGACGGCGA TGCGCCCGTT
GACCGGATCG CTGTCTGCCT CGATGACGCC GCCGCCAAGC TGATTGTGAC CACGGCGGAT
TTCGCCGCAA AACTCGCCGG CCGCGTCGGA TGCGCCATTC TGACGCCGCG CGAGCTTGCC
GATTATTCGA CAGATGAAAA AATCGATGCG CGCGCGCTCG GCGCAACGCC CGATTCCCCC
GCTTATCTAA TCTATACGTC GGGTTCGACC GGGACGCCGA AAGGCATCGT CATCACCGGC
GCCAATATCT GCCATTATTT GCGCGCGGCG AATGAGATCT ACCGGCTTGA CGCCACGGAT
GTGATGTTTC AGGGCGCTTC GGTCGCCTTC GATCTGTCGA TGGAAGAGAT CTGGCTCCCT
TATCTTGTTG GCGCGCGGCT GTTCGTTGCG ACGCCGGAGG TCATGGGCGA GGCCGACAAG
CTGCCCGAAA TCATGGAGGC GAATGGCGTC ACCGTCCTCG ACACGGTGCC GACGCTGCTC
GCGTTGCTGC CGCGCGACGT CGTGACGCTG CGGGTGATCA TTCTGGGCGG CGAGGCCTGT
CCGCCCGCAA TCGCGGGCCG CTGGTGCAAG CCGGGGCGGA AGATCTTCAA CTCCTACGGT
CCGACCGAGG CGACCGTCGT CGCCACCATC GCCGAGGTGC AGCCGGGCGC GGCCGTTACG
ATCGGCGGTC CGATCCCGAA CTATTCCTGT TACGTCGTCG ATGATGAACT TCATCTCGTA
GCGCCGGGCA GCGAGGGGGA GCTTCTAATC GGCGGCCCCG GCGTCGCGCG CGGCTATTTG
AAGCGTCCGG AGCTGACGGC CGAGAAATTT ATTCCCAATC CGTTTCCCGT CGCGGACTTC
GATGCGGCGA CCGGCGATCC GGTGCTTTAT CGCTCCGGAG ACGCCGTCGC GATCAACGAG
GCCGGCGAAA TCCTGTTTCG CGGCCGCATC GACGATCAGG TCAAGGTGCG CGGCTTCCGG
GTCGAGCTCG GCGAGATCGA GGCGAAGCTC GGCGATCTTG AAGGCGTCGC CCACGCCGCC
GTGGTTCTGC GCAATGATGC GGGCGTCGAT CAGCTTGTCG CCTTCCTCGT GCCGGCGCCC
GGCGCGGTCG AGGCCGGCGC GCTCGAGACT CGCGTATTGC GCGGCGCGCT GCGGGCGAGC
CTGCCGCCCT ATATGGTGCC GAGCCGTTTC GAATCGATCG CCACCCTGCC GAAACTGTCG
TCCGGCAAGG TCGACCGCAA GAGCCTGAAG CTCGTCCGGC TCGCCGAGGT CGATTCCAGC
GAGGCGCAGG AAGATCCACG CACATCGACG GAAGCCAGCC TGCTCGCCGC CGCGAAGGAG
GTGCTGCCGC CGCAGGCGAT TCCCTTCGAC GCCGATTTCT TCACGGACCT TGGCGGCCAT
TCGCTTCTCG CCGCGCGCTT CATCTCGATT GTCCGCAAGA CGCAGGCGCT CTCCCGCGTC
ACGCTTCAGG ACGTCTATTC GGCGCGGACG CTGCGCGGCA TTGGCGAACT CATCGATCGT
AAATGGGCGC ATCTCGCCGG GCCTGCCGAT CTTGGCTTCG ACCCGCCGCC GCTGCTGCGG
CGCTTTCTCT GCGGCCTGGC CCAGGCCGTC GCTTTGCCAA TCATCCTTGC GCTCGTCACG
GCGCAATGGC TTGGCGTCTT CGTCAGCTAC ATGCTGCTGA CCGGCGCTGA AGCGGCGATC
GGCGAGGAGA TCATCTCGCT GATCGCCGTC TATATGTGCA TCAACATCGT CACCGTCGCC
ATCACCATCG CGGCGAAATG GCTGATCATC GGGCGCACCA AGCCGGGCCG CTACCCGTTG
TGGGGCGTCT ATTATTTCCG CTGGTGGCTG GCGCGGCGGT TCATCGGGCT CGTCCACATC
AAATGGTTTT CCGGCTCTCC CTTTATGCGC TTTTATCTGC GCGCGCTCGG GGCGAAGGTC
GGCAAGGACG CCATCATCGG CGAGGTCGAC GCCGGCGCGA TCGACCTCAT CTCCTTCGGC
GACGGCGCGA GCGTCGGCTC GATCGCCAAT CTCGCCAATG CGCGGGTCGA GGGCGGCGAA
CTCATCATTG GTTCGATCGA GATCGGCGCA GACGCCTATA TCGGCTCGTC CTGCGTCATC
GAGGAAGATG TCGTGATCGG ACGCGGCGCC GAGATCGGCG ATCTGAGCGC GATCGGCGCC
GGCGGGCGCA TTGGCGATTA TGAGAGCTGG GACGGCTCGC CTGTGCGCCA GACCGGCAAG
ATCGACCCCT CCGAGCTCGG CGCCGCCTCG ATCGGCTCGA TCCCGCGCCG TTTCGCAATG
GGCGGCGTCT ATCTCGCGCT GCTGCTGGCC ATTCCGCCGC TCGGCCTGTT GCCCATCTTC
CCCGCCTTCT GGGTGTTCGA CCGGATCGAC GACATTATCG GCATCGGCGA CATCGACCGC
TTCCATTATA TGATGATGAT CCCGATCATG GCGTGGCCGA CCGCCTTCGT CATGGTTCTG
GTGACGGTGG GCTTCATCGC CGCGTGCCGC TGGATCATCC TGCCGCGCGT GCGCGAGGGG
ACCTATTCGG TCCATTCCTG GTTTTATCTG CGCAAATGGG CGGTGACGCT CGCGACCGAG
ATCACCCTCG AAACGCTGTC CTCGCTTTAC GCCACCGTCT ATATGCGCGG CTGGTATCGG
CTGATGGGCG CGAAGATCGG CAAGGACGCC GAAATCTCGA CCAATCTGTC CGGCCGCTAT
GACCTCGTCG AGATCGGCGA AAAGAACTTC ATCGCCGATG AAGTGGTGTT CGGCGACGAG
GAAGTGCGCA ACGGCTGGAT GGTGCTGCGG CGCGTCAAGA CAGGGCCGCG GGTCTTTGTG
GGCAATAGCG CCGTGATTCC GACCGGCGCC GACATTCCGG CCAATGCGCT GATCGGCATC
AAGTCGAAGC CGCCGGCGAA CCACCTCATG AATGAGGGCG ACACCTGGTT CGGATCGCCG
CCGATCAAAT TGCCGGTGCG GCAGAAATTC GACGCCGGCG GCGCCAACTG GACCTATGAA
GCGCCCAAGT GGAAGAAATT CGCCCGCGCC TGCTTCGAGG CGGTGATGAT CTCGATGCCG
ACGATGCTGT TCATCACGCT CGGCACCTGG GCGGTCGAAT GGTTCAGCGC GAGCGTGCTG
GACGGCGATT ATGGCGCGGT CGCGGTGCAG TTCACATTCT CGTCGGTCGC CATCTGCCTG
ATGCTGACCA TCGTCGTGAT CGTGCTGAAA TGGCTGACCA TGGGGCGCTA TGAGCCTATG
GTGAAGCCGA TGTGGTCCTG GTGGGCGATG CGCACCGAGG CCGTGGCCGT GATCTATTGG
GGCATGGCCG GCAAGGTGCT GCTCGATCAT TTGCGCGGCA CGCCGTTCCT GCCTTGGATG
ATGCGCCTGT TCGGGGCGAA ATTCGGCAAG GGCGTTTATA TGGACATGAC CGACATCACC
GAATTCGACT GTGTCAAGGT CGGCGATTTT GCCGCGCTGA ATGCGATCTC CGCGCTGCAG
ACGCATCTTT ACGAGGATCG CGTGATGAAG GTCGGCCGCG TCGCCATCGC CGACGGCGTC
ACCATCGGGG CCGGCTCGAC GGTTCTCTAT GATACTCTCG TCGGCGATTA TGCGCGGCTC
GGACCGCTGA CGCTGGTGAT GAAGGGCGAG CAGATTCCGC CCCATTCCGA ATGGGTCGGC
GCGCCGGCCG AGCCAAAAGG CGCGGCCGAC GTCATCGTAG AGAAGGAAGC GGCATAA
 
Protein sequence
MTQNGAATAD AQEATGTAQC VLRGLRLPNL RRDELLCEIF AATVAADPDA LAMVTRDGAL 
TYAEVDERAE AIARGLLRAG LRPGDIAGLW MPRGHELLIG QIAIAKIGAA WLPFDGDAPV
DRIAVCLDDA AAKLIVTTAD FAAKLAGRVG CAILTPRELA DYSTDEKIDA RALGATPDSP
AYLIYTSGST GTPKGIVITG ANICHYLRAA NEIYRLDATD VMFQGASVAF DLSMEEIWLP
YLVGARLFVA TPEVMGEADK LPEIMEANGV TVLDTVPTLL ALLPRDVVTL RVIILGGEAC
PPAIAGRWCK PGRKIFNSYG PTEATVVATI AEVQPGAAVT IGGPIPNYSC YVVDDELHLV
APGSEGELLI GGPGVARGYL KRPELTAEKF IPNPFPVADF DAATGDPVLY RSGDAVAINE
AGEILFRGRI DDQVKVRGFR VELGEIEAKL GDLEGVAHAA VVLRNDAGVD QLVAFLVPAP
GAVEAGALET RVLRGALRAS LPPYMVPSRF ESIATLPKLS SGKVDRKSLK LVRLAEVDSS
EAQEDPRTST EASLLAAAKE VLPPQAIPFD ADFFTDLGGH SLLAARFISI VRKTQALSRV
TLQDVYSART LRGIGELIDR KWAHLAGPAD LGFDPPPLLR RFLCGLAQAV ALPIILALVT
AQWLGVFVSY MLLTGAEAAI GEEIISLIAV YMCINIVTVA ITIAAKWLII GRTKPGRYPL
WGVYYFRWWL ARRFIGLVHI KWFSGSPFMR FYLRALGAKV GKDAIIGEVD AGAIDLISFG
DGASVGSIAN LANARVEGGE LIIGSIEIGA DAYIGSSCVI EEDVVIGRGA EIGDLSAIGA
GGRIGDYESW DGSPVRQTGK IDPSELGAAS IGSIPRRFAM GGVYLALLLA IPPLGLLPIF
PAFWVFDRID DIIGIGDIDR FHYMMMIPIM AWPTAFVMVL VTVGFIAACR WIILPRVREG
TYSVHSWFYL RKWAVTLATE ITLETLSSLY ATVYMRGWYR LMGAKIGKDA EISTNLSGRY
DLVEIGEKNF IADEVVFGDE EVRNGWMVLR RVKTGPRVFV GNSAVIPTGA DIPANALIGI
KSKPPANHLM NEGDTWFGSP PIKLPVRQKF DAGGANWTYE APKWKKFARA CFEAVMISMP
TMLFITLGTW AVEWFSASVL DGDYGAVAVQ FTFSSVAICL MLTIVVIVLK WLTMGRYEPM
VKPMWSWWAM RTEAVAVIYW GMAGKVLLDH LRGTPFLPWM MRLFGAKFGK GVYMDMTDIT
EFDCVKVGDF AALNAISALQ THLYEDRVMK VGRVAIADGV TIGAGSTVLY DTLVGDYARL
GPLTLVMKGE QIPPHSEWVG APAEPKGAAD VIVEKEAA