Gene Msil_1246 details

Gene Information

Locus tag: Msil_1246
Symbol:
ID: 7092319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1329472
End bp: 1332288
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 63%
IMG OID: 643464587
Product: condensation domain protein
Protein accession: YP_002361577
Protein GI: 217977430
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID:
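
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name check_gene_record is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True when the coordinates and the two lengths agree.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the last codon is a stop codon that is
    not counted in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    whole_codons = gene_length_bp % 3 == 0
    codons = gene_length_bp // 3          # includes the stop codon
    return (
        span == gene_length_bp
        and whole_codons
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields of this record.
print(check_gene_record(1329472, 1332288, 2817, 938))
```

For this record the span is 1332288 - 1329472 + 1 = 2817 bp, which is 939 codons, or 938 residues plus one stop codon.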


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGATA ACGGCGTGCA GCGTCCGGAA GCGGAAACGA TGAGCCCTGT CGAAGGGCTG 
GTCTTTCCCT GCTCGTCCGC CCAGAAGCGC TTCTGGTTCC TCAATGCGGT CAATCCCGGC
AATCCGGCGC TCAATGTGGC GTTGCGTTGG GAGGTCACCG GTCGACTGAC GCCCGCAACG
GCGGAGCAGG CCTTCCAGGC AATTGTCGAT CGACATGAGA TTCTGCGCAC CCGCGTCTTC
GAGCAGGAAG ACGGCGAGCC GATGCAGGAG GCGCTTGCCC ATCTCGCCTT TCGGCTGAGC
GTGATCGACC TCAGCATGCT TAGCGAGAGC GAGCAGACCA GGGAGGCGCT GGCGCTCGGC
GAACGCGAGG CGCGCAAATC CTTCGACCTT TCGGTCGCGC CGCCGATCCG CGTGACCTTC
CTGCGTCTGT CGCTGGAGCG CGCCTTTCTG CTCGTCACGG TGCATCACAT CGCTTTCGAC
GGCTGGTCGA TTCGTGTGCT CGCCGAGGAG TTTGGAACGA CCGCTGCGGC CATCGCCGCC
AAACGCGCGG CCGATCTGCC GCCGCTGCCC CTTCAATATG GCGACTACTG CCTGTGGCAG
AAGGAATATC TGGCGAGCGG CGAATTCGAG GAAGAGGCCG GCTACTGGAA GAAGCAGCTT
GGCGGCGCTC CCTATTTCGA GCTTGTCCCC GATCACGAGA AACCGCCCGC GCTGACCTAC
AATGGCGAAA TCCTCGCCGC CACCCTTCCC GCAGAGCTTG GGGATAGGAT GGAGGCTTTT
GCTCGCAGCC AAAATCTCAC CCTGTTCAGC CTCGGCTGCG CGGTGATGGG GGCGATGCTG
CATCGCTTTT CGGGCGAATC CGACATCGTC TTCGGGACCC AGATCGCCGA TCGCAACGAT
CCGGACCTCG AGCCGATGAT CGGCATGTTC GTGAATAATC TCGTGATGCG CCTCGATGCG
TCGGGCGATC CGAGCTTCGA GGCGTTCCTC AAACGCGTCA ACGGGACGGT GCAGGACGCG
CTGATCAACC AGCGCATGCC GTTCGACAAA CTGGTCGAAC TCCTCAATCC GCCGCGCGAT
CCGAGCCGCG CACCGCTGAT CTCGCTCAAT TTCACCGTTC TGCGCGATGT GATGGATCAC
AAGCGCTACG GCGATTTCGA ATTGCTCGGA TTGCCCTCGC TGTCCGCCGG CTCGCTCTAT
GACATCTTCT TTTTCCTGGT CCATTGGCCG AGCGGCTGGC GCGTGGCCAT GGAATATAAT
CCCGACCTAT TCCAGCGCGA GACGGCCGAG AAGCTGCTCG ACTTCCTGCT CGCGACATTC
GACTTCGCGG TGGGGCGTCC GCAACAGAAG CTCTCCGATC TGACGCCGCC GTCGCGCGAT
CTGATCGACG CGCGCCGGCG CGGCGATGAA CTGGGCGCCA TTGAGGCCGC GCTGCTTCAG
CATCCGGACG TGAAGGAGGC GGCGGTCGCG CTAAAGCCCG ACAGCGCCGG CCGCGAACAG
CCCTGCGCCT ATATCGCGCC GCGCCCGGAG TCGCGCGCGC CGCTCGAATC ATTGCCCGGA
ATCCTTACCG CCCATCTCGA CGCCATCCTG CCTTCGGGAG CGGCGAGGCC CGCGGGGATC
AGCGTCCTTT TGGCCTTGCC GCGCACCGCT CGGGGAGACG TCGATCTGCG CGGCCTGCCG
GCGCCCGCGC CCTCGCTTGC GCCGCCGGTC CCGGCTGCAA CGGAAGCCGC GCTCACCCCC
GTGGAAATGA GGCTCGGCAA GATCTGGAGC GATCTTCTCA AGGTTGACGC CATCAAGCCG
GCATCGAATT TCTTCGAGCT TGGCGGCCAT TCCCTGCTCA CGGTCAGGCT GATGTCGCGC
GTTTTTGCGG AATTCGGCGT AAAGCTCGAT CCGCTGACCT TGTTCCTCGC GCCGACGTTG
CGGGAATTCG CTTCGAGGCT GCCAGAGCTG CGCAGCGCGG AGCCGACCCA GCGCCTGGTG
CCGATCCAGC CTCTTGGCCA CAAGACCACG ATCATCGCCA TCAACCATTC GGTGCTTTAC
TATAATCTGG CGCGCCAGAT CGGCACGGAC CGGCCGTTTA TCGGCGTTCC GCTCATCGAG
CCGGGGAGCG ACGAGCCTCC GGAGCGCACG CTCGAAGAAA TCGCGGTCGA CTATGTCCGC
GTCATTCGAG AGGCGCAGCC GCATGGACCC TACATCCTCT GCGGCCTCTG CGTCGCGGGC
GCGATCGCCT ATGAATCCGC GCAGCAGTTG ATTGCGGCGG GCGAAGAAGT TCCGTTGCTG
ATCCTCTCGG ATATTTGGGC GCCCGGCTAC CATGCCTCAC TGCCCCTGTA CAGAAAGCTG
ATCTATCAAT TCAACTATCG GCTTCATACA TTACGCCACC GGATCGACAC TGTCCGCACG
GGCGCGGCGT CGCTGGCCGA GATGCTGTCA AGCTTCACCA TCGTGCGGAA ATCGCGTCTG
CTCGAGGCGC TGGCCTATCT CGGACTTGTC GACGGCGACA AATTGTTGCG CCGGGTGATC
GACCATGAGG AATGGCGCTT TTTGTTTTCG CTCGAAGTCG CGCGCAACGC CTATAAGATT
CAGCCGATTT CCTCGGACGT CATTGCTTTC AACAGCGATG AAATCGTGAC GCGCTTTGCC
GATCCCAACA TGGGCTGGTC CGGCCTTGTC AAGAAGCTGG CCATTCACTC GATCCCCGGA
TGGCATCAGG ACATCTTCCG CGAGGAGGGC GCCCGTGAGA TTGCCGGCTA TCTGCGTCCG
GAACTCGAAA AGATCGACAG CGAGTACAGG GCTCCGGCCC GCTCTTCCAA GGACTGA
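
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how the reported GC content could be recomputed from the displayed text; the helper names clean_sequence and gc_fraction are hypothetical and are not part of the database.

```python
def clean_sequence(text):
    """Join the 10-base display blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a displayed nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Example on the first displayed line only; paste the full gene
# sequence to compare against the GC content field of the record.
first_line = "ATGCCAGATA ACGGCGTGCA GCGTCCGGAA GCGGAAACGA TGAGCCCTGT CGAAGGGCTG"
print(round(100 * gc_fraction(first_line)), "% GC in the first 60 bases")
```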
 
Protein sequence
MPDNGVQRPE AETMSPVEGL VFPCSSAQKR FWFLNAVNPG NPALNVALRW EVTGRLTPAT 
AEQAFQAIVD RHEILRTRVF EQEDGEPMQE ALAHLAFRLS VIDLSMLSES EQTREALALG
EREARKSFDL SVAPPIRVTF LRLSLERAFL LVTVHHIAFD GWSIRVLAEE FGTTAAAIAA
KRAADLPPLP LQYGDYCLWQ KEYLASGEFE EEAGYWKKQL GGAPYFELVP DHEKPPALTY
NGEILAATLP AELGDRMEAF ARSQNLTLFS LGCAVMGAML HRFSGESDIV FGTQIADRND
PDLEPMIGMF VNNLVMRLDA SGDPSFEAFL KRVNGTVQDA LINQRMPFDK LVELLNPPRD
PSRAPLISLN FTVLRDVMDH KRYGDFELLG LPSLSAGSLY DIFFFLVHWP SGWRVAMEYN
PDLFQRETAE KLLDFLLATF DFAVGRPQQK LSDLTPPSRD LIDARRRGDE LGAIEAALLQ
HPDVKEAAVA LKPDSAGREQ PCAYIAPRPE SRAPLESLPG ILTAHLDAIL PSGAARPAGI
SVLLALPRTA RGDVDLRGLP APAPSLAPPV PAATEAALTP VEMRLGKIWS DLLKVDAIKP
ASNFFELGGH SLLTVRLMSR VFAEFGVKLD PLTLFLAPTL REFASRLPEL RSAEPTQRLV
PIQPLGHKTT IIAINHSVLY YNLARQIGTD RPFIGVPLIE PGSDEPPERT LEEIAVDYVR
VIREAQPHGP YILCGLCVAG AIAYESAQQL IAAGEEVPLL ILSDIWAPGY HASLPLYRKL
IYQFNYRLHT LRHRIDTVRT GAASLAEMLS SFTIVRKSRL LEALAYLGLV DGDKLLRRVI
DHEEWRFLFS LEVARNAYKI QPISSDVIAF NSDEIVTRFA DPNMGWSGLV KKLAIHSIPG
WHQDIFREEG AREIAGYLRP ELEKIDSEYR APARSSKD
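
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a hedged illustration, the sketch below builds the codon table by hand; table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. The function name translate_cds is hypothetical, and the sketch does not handle alternative start codons or ambiguous bases (this gene starts with ATG, so neither applies here).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(text):
    """Translate a displayed coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":      # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("ATGCCAGATA ACGGCGTGCA GCGTCCGGAA"))  # MPDNGVQRPE
```

The printed residues match the first block of the protein sequence, MPDNGVQRPE.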