Gene Msil_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1047 
Symbol 
ID7091875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1131787 
End bp1134852 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content64% 
IMG OID643464386 
Productacriflavin resistance protein 
Protein accessionYP_002361378 
Protein GI217977231 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0882711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGGCC CCAATCTGTC CGAATGGGCG CTGAAAAGCC GCTCGTTCAT CATCTTTCTG 
ATGGTCGGCT TCACGGCGGC CGGGCTGCTC TCCTTCTATC GGCTGGGACG CGGCGAGGAT
CCGCCCTTCA CCTTCCGAAC CATGATCGTG CAGGCCTCCT GGCCCGGCGC GACGCTCGAC
GATACGGTCA AGCAGGTCAC CGAGCGGATC GAACGCAAGC TGCAGGAGAC GCGGGGCCTC
GACTTTCTGC GCAGCTATAC GACGCCGGGC CTCACCACCA TCTTCGTCAC GCTGAAAGGC
AGCACGACGG CGAAGGAGGT GCCGGACATC TGGTATCAGG TGCGCAAGAA CATCGGCGAC
ATCCGCCACA CGCTGCCGCA GGGTGTGGTC GGACCCGGCT TCAACGATGA TTTCGGCGAC
ACGTTCGGCA TCATTTACGG CTTCACCGCC GATGGCTTCA GCCATCGCGA ATTGCGCGAC
TATGTCGAGG ACGTCCGCTC CAAGCTGCTC CACGTCAACG ACGTCTCGAA GATCGAGATC
ATCGGGGCGC AGGACGAGCA GATTTTTGTT GAATTCTCAA CTCAGAAACT GGCCGGGCTT
GGCATTGACC GCGCCGCGCT GATCGCGGCG CTGCAAGCGC AAAATGCGGT GAGTCCGGCG
GGATCGATCC AGACGGGCGA CGAGAAGCTC TCGCTGCGCG TCTCCGGAGC GTTCCGGTCC
GAGGACGATA TTCTCAACGT CAATTTCCTG TCGAACGGGC GGCTGATCCG GCTGCGCGAT
ATCGCCGAGG TTCGGCGCGG CTATTCCGAT CCGCCGCAGC CGTTGTTCCG GGTCGACGGC
AAGCCGGCGA TCGGGCTCGC CATCGCCATG CGCGACGGCG GCGACATCCT CGCGCTTGGC
GCCAACATCC GCAGCGAACT TGACAAAGCG GTGGCCGAAC TGCCGCTCGG CGTCGAGCCG
GCGCTGGTGT CCGATCAGCC GCAGGTGGTG ACGACGGCGA TCGGCGAATT CATGGAATCG
CTCTGGCAGG CCGTCGCCAT CATCATGGCG ATCAGCGTGG TCAGCCTTGG CCTGCGGCCC
GGCGCGATCG TCGCCCTGAC GATCCCCCTG ACGATCGCTA TCGTCTTCCC GATCATGGAG
TTTCTGGACA TCGATTTGCA GCGCATCTCG CTTGGCGCGC TGATCATTTC GCTGAGCCTT
CTCGTCGACG ACGCCATGAC GACGATCGAC GCCATGACGA CGCGGCTCGC GCTCGGCGAC
GACAAGGAGA AGGCGGCAAG CTTCGCCTAT AAGACGCTGG CCTTCCCGAT GCTGACTGGC
ACTTTCGTCA CCATCGCCGG CTTCGTGCCG ATCGGCTTCG CGCGCAGCGC CGCCGGCGAA
TATACGTTCT CGATTTTCGC CGTGGTGGCG ATCGCCCTGA TCGTCTCCTG GTTCGTCGCC
GTGCTGTTTG CGCCGCTGCT TGGCGTCTGG CTCTTGAAGG CGCCCCCGGC GGGGGCCGCG
CCGGAGAAGC CCAACATCGT GCTGCGGCTG TTCCGCTCGG TGCTTGTCGG CATGATGCGG
ATGCGCTGGA TCTCGATCGC CGCGGCGCTC GGCTGTCTCG TCGCGGCCTT GATCGTGCTT
CCGCATGTGC CGCGGCAGTT CTTTCCGGCG TCCGACCGGC CGGAGCTCGT CGTCGATCTA
ACCTTGCCGC AGAACGCCTC GATCTACGCA AGCGAAGCAG CCTCGGCGAA GCTCGACGCT
ATGCTGAAAG AGGATCCGGA CGTTGCGAGC TGGAGCACCT ATGTCGGCCG CGGCGCGATC
CGCTTCTATC TTCCGCTGAA TGTGCAACTG GCCAATGATT TCTTCTCGCA GGCGGTGGTC
GTCGCCAAGG ACGTCGCCGC GCGCGAACGC CTGCAGGCGA AGCTTGAAAA AGAGCTGGCC
GAACAGTTGC CGACCGTCGT CGCCCGCGTG TCGCCGCTCG AGCTCGGGCC GCCGGTCGGC
TGGCCGGTGC AATATCGGGT CAGCGGGCCC GACACGGGCA AGGTGCGCGA GATCGCGCTG
AAACTCGCCG AAGTCATGGG CGGCAATGCG ACAGTGCGCG ACGTCAATTT CGACTGGATG
GAGCCCGCGC GGAAGGTTCG CATCAAGATC GATCAGGATC AGGCGCGTCT TCTCGGCCTC
AGTTCGCAGG CGCTGGCGAC GTCGCTCAAC GCGGTGATGA CGGGCCTGCC GATCACGCAG
GTGCGCGACG ACATCTATCT CGTCAATGTC GTTGCAAGAG CGACCGACGA ACAGCGGATT
TCGCTCTCCA CGCTGCGCTC CATGCAACTG CCGGTCGCGG GCGGGCGCAC CGTGCCGCTG
AGCCAGGTCG CGAGCTTCGA TTTCGAACAG GAACTGCCGC TTCTGTGGCG GCGCGACCGC
ACGCCGACCC TGACGGTGCA AGCCGAAACC GCCAAAGGCG TCTTGCCCGA GACGGCCGTA
CATGCGCTGG CGCCGGCGAT CGACAAGCTT CGCGCAAGCC TGCCCGATCA ATATCAAATC
ACGACCGGCG GCACGGTCGA GGAAAGCGTT GCCTCGCAGG CCTCGGTCTT CGCTATGCTG
CCTGTGGCGG CGATCCTGAT GCTCTTTTTC CTGATGATGC AGCTGCAGAG TTTTGGGCGG
ATGTTCCTTG TCGTTGCGAT CGTGCCTTTC GGCCTGATCG GCATTGTCTT CGCCCTGTTT
GCGGCGAATC GGCCGCTCGG CTTCGTCGCA ATCCTCGGCA TTCTGGCGCT GGTCGGCATG
ATCGCCCGCA ATGCGGTCAT TCTGATCGAT CAGATCGAGA CGGAACGGGC GCAGGGCCGC
GACATCTGGA ACGCCGTCAT CGAGGCGGCG CTATCGCGCT TCCGGCCGAT CATGCTGACG
GCGATCTCGA CCGTGCTCGG CATGATCCCG ATCGCGCCGA CGGTGTTTTG GGGGCCGATG
GCCTTCGCCA TCATGGGCGG GTTGTTCGTC GCGACGATGC TGACGCTGCT TGTGTTGCCG
GTGTTCTACA TCACGCTGTA TGGCGCGAAG GAGACGGCCG CAGCGAAGGA GCCGGTCGCT
GCCTAA
 
Protein sequence
MTGPNLSEWA LKSRSFIIFL MVGFTAAGLL SFYRLGRGED PPFTFRTMIV QASWPGATLD 
DTVKQVTERI ERKLQETRGL DFLRSYTTPG LTTIFVTLKG STTAKEVPDI WYQVRKNIGD
IRHTLPQGVV GPGFNDDFGD TFGIIYGFTA DGFSHRELRD YVEDVRSKLL HVNDVSKIEI
IGAQDEQIFV EFSTQKLAGL GIDRAALIAA LQAQNAVSPA GSIQTGDEKL SLRVSGAFRS
EDDILNVNFL SNGRLIRLRD IAEVRRGYSD PPQPLFRVDG KPAIGLAIAM RDGGDILALG
ANIRSELDKA VAELPLGVEP ALVSDQPQVV TTAIGEFMES LWQAVAIIMA ISVVSLGLRP
GAIVALTIPL TIAIVFPIME FLDIDLQRIS LGALIISLSL LVDDAMTTID AMTTRLALGD
DKEKAASFAY KTLAFPMLTG TFVTIAGFVP IGFARSAAGE YTFSIFAVVA IALIVSWFVA
VLFAPLLGVW LLKAPPAGAA PEKPNIVLRL FRSVLVGMMR MRWISIAAAL GCLVAALIVL
PHVPRQFFPA SDRPELVVDL TLPQNASIYA SEAASAKLDA MLKEDPDVAS WSTYVGRGAI
RFYLPLNVQL ANDFFSQAVV VAKDVAARER LQAKLEKELA EQLPTVVARV SPLELGPPVG
WPVQYRVSGP DTGKVREIAL KLAEVMGGNA TVRDVNFDWM EPARKVRIKI DQDQARLLGL
SSQALATSLN AVMTGLPITQ VRDDIYLVNV VARATDEQRI SLSTLRSMQL PVAGGRTVPL
SQVASFDFEQ ELPLLWRRDR TPTLTVQAET AKGVLPETAV HALAPAIDKL RASLPDQYQI
TTGGTVEESV ASQASVFAML PVAAILMLFF LMMQLQSFGR MFLVVAIVPF GLIGIVFALF
AANRPLGFVA ILGILALVGM IARNAVILID QIETERAQGR DIWNAVIEAA LSRFRPIMLT
AISTVLGMIP IAPTVFWGPM AFAIMGGLFV ATMLTLLVLP VFYITLYGAK ETAAAKEPVA
A