Gene Msil_1946 details


Gene Information

Locus tag: Msil_1946
Symbol:
ID: 7094064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2119816
End bp: 2121711
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 49%
IMG OID: 643465273
Product: Site-specific DNA-methyltransferase (adenine-specific)
Protein accession: YP_002362251
Protein GI: 217978104
COG category: [L] Replication, recombination and repair
COG ID: [COG2189] Adenine specific DNA methylase Mod
TIGRFAM ID:
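
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon. The helper names `span_length` and `coding_codons` are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2119816
END_BP = 2121711
GENE_LENGTH_BP = 1896
PROTEIN_LENGTH_AA = 631


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2121711 - 2119816 + 1 gives 1896 bp, and 1896 / 3 - 1 gives 631 residues, matching the listed gene and protein lengths.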


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC CGGAACATAC TCAACCGGAG AAGATCGATT TGCGATCTAT GGATGTTGCG 
GAGGAGAAGC GGGGCGAGTT GAAGCGTAGC TTGGGACAGG CGTTTCCGGA CATCTTCGCC
GAAGGTTCGA TAGATTTTGA TCAGCTCAAA CGGGCTCTTG GCGAGTGGGT GGACCCCGGC
AAGGAACGGT TCGGTCTCAA CTGGCCGGGT AAGGCCGAAT GCATGAAGAT TATTCAACAG
CCAAGCCCCG CAACATTAAG GCCAGAGCGC GAAAAGTCAG CAAACTTCGA CGAGGCAGAG
AACGTCTTCG TAGAGGGGGA CAATCTTGAG GTCTTGAAAC TGCTTCAGAA AGCTTATTTC
GGTAAAGTAA AACTCATATA TATCGACCCG CCGTACAACA CGGGTAATGA GTTTATCTAT
CCTGATAATT TTACCGAAAC GCTGGAAACG TACCTGGCCT ATACAGGACA AGTTGACGAC
GAAAGAAAGC GCTTTTCAAC AAACACAGAT CAGTCTGGCC GGTACCATTC TCGCTGGATG
AATATGATGT TTCCGCGGCT TTATCTTGCC CGCAACTTGC TAAGAGATGA TGGCGCCATA
TTCATTTCGA TAGATGACAA TGAAGTTCAT AATCTCCGTG CCTTAATGGA TCAAATCTTT
GGCGAAGAGA ATTTCGTGGC TACGATCATT TGGCAGAAGG TTTACGCTCC AAAAAATAGC
GCGAAATTCT TCTCCGATGA TCACGACTAT ATCCTCGTTT ACGCACGCAA CTCGGATCAG
TGGAAACCCG AGCTGCTTGA AAGGACACCC GAGCAAGATG CTTTATACAA AAATCCGGAT
AAGGATCAAC GCGGCCCTTG GATGTCCGAC AACCTTACTG CGCGCAATTT TTATGGAGAG
GGCTCTTACG AAGTCACTGG ACCTTCCGGT AAGAAATTTA CTCCAGGAAA AGGGCGATAC
TGGCCCGTTT CTCAATCAAA GTTCAATGAT TTGAACGCAG ACGGGAGAAT ATGGTGGGGC
GTTTCTGGAG ACAGCATGCC GCGTTACAAA CGGTATTTGT CCGAAGTGTC TGCTGGACGC
GTTCCTCAAA CGCTCTGGAA GTATGAAGAG GTTGGTCATA CGCAAGACGC AAAGCGAGAG
TTGAACAAAT ATGTGCCTTA CGAGGAAACC GAGAATACGT TGAACTCTGT AAAGCCGGTT
AACCTCATCA GGCGCATGAT CAAAATAGCC ACGAAGAGCG ACGGCGACAT TGTGCTGGAT
TTTTTTGCTG GCAGCGGCAC CACAGGACAG GCGGTTATAG AGCAGTCTTT GGACGACGGC
ATCAGACGAC GGTTCATCAT GGTGCAGCTA CCGGAGGAAC TGCCGAAGCC AGAAGCGAAT
TTTAAGACGA TCTCCGATTT TGCCCGTGCG AGGGTAAAAA ATGTAATTGC AGCGAGTCAG
TCCGATTTAT TGAAAAAGGA CAGTCATGGT CGCGGATTTC GATCTTTTGC GCTTGATAGT
TCCAATTTCA GGTCGTGGGA TGGAGGCGTG TCGACTTCAA GCGATTTAGA ATCGCAGCTG
CAGCTGCATG TCGACCATCT TCGCGATGCA GGAGAACCGG AGGATGTACT TTACGAATTG
CTTCTGAAAT CGGGCTTCCC GCTGGCGACC AAGGTTGTAA AGATCGACCT AAATGGTGCT
GAGGTATTTT CGATTGAAGA GGGAGCGCTG CTCATTTGCC TCTCTAAGGA GATTACGCCG
GAGCTGATCG ACGCGCTCGC TGAAGCGAAC CCGCTTCAGG TTATATGTTT AGACGAGGGC
TTCAAGGGCA ATGACCAACT GAAAACGAAC GCGGTACAGA CCTTCAAGGC CCGCGCTCAG
GCCGAGGAAT CCGAAATCGT CTTTAAGACG GTTTGA
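
The record lists a GC content of 49%. A minimal sketch of how such a percentage could be recomputed from the gene sequence above is shown here. The function name `gc_content` is hypothetical, and the sketch assumes the sequence is pasted as plain text in whitespace-separated blocks, as displayed above. How the source page rounds the value is not stated, so no output for the full sequence is claimed.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First 10-base block of the gene sequence: 2 G + 2 C out of 10 bases.
    print(gc_content("ATGAACAAGC"))
```

Passing the complete gene sequence as one string gives the figure to compare against the listed 49%.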
 
Protein sequence
MNKPEHTQPE KIDLRSMDVA EEKRGELKRS LGQAFPDIFA EGSIDFDQLK RALGEWVDPG 
KERFGLNWPG KAECMKIIQQ PSPATLRPER EKSANFDEAE NVFVEGDNLE VLKLLQKAYF
GKVKLIYIDP PYNTGNEFIY PDNFTETLET YLAYTGQVDD ERKRFSTNTD QSGRYHSRWM
NMMFPRLYLA RNLLRDDGAI FISIDDNEVH NLRALMDQIF GEENFVATII WQKVYAPKNS
AKFFSDDHDY ILVYARNSDQ WKPELLERTP EQDALYKNPD KDQRGPWMSD NLTARNFYGE
GSYEVTGPSG KKFTPGKGRY WPVSQSKFND LNADGRIWWG VSGDSMPRYK RYLSEVSAGR
VPQTLWKYEE VGHTQDAKRE LNKYVPYEET ENTLNSVKPV NLIRRMIKIA TKSDGDIVLD
FFAGSGTTGQ AVIEQSLDDG IRRRFIMVQL PEELPKPEAN FKTISDFARA RVKNVIAASQ
SDLLKKDSHG RGFRSFALDS SNFRSWDGGV STSSDLESQL QLHVDHLRDA GEPEDVLYEL
LLKSGFPLAT KVVKIDLNGA EVFSIEEGAL LICLSKEITP ELIDALAEAN PLQVICLDEG
FKGNDQLKTN AVQTFKARAQ AEESEIVFKT V
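
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship. It is an assumption-laden example rather than the method used by the source database: the names `CODON_TABLE` and `translate` are hypothetical, and the amino-acid assignments are those of the standard genetic code, which table 11 shares for internal codons. Alternative start codons are not given special treatment here, which does not matter for this gene because it starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGAACAAGC CGGAACATAC TCAACCGGAG"))
```

For the first 30 bases of the gene this yields `MNKPEHTQPE`, the first block of the protein sequence. The final six bases, `GTTTGA`, translate to `V*`, where `*` is the stop codon that is not counted in the 631 aa protein length.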