Gene Msil_1597 details

Gene Information

Locus tag: Msil_1597
Symbol:
ID: 7090954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1723021
End bp: 1724160
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 64%
IMG OID: 643464923
Product: DNA methylase N-4/N-6 domain protein
Protein accession: YP_002361908
Protein GI: 217977761
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
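
The coordinate and length fields in this record are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS length counts the terminal stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1723021
end_bp = 1724160

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1140 379
```

Both values match the Gene Length (1140 bp) and Protein Length (379 aa) fields.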


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.111041
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTACTG AGACCGGAGC GTTTTCCGCG CCTCTCCCGG CGGATTGTTC TCCCGCGCTT 
CCGCTCAATG AAATCCTGAT CGGCGACTGC CTCGAGCAGC TGGCGCGCCT GCCTGCGGCC
TCGGTCGACG CCGTCTTCGC CGATCCGCCC TATAATCTGC AGCTCGAATC GACGCTGTCG
CGGCCGGATC AGAGCCTCGT CGACGCCGTC AACGACGATT GGGACAAGTT CGACAGCTTC
TCCCATTATG ATTCCTTCAG CAGGTCATGG CTCAAGGCCG TGCGCCGCGT CATGAAGCCC
GAAGCGACGC TGTTCGTGAT CGGCTCCTAT CACAATATTT TCCGGGTCGG CTCGACGCTG
CAGGACGAAG GCTTTTGGAT CTTGAACGAC ATCGTCTGGC GCAAGGCCAA TCCGATGCCG
AACTTTCGCG GACGCCGCTT CACCAACGCC CATGAAACCC TGATCTGGGC CGCGAAGGAT
TCCGCCGCCA AAAACTACCG CTTCAATTAT GAGCTTCTGA AAGCGGGCAA TGAGGATTGC
CAGCTCCGCT CGGACTGGCT TTTCCCGATC TGCACCGGCG CCGAACGGCT GAAAGGTTCG
GACGGGCGCA AGACGCATCC GACGCAAAAG CCGGAAGCTT TGCTGGCCCG TATCCTGATC
GCCGCGACGA ACCCCGGCGA TGTCGTGCTC GACCCGTTCT TCGGCTCGGG CACCACGGGC
GCCGCCGCGA AACGGCTCGG GCGGCATTTC GTCGGCATCG AGCGTGATAA AACCTACGCC
GCCGCCGCGC GGGCGCGCAT CGACGCCGTC GAGACTCTGC CCGAAGCAGC AATCGCGCTG
ACGCCGAGCA AGCGCACCGA GCCGCGCGTC GCCTTTTCGG CAATCGTTGA GGCCGGGCTG
ATCGCGCCCG GCGACAGTCT CGTCGACGAC AAGCAGCGTC ATCGCGCGAC CGTGAGGGCC
GACGGCGCCA TCACGCTTGG GCCGGTCGTC GGCTCGATCC ACAAAATCGG CGCGCTGGCG
CAGGGCCTGC CGGCTTGCAA CGGCTGGACC TATTGGCACT TCGCACAAGG CGGCAAATTG
CAGCCGATCG ACGCGCTGCG CACGGTGGCG CGCGGAAAAC TGCGCGAGGC CGAAGCCTGA
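
A GC content figure like the one in this record can be recomputed from a gene sequence written in 10-base groups. The following is a minimal sketch, not the database's own method; `gc_percent` is a hypothetical helper name, and the record does not say how its percentage was rounded.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between 10-base groups) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Toy input for illustration only, not taken from the record.
print(gc_percent("GGCC AATT"))  # prints: 50.0
```

Passing the full gene sequence, including its spaces and line breaks, gives a value to compare against the GC content field.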
 
Protein sequence
MRTETGAFSA PLPADCSPAL PLNEILIGDC LEQLARLPAA SVDAVFADPP YNLQLESTLS 
RPDQSLVDAV NDDWDKFDSF SHYDSFSRSW LKAVRRVMKP EATLFVIGSY HNIFRVGSTL
QDEGFWILND IVWRKANPMP NFRGRRFTNA HETLIWAAKD SAAKNYRFNY ELLKAGNEDC
QLRSDWLFPI CTGAERLKGS DGRKTHPTQK PEALLARILI AATNPGDVVL DPFFGSGTTG
AAAKRLGRHF VGIERDKTYA AAARARIDAV ETLPEAAIAL TPSKRTEPRV AFSAIVEAGL
IAPGDSLVDD KQRHRATVRA DGAITLGPVV GSIHKIGALA QGLPACNGWT YWHFAQGGKL
QPIDALRTVA RGKLREAEA
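
The protein sequence corresponds to the gene sequence read under translation table 11, where alternative start codons such as TTG are read as methionine when they initiate a CDS. The sketch below illustrates this in plain Python. `translate_cds` is a hypothetical helper, not the database's pipeline, and it assumes the input is an in-frame coding sequence on the coding strand.

```python
# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the
# same assignments as the standard code but allows extra start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence from this record.
first_line = "TTGCGTACTG AGACCGGAGC GTTTTCCGCG CCTCTCCCGG CGGATTGTTC TCCCGCGCTT"
print(translate_cds(first_line))  # prints: MRTETGAFSAPLPADCSPAL
```

The output matches the first 20 residues of the protein sequence, including the leading M produced by the TTG start codon.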