Gene Msil_1551 details


Gene Information

Locus tag: Msil_1551
Symbol:
ID: 7092057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1673298
End bp: 1674617
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 55%
IMG OID: 643464877
Product: type I restriction enzyme
Protein accession: YP_002361863
Protein GI: 217977716
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
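
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the gene is a stop codon that is not counted in the protein length. Both assumptions agree with the listed values, but the record itself does not state them; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1673298
end_bp = 1674617

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1320 439
```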


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGTCTC TTCGTTTCAA AAACGTCATG CGGGAGCGTG TCGACCTTTC AGAGACCGGA 
GAAGAAACGC TTCTTTCTGT TTCTGAATAT TACGGCGTAA AGCCAAGAGC AGAAGCCTTT
CAGGGCGAGG AATACGAATC CCGGGCAGAG TCCTTGGAAG GCTATCGTCA AGTTCAACGC
GGCGATTTCG TCATGAACTA TATGCTTGCC TGGAAGGGTG CATACGGCAT TTCTGAGTAT
GACGGTATCG TCAGCCCAGC ATACGCGGTT TTCCAAATAG ATAAATCTAA GATCGATCTA
AAATATTTAC ACCATAGGAC TAGATCTAAC CCAATGCGAG CGCTTTTCCG CTCCCGATCC
AAGGGCATAA TTGACTCTCG ATTGCGTCTA TACCCAGATG CACTTCTTGC TACGGAGATT
GATCTTCCAG GCCTCGCCGC TCAGAAGGTG ATTGCTGATT TCCTCGACCG TGAGACCGCC
CGCATCGATC AATTGATCGA GAAGAAGGAA CGGTTCTCAG CGCTCGCAGC TGAACGCTGG
CGCGCCACAC TGGACGCTGA GATACTTGGA CGCACGACCG CCGGCAAACG GAGCCTAACA
AGCGGCCAAC CGTATATTTC CGACGTCCCC GCCGACTGGG TTCTTACTCC GCTCAAGCAT
CTTGTCGATC CGCGCAGGCC CGTCATGTAC GGCATCGTCT TGCCAGGCCC AAACGTCGAG
AATGGCATTA TGATCGTCAA AGGTGGCGAC GTGAAGCCGA ACCGCTTGTC ACCAGACCGC
CTCTGCAAAA CCAGTAGGGA AATCGAGGCC GGATACGTTA GGTCCCGCCT GCGCGGAGGT
GACCTTGTAA TGGCTATTCG TGGCGGTATT GGAGACGTTG AAATCGTACC GGCTGACATA
GAGGGAGCCA ACCTCACCCA GGACGCCGCG CGCATTGCTC CTCGTCATGG CGTCCTAAAC
CGCTGGCTGC GGTACGCGCT TCAGGCTCCA TCGGTCTTCG CTCCACTCGG AGCGGGGGCA
AATGGAGCTG CCGTCCGGGG CGTCAACATC TTTGACGTTG ATCGAGTCTT GGTTCCAGTT
CCGCCCACGG CAGAGCAGAT TGTCATAGCT GATCGCTTAG ACATCAAGGA ACAGCAGATC
TTACGCATGC GAGAGAAGAT TTTTGATCAT GCGAAACTGA TCCAAGAATT CCGCGCCGCC
CTCATTACCG CCGCCGTCGC CGGTCAGATC AACGTGGATA CATGGGGTAA ACGCCGTGAG
ACGGACCGCC GCCTCGATCG GATCGAAGAG AAGATGTCAG CCGGAGACGC GCTTGCATGA
 
Protein sequence
MKSLRFKNVM RERVDLSETG EETLLSVSEY YGVKPRAEAF QGEEYESRAE SLEGYRQVQR 
GDFVMNYMLA WKGAYGISEY DGIVSPAYAV FQIDKSKIDL KYLHHRTRSN PMRALFRSRS
KGIIDSRLRL YPDALLATEI DLPGLAAQKV IADFLDRETA RIDQLIEKKE RFSALAAERW
RATLDAEILG RTTAGKRSLT SGQPYISDVP ADWVLTPLKH LVDPRRPVMY GIVLPGPNVE
NGIMIVKGGD VKPNRLSPDR LCKTSREIEA GYVRSRLRGG DLVMAIRGGI GDVEIVPADI
EGANLTQDAA RIAPRHGVLN RWLRYALQAP SVFAPLGAGA NGAAVRGVNI FDVDRVLVPV
PPTAEQIVIA DRLDIKEQQI LRMREKIFDH AKLIQEFRAA LITAAVAGQI NVDTWGKRRE
TDRRLDRIEE KMSAGDALA
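
The two sequence blocks above are related through translation table 11. A minimal sketch of that relationship follows. It strips the 10-base grouping spaces, computes a GC fraction, and translates codon by codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The sketch assumes that an initiation codon in the first position is read as methionine, which would explain why the gene begins with GTG while the protein begins with M.

```python
BASES = "TCAG"
# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for each codon position ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(block):
    """Remove the grouping spaces and line breaks from a sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # Assumption: an initiation codon in first position encodes M.
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "GTGAAGTCTC TTCGTTTCAA AAACGTCATG CGGGAGCGTG TCGACCTTTC AGAGACCGGA"
print(translate(first_row))  # MKSLRFKNVMRERVDLSETG
```

Passing the full gene sequence block to `translate` and `gc_fraction` in the same way allows a comparison with the protein sequence and the GC content field listed in this record.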