Gene Msil_3571 details

Gene Information

Locus tag: Msil_3571
Symbol:
ID: 7092430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3931098
End bp: 3932474
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 56%
IMG OID: 643466863
Product: hypothetical protein
Protein accession: YP_002363822
Protein GI: 217979675
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
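
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the coding sequence ends in a stop codon that contributes no residue. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3931098
END_BP = 3932474
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3932474 - 3931098 + 1 gives 1377 bp, and 1377 / 3 - 1 gives 458 aa, matching the reported lengths.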


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGATG GCCTTCGCCC TTACGCCGAC ACAAATCCGA CCGGTCTGCC ATGGCTCGGT 
GACGTGCCGG CGCATTGGAA TGTTCGGCGT ATCAAAACGC TGCTTAGGGA GGTCGATAGT
CGCAGCAAGA CAGGAGAAGA ACGGCTTCTG TCGCTGAGAA TGAGACAAGG GCTTGTCGAT
CACATCGATG CGGGCGGGAA GCTGATCCCG CCGGAGTCAC TGGTTAATTT CAAGATCGTC
GAGCCCGGAC AGGTGGTCAT GAACCGCATG AGGGCCGCTG CCGGTTTGTT CGGCGTTGCG
AATGTGCGCG GATTGGTGAG CCCGGACTAC GCCGTCTTCG AACCATTGCC CGAAGCGTTC
AATCCCTACC TCCTTCAAGC ATTCCGATTG CCGTCACTGT CAGCGGTGTT CCGAGCGGAA
TCGAAGGGCT TAGGCACCGG AGAATCCGGG TTCCTTCGCC TTTACACAGA TAGGTTCGGC
CCGATCCCTG TTCCCTATCC GCCGCTCGAT GAGCAGCGGC TGATTGTGCG GTTTCTGGAT
TGGCATGGGG CGCAGACGGC GAAGCTGATC CGCGCGAAGA AGAAGATCAT CGCGCTTCTG
AACGAGCAGA AGCAGGCGAT CATCCACCGC GCCGTCACCC GCGGCCTCGA TCCCAATGTC
CGCCTCAAGC CCTCAGGCAT CCCTTGGCTC GGCGACATCC CTGAAGATTG GGAGGTTTCG
CGCGTAAAGA CTGAGTTTCA GTGCCTCAAC TACCGACGAG TTCCGTTAAG TGGCACAGAG
CGGGGACGAA TGACTGTTCG CCAATACGAT TACTACGGGG CATCAGGAGT AATCGATAAG
GTCGACGAAT TTTTGTTCGA CGACAAACTT CTGCTGATAG CGGAAGATGG CGCAAATTTG
GTCCTTAGAA ACCTCCCTTT AGCGATCATC GCCGAAGGAA AGTTTTGGGT AAACAACCAC
GCTCATATTT TGAAGCCTCG TCGTGGGGAC ATTCGATTTC TCGCCGCAAT TCTTGAGGGA
CTGAATTTCC TTCCATGGAT ATCCGGCGCA GCACAACCAA AACTAACTCA GGATCGCCTT
ATGGGGATCG CAATCGCAGT TCCGCCTGGG CACAAGCAGC TAGAAATCAT TCAAAGCTGC
GATGAGGAAG TGTCCGAACT GGTCCGCGCG ATAAACGTGG CAAGTAAAGA GCTTATCTTT
ATTCAGGAAT TCCGCACCCG CCTGATAGCC GATGTCGTTA CCGGTAAGCT CGACGTGAGG
GCCGCTGCGG CCAGCCTGCC TGAATCCGCC GAACTTGAGG CCACCGAAGA GCTTGTTGAG
GACGACGATC TCGACGAAGC CATTGACGAT GCCGAAAATA AGGAGGTTGC CGCCTGA
 
Protein sequence
MIDGLRPYAD TNPTGLPWLG DVPAHWNVRR IKTLLREVDS RSKTGEERLL SLRMRQGLVD 
HIDAGGKLIP PESLVNFKIV EPGQVVMNRM RAAAGLFGVA NVRGLVSPDY AVFEPLPEAF
NPYLLQAFRL PSLSAVFRAE SKGLGTGESG FLRLYTDRFG PIPVPYPPLD EQRLIVRFLD
WHGAQTAKLI RAKKKIIALL NEQKQAIIHR AVTRGLDPNV RLKPSGIPWL GDIPEDWEVS
RVKTEFQCLN YRRVPLSGTE RGRMTVRQYD YYGASGVIDK VDEFLFDDKL LLIAEDGANL
VLRNLPLAII AEGKFWVNNH AHILKPRRGD IRFLAAILEG LNFLPWISGA AQPKLTQDRL
MGIAIAVPPG HKQLEIIQSC DEEVSELVRA INVASKELIF IQEFRTRLIA DVVTGKLDVR
AAAASLPESA ELEATEELVE DDDLDEAIDD AENKEVAA
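
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that in Python; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, as printed in the record.
excerpt = clean_sequence("ATGATTGATG GCCTTCGCCC")
print(len(excerpt), gc_fraction(excerpt))
```

Applied to the full gene sequence, the cleaned string should be 1377 characters long, and its GC fraction can be compared with the reported GC content of 56%.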