Gene Msil_1553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1553 
Symbol 
ID7092059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1675654 
End bp1678611 
Gene Length2958 bp 
Protein Length985 aa 
Translation table11 
GC content58% 
IMG OID643464879 
Productprotein of unknown function DUF450 
Protein accessionYP_002361865 
Protein GI217977718 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGACTC ACACTGCATA CAATGTCCAC AGGGAGATCG TGCTCGAACA GCATCTGGTC 
AGCCAGTTGG TCCAGGCCCA TTGCTATATC GATCGCAGCC CGGAGGATTA CGACCGGGCG
CTTGCTCTCG ATAAAGGGCT CGCGCTTGGT TTTCTGCGCG AGACGCAACC CGATGAATGG
CAGAAATTAA AGGCACAATA CGCGGCATCG ACCGAGGCGG AGTTCTTCAA ACAGCTCGAT
AAAGCCCTCA AGACGCGCGG CGCACTTGAT GTGCTGCGCC AAGGCCTGAA GCTGATCCCC
AACATCCATT TCCGCTTCTG CTTCTTCAAG CCGGCCTCCA GCCTCAACCC CGAACTGGTG
CGGCTCTATG AGGCCAATAT TCTAAGCGTG ATCCGCCAGG TTCGCTATAG CCAGAAGAGC
GAGAACGCGC TCGACGCGGT GCTGTTCGTT AACGGCATCC CGGCTGCCAC ACTTGAGTTC
AAGAATCTAC TGACCGGCTC GACCTTCAAG CATGCGGAGA AGCAATATAA GTCTGACCGC
CCGCCGGCAG GCGAGCCGCT CCTGACCTTC CGGCGCGGCG CGCTCGTGCA TTTCGCGCTG
GATGAAGACA ACGTGTCGAT GACGACGCGT CTTCAGAACG GCAAGACCCG CTTTCTGCCC
TTCAACCGGG GTCATCAGGG CGGAGCCGGC AATCCCGACG TGCGGGACGA GTTCCGCGTC
GCCTATCTCT ATAAGGATTT GCCGGAGGGC GCGGCCGTGT TTGGCCGGGA GAAGTGGCTT
TCCGTCATTG GCCGCTTCCT GCATCTTGAG AAAGACGAGA GAAAGGAGGC GATGATCTTC
CCCCGCTTTC ATCAGCTCGA CGCGGTGACA GGGATGATGG ACCACGCCCG CACCTTCGGG
CCGGGCAACA ATTACCTGAT TCAGCATTCC GCCGGGTCCG GCAAGTCCAA TACGATCGGC
TGGACAGCGC ATCAGGCGAT CAACCTGCAT GATGCGCAGG ACCGCCCGAT CTTCGATACC
GCGATCATCG TTACCGACCG CGTCGTGCTC GACCGTCAGC TTCAGAACAC CGTCGCGCAA
TTCGAGCAGA CGAAGGGCGT AGTGAAGAAG ATCGACGGCA CTTCACGCCA GCTTAGAGAA
GCGATCCAGA GCGGCGCGCG TATAATCATC ACCACCATTC AGAAATTCGG CACTGACCAT
CTGAAGGAAG TGTCCGGCCA GGCGGGCCGC AAATTCGCGA TCTTGATCGA CGAGGCGCAC
GGCAGCCAGT CCGGCAAGAG CGCCCAGGCG CTGACCGACA CGCTGACCCG CGAGGCGACC
TCAAGCGACG ATGTGGAAGA TCTCATCGCC GAGTATCAGA AAGGCAGGGG GCCGCAGCCC
AACATCAGCG CCTTCGCCTT CACCGCGACG CCGCGCAATG TAACCCTGGA GCGCTTCGGC
ATACGCGGGC CGGACGGTCT GCCTCACCCC TTCCACCTTT ATTCCATGCG CCAAGCCATC
GAGGAGGGAT TCATCCTCGA CGTGCTGCAA AATTATATGA CCTACAAGGC CTATTATGAG
CTTGAGAAGG TCGTCGAAGA CGACCCCACC TTCAAGACTA AAAAGGCGCA ACGCAAGGTC
GCCCGCTTTG CCCATATGCA CCCGACGGCA ATTAGCCAAA AGGTCGAGGT GATCGTCGAG
CATTTTCGGC GTCACGTCAT GGCCGAGATG GCCGGCCAGG CCAAGGCGAT GGTGGTGACG
TCCAGCCGCG AAGCGGCTCT GAAATATTAT TTCGGCATGC GAGACTATAT CGAGAAGCAG
GGCTACGTCG ACATAAAGGC CCTTGTGGCT TTCTCCGGCG AGCTGGAGGT TGACGGCCGG
AAATGGACAG AGGCGGCGGT CAACCAATTT TCCGAGACAG AATTGCCGCG TCGCTTCGAC
AGCGACGAAT ACCGAGTCCT GATCGTGGCC GAGAAGTACC AGACCGGCTT TGACCAGCCT
AAACTCTGCG CCATGTATGT GGATCGGAAA CTCGCCGGGC TCCAGGCCGT GCAGACGCTC
TCCCGCCTCA ACCGCACGGC TCCCGGCAAG GAGCGAACAT ACATCCTTGA TTTTCAAAAT
GAGATCGAGG ACATCCAGGA CGCCTTCAAA CCGTTCTATG AGGTGGCGGC GCTCGAAGAG
ACCTCTGACC CGAACCAGAT TTATGAGCTG GAAGCCAAGC TCAAAACCTT CGGTGTCCTG
GACGCCGCCG AGATCGACCG CTTCGCGACA ATTTTCTACA AGGGGCCGCT CGATCCGTTC
GATCGGATCA AGCTCGAAGG CTTGGTGCGC CAGGCGGTGC AGCGTTTCGA GCTTGAGGAC
GACGAAGGCC GCCAGGAAGA GTTTCGTCAG CTCCTTAAAA GCTATATGCG CTTTTATAGC
TTCGTCGCGC AGGTGGTCAG GCTTGGCGAT ACCGAACTGG AGAAACTCTA TTCCTATGCC
GCCTGGCTGA CCCGGCTCCT GCCCGGTCGA GAGCTGCCTC CTGACATTGA GATTACGGAA
GACATGATGC GACTACATGC CTTCAAGATC GAACAGAAAG AGGCCGCCAG CGCCTCACTT
GCGCCGGGCG ATACAATTCC GCTGTTACCG ATCAAGGAGT TTGCTGCGAA GCCCTACACT
GCCGAGGAGG AGATCTCCCT GTCCGAGATC ATTCGGGCTT TCAACGACCG CCACGGCACG
CAATTCACGA AGGAGGATTT CGTTCGATTC GAGCAGGTGA ACCACGAGAT CATGGACGAA
GACATGATCG AGATGCTGCG CAGCAATCCG CCGGACGTGG TCTACTCGGC TTTCAGTCAG
GCTTTCTTCA AAGGGGCAAT CCGCATGTTT CAACGCGACG CCGAGATGAA GAACATCGTG
CTCGCCGACG CCACGGCTCG CGACCAGGCC ATTCGGCACT TCTTCGGCCG GGCGATGAGA
GAGGCGAGGG GAAGGTGA
 
Protein sequence
MTTHTAYNVH REIVLEQHLV SQLVQAHCYI DRSPEDYDRA LALDKGLALG FLRETQPDEW 
QKLKAQYAAS TEAEFFKQLD KALKTRGALD VLRQGLKLIP NIHFRFCFFK PASSLNPELV
RLYEANILSV IRQVRYSQKS ENALDAVLFV NGIPAATLEF KNLLTGSTFK HAEKQYKSDR
PPAGEPLLTF RRGALVHFAL DEDNVSMTTR LQNGKTRFLP FNRGHQGGAG NPDVRDEFRV
AYLYKDLPEG AAVFGREKWL SVIGRFLHLE KDERKEAMIF PRFHQLDAVT GMMDHARTFG
PGNNYLIQHS AGSGKSNTIG WTAHQAINLH DAQDRPIFDT AIIVTDRVVL DRQLQNTVAQ
FEQTKGVVKK IDGTSRQLRE AIQSGARIII TTIQKFGTDH LKEVSGQAGR KFAILIDEAH
GSQSGKSAQA LTDTLTREAT SSDDVEDLIA EYQKGRGPQP NISAFAFTAT PRNVTLERFG
IRGPDGLPHP FHLYSMRQAI EEGFILDVLQ NYMTYKAYYE LEKVVEDDPT FKTKKAQRKV
ARFAHMHPTA ISQKVEVIVE HFRRHVMAEM AGQAKAMVVT SSREAALKYY FGMRDYIEKQ
GYVDIKALVA FSGELEVDGR KWTEAAVNQF SETELPRRFD SDEYRVLIVA EKYQTGFDQP
KLCAMYVDRK LAGLQAVQTL SRLNRTAPGK ERTYILDFQN EIEDIQDAFK PFYEVAALEE
TSDPNQIYEL EAKLKTFGVL DAAEIDRFAT IFYKGPLDPF DRIKLEGLVR QAVQRFELED
DEGRQEEFRQ LLKSYMRFYS FVAQVVRLGD TELEKLYSYA AWLTRLLPGR ELPPDIEITE
DMMRLHAFKI EQKEAASASL APGDTIPLLP IKEFAAKPYT AEEEISLSEI IRAFNDRHGT
QFTKEDFVRF EQVNHEIMDE DMIEMLRSNP PDVVYSAFSQ AFFKGAIRMF QRDAEMKNIV
LADATARDQA IRHFFGRAMR EARGR