Gene Msil_3575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3575 
Symbol 
ID7092434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3934665 
End bp3937595 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content58% 
IMG OID643466866 
Productprotein of unknown function DUF450 
Protein accessionYP_002363825 
Protein GI217979678 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAG ATACCTCGGA AAAGGGACTC GAGGCCCTGA TCGTCGCTGG AATGACGGGC 
CGCACCTCGG CGCCGTCCGG CGGCGGATTC TCCGAGGAGC CGGAGCCCTT CGTCGGCCTG
CATAACTGGT TGCTCGGAAA TCCGAAGGAC TATGATCGGG CATGGACGGT CGATCTTGTG
CAATTGCGCG CCTTTGTGGG CTCCACGCAA CGGCCGTTGG TGGAAGCCTT CGATCTCGAC
AACGACAGCC CGGCGCGGCA GAAATTCCTT GCCCGGCTTC AAGGCGAAAT CGGCAAGCGC
GGCGTCATCG ACGTTCTGCG CCACGGCGCG AAGCATGGCG CGCATGATGT GGACCTGTTC
TATGGCACTC CGTCCCCGGG CAACGCCAAG GCCGCCGAAC GCTTTGCGCT GAACAGGTTC
TCGGTCACGC GCCAGCTTCG CTACAGCCGT GACGATACCG CCCATGCGCT CGATCTCGCG
CTGTTCATCA ATGGCTTGCC GATCGCAACG TTCGAACTGA AGAACAGCCT GACGAAACAG
ACAGTCGAAG ACGCCGTTGA GCAATACAAA CGCGACCGCG ATCCGCGTGA GAAGCTCTTC
GAATTCGGCC GGTGTATCGT GCATCTTGCG GTGGACGACG CGCAGGTGAA GTTCTGCACC
CAGCTGAAGG GCAAGGCATC GTGGTTCCTG CCCTTCAACA AGGGCTGGAA CGATGGCGCC
GGCAACCCGC CGAATCCCAC AGGCATCAAG ACCGACTATC TTTGGAAGGA TATCCTCACG
CCGCTCAGCC TGACGGACAT CATCGAGAAC TATGCCCAGA TCGTTGAGCG CAAAGACCCG
AAGACCAACC GGACCAAGCG GGATCAGCTT TTCCCGCGCT TTCATCAGCT CGATGTGGTG
CGCAAGCTCC TCGCGGATGC GAAGGCGAAG GGCGCTGGCC GGCGCGTGCT GATCCAGCAT
TCGGCGGGAT CAGGGAAATC AAATTCAATT GCGTGGCTGG CGCACCAGCT CGTGCGGTTG
GCGAATGGCG GAGGTCAGGT CTTCGATTCC GTGGTCGTTG TAACCGACCG CCGAATTCTC
GATCAGCAAA TCCGCGACAC CATCAAGCAG TTCGCCCAAG TTGGCGCGAC GGTCGGGCAT
GCCGAGCATT CCGGCGATCT TCGCCGCTTC ATCGCCGACG GCAAGAAGAT CATCATCACC
ACGGTTCAGA AGTTCCCGTT CATCCTCGAT GACATCGGCG CGCAGCACAA AGACAGACGC
TTTGCGATCC TCATCGACGA GGCGCATTCC AGCCAGGGCG GCAAAGCGGC GGCGGCTTTG
AACGCAGCGT TGACCGGCGC GGAAGACGGC AACGAGGACG AAACCGTCGA AGACAAGATC
AATGCGATCA TGGAGCAACG GAAGATGCTC CCGAACGCAA GCTATTTCGC GTTTACAGCG
ACGCCGAAGA ACAAGACGCT TGAGATATTT GGCGAGCCGT TCCCCGAAGG CGATGTCGTC
AAACACCGCC CGTTCCACAG CTACACGATG AAGCAAGCGA TCCAGGAAGG CTTCATTCTG
GACGTGCTTC GCTATTACAC GCCCGTTAAC AGCTACTATC GGCTGGTCAA GACGGTCGAC
GAGGATCCGG AGTTCGATAC GAAACGCGCG ACAAGGAAGC TTCGCCGCTA TGTCGAGAGC
AACGACCATG CCATCAGGCT CAAGGCTGAG ATCATGGTCG ATCACTTCCA CGAGCAGGTG
CTCGCGTTGA ACAAGATCGG TGGCCAGGCG CGGGCGATGG TGGTGACTTC AGGAATCGAA
CGCGCGATCC AGTACTATCA GGCGGTGAGC GCCTATCTGG TCGAACGCAA GAGCCCTTAT
CGTGCGATCG TCGCCTTTTC GGGCGAGCAT GAATTCTGCG GAGTGAAAGT CTCCGAGGCC
AGCCTCAACG GGTTTCCCTC GAAGGATATC GTCGATCAGA TCGAAACCGA TCCGTATCGA
TTCCTGATCT GCGCCGACAA ATTTCAGACC GGGTACGACC AGCCGCTTCT GCATTCCATG
TATGTGGACA AGGCCCTGTC GGGCATCAAA GCGGTTCAGA CCCTGTCGCG TCTCAACCGC
GCACACCCCC AGAAGTACGA CACCTTCGTT CTGGATTTCA TGAACGATAC CGAGACGATC
CGCGCATCGT TCGACAAGTT CTATCGAACA ACGATCCTGA GCGACGAAAC CGATCCAAAC
CGGCTTCACG ATCTCAAGGC CACGCTGGAC GGGTATCAGG TCTACGATCC GGCCCAGATT
GACCAGCTCG TTGGTTTGTA TCTCTCGGGC GCTGATCGCG ATCAGCTCGA TCCGATCCTC
GATGCTTGCG TCACCACTTA CAACGACAGC CTCGACGAGG ACGGGCAGGT TGACTTCAAG
GGCAAAGCAA AGGCATTCGC ACGGACCTAC GCATTCATTT CCGCGATTCT TCCCTACACG
ACCGGAATGG GAAAAGCCCT CGATCTTATT GAACTTCTTG CTGCCAAAGC TGCCGGCGCC
GCGCGAGGAA GACCTCTCCA AGGGAATTCT CGAAGCCATC GATATGGACA GCTACCGCGT
GGAGAAGCAG GCCGCGCAAA GAGTGCAATT GTCCGATCAA GACGCGGAAA TCGATCCCAT
CCCAGCCGAA GGCGGCGGCC ACAAGGCCGA ACCCCAACTC AATCGGCTGT CAAATATCAT
TCGAAGCTTC AACGATCTCT TCGGCAACAT CACATGGGCG GACACCGATC GTATTCGTCG
CCTGATCGCC ATCGAGATCC CCGACAAGGT TGCGGCCAAC GCGGCCTATC AGAACGCGAA
GTTAAACTCC GACAAACAGA ACGCCCGGAT CGAACACGAC AAAGCGCTGG CTGGCGTAAT
CATCGGGCTG ATGAAGGACG ACACCGAACT GTTCAAGCAG TTCAGCGATA A
 
Protein sequence
MKTDTSEKGL EALIVAGMTG RTSAPSGGGF SEEPEPFVGL HNWLLGNPKD YDRAWTVDLV 
QLRAFVGSTQ RPLVEAFDLD NDSPARQKFL ARLQGEIGKR GVIDVLRHGA KHGAHDVDLF
YGTPSPGNAK AAERFALNRF SVTRQLRYSR DDTAHALDLA LFINGLPIAT FELKNSLTKQ
TVEDAVEQYK RDRDPREKLF EFGRCIVHLA VDDAQVKFCT QLKGKASWFL PFNKGWNDGA
GNPPNPTGIK TDYLWKDILT PLSLTDIIEN YAQIVERKDP KTNRTKRDQL FPRFHQLDVV
RKLLADAKAK GAGRRVLIQH SAGSGKSNSI AWLAHQLVRL ANGGGQVFDS VVVVTDRRIL
DQQIRDTIKQ FAQVGATVGH AEHSGDLRRF IADGKKIIIT TVQKFPFILD DIGAQHKDRR
FAILIDEAHS SQGGKAAAAL NAALTGAEDG NEDETVEDKI NAIMEQRKML PNASYFAFTA
TPKNKTLEIF GEPFPEGDVV KHRPFHSYTM KQAIQEGFIL DVLRYYTPVN SYYRLVKTVD
EDPEFDTKRA TRKLRRYVES NDHAIRLKAE IMVDHFHEQV LALNKIGGQA RAMVVTSGIE
RAIQYYQAVS AYLVERKSPY RAIVAFSGEH EFCGVKVSEA SLNGFPSKDI VDQIETDPYR
FLICADKFQT GYDQPLLHSM YVDKALSGIK AVQTLSRLNR AHPQKYDTFV LDFMNDTETI
RASFDKFYRT TILSDETDPN RLHDLKATLD GYQVYDPAQI DQLVGLYLSG ADRDQLDPIL
DACVTTYNDS LDEDGQVDFK GKAKAFARTY AFISAILPYT TGMGKALDLI ELLAAKAAGA
ARGRPLQGNS RSHRYGQLPR GEAGRAKSAI VRSRRGNRSH PSRRRRPQGR TPTQSAVKYH
SKLQRSLRQH HMGGHRSYSS PDRHRDPRQG CGQRGLSERE VKLRQTERPD RTRQSAGWRN
HRADEGRHRT VQAVQR