Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3575 |
Symbol | |
ID | 7092434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3934665 |
End bp | 3937595 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643466866 |
Product | protein of unknown function DUF450 |
Protein accession | YP_002363825 |
Protein GI | 217979678 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAG ATACCTCGGA AAAGGGACTC GAGGCCCTGA TCGTCGCTGG AATGACGGGC CGCACCTCGG CGCCGTCCGG CGGCGGATTC TCCGAGGAGC CGGAGCCCTT CGTCGGCCTG CATAACTGGT TGCTCGGAAA TCCGAAGGAC TATGATCGGG CATGGACGGT CGATCTTGTG CAATTGCGCG CCTTTGTGGG CTCCACGCAA CGGCCGTTGG TGGAAGCCTT CGATCTCGAC AACGACAGCC CGGCGCGGCA GAAATTCCTT GCCCGGCTTC AAGGCGAAAT CGGCAAGCGC GGCGTCATCG ACGTTCTGCG CCACGGCGCG AAGCATGGCG CGCATGATGT GGACCTGTTC TATGGCACTC CGTCCCCGGG CAACGCCAAG GCCGCCGAAC GCTTTGCGCT GAACAGGTTC TCGGTCACGC GCCAGCTTCG CTACAGCCGT GACGATACCG CCCATGCGCT CGATCTCGCG CTGTTCATCA ATGGCTTGCC GATCGCAACG TTCGAACTGA AGAACAGCCT GACGAAACAG ACAGTCGAAG ACGCCGTTGA GCAATACAAA CGCGACCGCG ATCCGCGTGA GAAGCTCTTC GAATTCGGCC GGTGTATCGT GCATCTTGCG GTGGACGACG CGCAGGTGAA GTTCTGCACC CAGCTGAAGG GCAAGGCATC GTGGTTCCTG CCCTTCAACA AGGGCTGGAA CGATGGCGCC GGCAACCCGC CGAATCCCAC AGGCATCAAG ACCGACTATC TTTGGAAGGA TATCCTCACG CCGCTCAGCC TGACGGACAT CATCGAGAAC TATGCCCAGA TCGTTGAGCG CAAAGACCCG AAGACCAACC GGACCAAGCG GGATCAGCTT TTCCCGCGCT TTCATCAGCT CGATGTGGTG CGCAAGCTCC TCGCGGATGC GAAGGCGAAG GGCGCTGGCC GGCGCGTGCT GATCCAGCAT TCGGCGGGAT CAGGGAAATC AAATTCAATT GCGTGGCTGG CGCACCAGCT CGTGCGGTTG GCGAATGGCG GAGGTCAGGT CTTCGATTCC GTGGTCGTTG TAACCGACCG CCGAATTCTC GATCAGCAAA TCCGCGACAC CATCAAGCAG TTCGCCCAAG TTGGCGCGAC GGTCGGGCAT GCCGAGCATT CCGGCGATCT TCGCCGCTTC ATCGCCGACG GCAAGAAGAT CATCATCACC ACGGTTCAGA AGTTCCCGTT CATCCTCGAT GACATCGGCG CGCAGCACAA AGACAGACGC TTTGCGATCC TCATCGACGA GGCGCATTCC AGCCAGGGCG GCAAAGCGGC GGCGGCTTTG AACGCAGCGT TGACCGGCGC GGAAGACGGC AACGAGGACG AAACCGTCGA AGACAAGATC AATGCGATCA TGGAGCAACG GAAGATGCTC CCGAACGCAA GCTATTTCGC GTTTACAGCG ACGCCGAAGA ACAAGACGCT TGAGATATTT GGCGAGCCGT TCCCCGAAGG CGATGTCGTC AAACACCGCC CGTTCCACAG CTACACGATG AAGCAAGCGA TCCAGGAAGG CTTCATTCTG GACGTGCTTC GCTATTACAC GCCCGTTAAC AGCTACTATC GGCTGGTCAA GACGGTCGAC GAGGATCCGG AGTTCGATAC GAAACGCGCG ACAAGGAAGC TTCGCCGCTA TGTCGAGAGC AACGACCATG CCATCAGGCT CAAGGCTGAG ATCATGGTCG ATCACTTCCA CGAGCAGGTG CTCGCGTTGA ACAAGATCGG TGGCCAGGCG CGGGCGATGG TGGTGACTTC AGGAATCGAA CGCGCGATCC AGTACTATCA GGCGGTGAGC GCCTATCTGG TCGAACGCAA GAGCCCTTAT CGTGCGATCG TCGCCTTTTC GGGCGAGCAT GAATTCTGCG GAGTGAAAGT CTCCGAGGCC AGCCTCAACG GGTTTCCCTC GAAGGATATC GTCGATCAGA TCGAAACCGA TCCGTATCGA TTCCTGATCT GCGCCGACAA ATTTCAGACC GGGTACGACC AGCCGCTTCT GCATTCCATG TATGTGGACA AGGCCCTGTC GGGCATCAAA GCGGTTCAGA CCCTGTCGCG TCTCAACCGC GCACACCCCC AGAAGTACGA CACCTTCGTT CTGGATTTCA TGAACGATAC CGAGACGATC CGCGCATCGT TCGACAAGTT CTATCGAACA ACGATCCTGA GCGACGAAAC CGATCCAAAC CGGCTTCACG ATCTCAAGGC CACGCTGGAC GGGTATCAGG TCTACGATCC GGCCCAGATT GACCAGCTCG TTGGTTTGTA TCTCTCGGGC GCTGATCGCG ATCAGCTCGA TCCGATCCTC GATGCTTGCG TCACCACTTA CAACGACAGC CTCGACGAGG ACGGGCAGGT TGACTTCAAG GGCAAAGCAA AGGCATTCGC ACGGACCTAC GCATTCATTT CCGCGATTCT TCCCTACACG ACCGGAATGG GAAAAGCCCT CGATCTTATT GAACTTCTTG CTGCCAAAGC TGCCGGCGCC GCGCGAGGAA GACCTCTCCA AGGGAATTCT CGAAGCCATC GATATGGACA GCTACCGCGT GGAGAAGCAG GCCGCGCAAA GAGTGCAATT GTCCGATCAA GACGCGGAAA TCGATCCCAT CCCAGCCGAA GGCGGCGGCC ACAAGGCCGA ACCCCAACTC AATCGGCTGT CAAATATCAT TCGAAGCTTC AACGATCTCT TCGGCAACAT CACATGGGCG GACACCGATC GTATTCGTCG CCTGATCGCC ATCGAGATCC CCGACAAGGT TGCGGCCAAC GCGGCCTATC AGAACGCGAA GTTAAACTCC GACAAACAGA ACGCCCGGAT CGAACACGAC AAAGCGCTGG CTGGCGTAAT CATCGGGCTG ATGAAGGACG ACACCGAACT GTTCAAGCAG TTCAGCGATA A
|
Protein sequence | MKTDTSEKGL EALIVAGMTG RTSAPSGGGF SEEPEPFVGL HNWLLGNPKD YDRAWTVDLV QLRAFVGSTQ RPLVEAFDLD NDSPARQKFL ARLQGEIGKR GVIDVLRHGA KHGAHDVDLF YGTPSPGNAK AAERFALNRF SVTRQLRYSR DDTAHALDLA LFINGLPIAT FELKNSLTKQ TVEDAVEQYK RDRDPREKLF EFGRCIVHLA VDDAQVKFCT QLKGKASWFL PFNKGWNDGA GNPPNPTGIK TDYLWKDILT PLSLTDIIEN YAQIVERKDP KTNRTKRDQL FPRFHQLDVV RKLLADAKAK GAGRRVLIQH SAGSGKSNSI AWLAHQLVRL ANGGGQVFDS VVVVTDRRIL DQQIRDTIKQ FAQVGATVGH AEHSGDLRRF IADGKKIIIT TVQKFPFILD DIGAQHKDRR FAILIDEAHS SQGGKAAAAL NAALTGAEDG NEDETVEDKI NAIMEQRKML PNASYFAFTA TPKNKTLEIF GEPFPEGDVV KHRPFHSYTM KQAIQEGFIL DVLRYYTPVN SYYRLVKTVD EDPEFDTKRA TRKLRRYVES NDHAIRLKAE IMVDHFHEQV LALNKIGGQA RAMVVTSGIE RAIQYYQAVS AYLVERKSPY RAIVAFSGEH EFCGVKVSEA SLNGFPSKDI VDQIETDPYR FLICADKFQT GYDQPLLHSM YVDKALSGIK AVQTLSRLNR AHPQKYDTFV LDFMNDTETI RASFDKFYRT TILSDETDPN RLHDLKATLD GYQVYDPAQI DQLVGLYLSG ADRDQLDPIL DACVTTYNDS LDEDGQVDFK GKAKAFARTY AFISAILPYT TGMGKALDLI ELLAAKAAGA ARGRPLQGNS RSHRYGQLPR GEAGRAKSAI VRSRRGNRSH PSRRRRPQGR TPTQSAVKYH SKLQRSLRQH HMGGHRSYSS PDRHRDPRQG CGQRGLSERE VKLRQTERPD RTRQSAGWRN HRADEGRHRT VQAVQR
|
| |