Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1553 |
Symbol | |
ID | 7092059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1675654 |
End bp | 1678611 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643464879 |
Product | protein of unknown function DUF450 |
Protein accession | YP_002361865 |
Protein GI | 217977718 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGACTC ACACTGCATA CAATGTCCAC AGGGAGATCG TGCTCGAACA GCATCTGGTC AGCCAGTTGG TCCAGGCCCA TTGCTATATC GATCGCAGCC CGGAGGATTA CGACCGGGCG CTTGCTCTCG ATAAAGGGCT CGCGCTTGGT TTTCTGCGCG AGACGCAACC CGATGAATGG CAGAAATTAA AGGCACAATA CGCGGCATCG ACCGAGGCGG AGTTCTTCAA ACAGCTCGAT AAAGCCCTCA AGACGCGCGG CGCACTTGAT GTGCTGCGCC AAGGCCTGAA GCTGATCCCC AACATCCATT TCCGCTTCTG CTTCTTCAAG CCGGCCTCCA GCCTCAACCC CGAACTGGTG CGGCTCTATG AGGCCAATAT TCTAAGCGTG ATCCGCCAGG TTCGCTATAG CCAGAAGAGC GAGAACGCGC TCGACGCGGT GCTGTTCGTT AACGGCATCC CGGCTGCCAC ACTTGAGTTC AAGAATCTAC TGACCGGCTC GACCTTCAAG CATGCGGAGA AGCAATATAA GTCTGACCGC CCGCCGGCAG GCGAGCCGCT CCTGACCTTC CGGCGCGGCG CGCTCGTGCA TTTCGCGCTG GATGAAGACA ACGTGTCGAT GACGACGCGT CTTCAGAACG GCAAGACCCG CTTTCTGCCC TTCAACCGGG GTCATCAGGG CGGAGCCGGC AATCCCGACG TGCGGGACGA GTTCCGCGTC GCCTATCTCT ATAAGGATTT GCCGGAGGGC GCGGCCGTGT TTGGCCGGGA GAAGTGGCTT TCCGTCATTG GCCGCTTCCT GCATCTTGAG AAAGACGAGA GAAAGGAGGC GATGATCTTC CCCCGCTTTC ATCAGCTCGA CGCGGTGACA GGGATGATGG ACCACGCCCG CACCTTCGGG CCGGGCAACA ATTACCTGAT TCAGCATTCC GCCGGGTCCG GCAAGTCCAA TACGATCGGC TGGACAGCGC ATCAGGCGAT CAACCTGCAT GATGCGCAGG ACCGCCCGAT CTTCGATACC GCGATCATCG TTACCGACCG CGTCGTGCTC GACCGTCAGC TTCAGAACAC CGTCGCGCAA TTCGAGCAGA CGAAGGGCGT AGTGAAGAAG ATCGACGGCA CTTCACGCCA GCTTAGAGAA GCGATCCAGA GCGGCGCGCG TATAATCATC ACCACCATTC AGAAATTCGG CACTGACCAT CTGAAGGAAG TGTCCGGCCA GGCGGGCCGC AAATTCGCGA TCTTGATCGA CGAGGCGCAC GGCAGCCAGT CCGGCAAGAG CGCCCAGGCG CTGACCGACA CGCTGACCCG CGAGGCGACC TCAAGCGACG ATGTGGAAGA TCTCATCGCC GAGTATCAGA AAGGCAGGGG GCCGCAGCCC AACATCAGCG CCTTCGCCTT CACCGCGACG CCGCGCAATG TAACCCTGGA GCGCTTCGGC ATACGCGGGC CGGACGGTCT GCCTCACCCC TTCCACCTTT ATTCCATGCG CCAAGCCATC GAGGAGGGAT TCATCCTCGA CGTGCTGCAA AATTATATGA CCTACAAGGC CTATTATGAG CTTGAGAAGG TCGTCGAAGA CGACCCCACC TTCAAGACTA AAAAGGCGCA ACGCAAGGTC GCCCGCTTTG CCCATATGCA CCCGACGGCA ATTAGCCAAA AGGTCGAGGT GATCGTCGAG CATTTTCGGC GTCACGTCAT GGCCGAGATG GCCGGCCAGG CCAAGGCGAT GGTGGTGACG TCCAGCCGCG AAGCGGCTCT GAAATATTAT TTCGGCATGC GAGACTATAT CGAGAAGCAG GGCTACGTCG ACATAAAGGC CCTTGTGGCT TTCTCCGGCG AGCTGGAGGT TGACGGCCGG AAATGGACAG AGGCGGCGGT CAACCAATTT TCCGAGACAG AATTGCCGCG TCGCTTCGAC AGCGACGAAT ACCGAGTCCT GATCGTGGCC GAGAAGTACC AGACCGGCTT TGACCAGCCT AAACTCTGCG CCATGTATGT GGATCGGAAA CTCGCCGGGC TCCAGGCCGT GCAGACGCTC TCCCGCCTCA ACCGCACGGC TCCCGGCAAG GAGCGAACAT ACATCCTTGA TTTTCAAAAT GAGATCGAGG ACATCCAGGA CGCCTTCAAA CCGTTCTATG AGGTGGCGGC GCTCGAAGAG ACCTCTGACC CGAACCAGAT TTATGAGCTG GAAGCCAAGC TCAAAACCTT CGGTGTCCTG GACGCCGCCG AGATCGACCG CTTCGCGACA ATTTTCTACA AGGGGCCGCT CGATCCGTTC GATCGGATCA AGCTCGAAGG CTTGGTGCGC CAGGCGGTGC AGCGTTTCGA GCTTGAGGAC GACGAAGGCC GCCAGGAAGA GTTTCGTCAG CTCCTTAAAA GCTATATGCG CTTTTATAGC TTCGTCGCGC AGGTGGTCAG GCTTGGCGAT ACCGAACTGG AGAAACTCTA TTCCTATGCC GCCTGGCTGA CCCGGCTCCT GCCCGGTCGA GAGCTGCCTC CTGACATTGA GATTACGGAA GACATGATGC GACTACATGC CTTCAAGATC GAACAGAAAG AGGCCGCCAG CGCCTCACTT GCGCCGGGCG ATACAATTCC GCTGTTACCG ATCAAGGAGT TTGCTGCGAA GCCCTACACT GCCGAGGAGG AGATCTCCCT GTCCGAGATC ATTCGGGCTT TCAACGACCG CCACGGCACG CAATTCACGA AGGAGGATTT CGTTCGATTC GAGCAGGTGA ACCACGAGAT CATGGACGAA GACATGATCG AGATGCTGCG CAGCAATCCG CCGGACGTGG TCTACTCGGC TTTCAGTCAG GCTTTCTTCA AAGGGGCAAT CCGCATGTTT CAACGCGACG CCGAGATGAA GAACATCGTG CTCGCCGACG CCACGGCTCG CGACCAGGCC ATTCGGCACT TCTTCGGCCG GGCGATGAGA GAGGCGAGGG GAAGGTGA
|
Protein sequence | MTTHTAYNVH REIVLEQHLV SQLVQAHCYI DRSPEDYDRA LALDKGLALG FLRETQPDEW QKLKAQYAAS TEAEFFKQLD KALKTRGALD VLRQGLKLIP NIHFRFCFFK PASSLNPELV RLYEANILSV IRQVRYSQKS ENALDAVLFV NGIPAATLEF KNLLTGSTFK HAEKQYKSDR PPAGEPLLTF RRGALVHFAL DEDNVSMTTR LQNGKTRFLP FNRGHQGGAG NPDVRDEFRV AYLYKDLPEG AAVFGREKWL SVIGRFLHLE KDERKEAMIF PRFHQLDAVT GMMDHARTFG PGNNYLIQHS AGSGKSNTIG WTAHQAINLH DAQDRPIFDT AIIVTDRVVL DRQLQNTVAQ FEQTKGVVKK IDGTSRQLRE AIQSGARIII TTIQKFGTDH LKEVSGQAGR KFAILIDEAH GSQSGKSAQA LTDTLTREAT SSDDVEDLIA EYQKGRGPQP NISAFAFTAT PRNVTLERFG IRGPDGLPHP FHLYSMRQAI EEGFILDVLQ NYMTYKAYYE LEKVVEDDPT FKTKKAQRKV ARFAHMHPTA ISQKVEVIVE HFRRHVMAEM AGQAKAMVVT SSREAALKYY FGMRDYIEKQ GYVDIKALVA FSGELEVDGR KWTEAAVNQF SETELPRRFD SDEYRVLIVA EKYQTGFDQP KLCAMYVDRK LAGLQAVQTL SRLNRTAPGK ERTYILDFQN EIEDIQDAFK PFYEVAALEE TSDPNQIYEL EAKLKTFGVL DAAEIDRFAT IFYKGPLDPF DRIKLEGLVR QAVQRFELED DEGRQEEFRQ LLKSYMRFYS FVAQVVRLGD TELEKLYSYA AWLTRLLPGR ELPPDIEITE DMMRLHAFKI EQKEAASASL APGDTIPLLP IKEFAAKPYT AEEEISLSEI IRAFNDRHGT QFTKEDFVRF EQVNHEIMDE DMIEMLRSNP PDVVYSAFSQ AFFKGAIRMF QRDAEMKNIV LADATARDQA IRHFFGRAMR EARGR
|
| |