Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3551 |
Symbol | |
ID | 4075227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 594681 |
End bp | 596180 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005063 |
Product | type I restriction-modification system, M subunit |
Protein accession | YP_611782 |
Protein GI | 99078524 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.795147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.253327 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCGA TCACCCAATC CGAGATCAAC AAGGCCGCCT GGGGCGCATG TGACACCTTC CGGGGCGTGG TCGATCCGTC CATCTACAAG GACTACGTGC TGACCATGCT GTTCCTGAAG TACGTCAGCG ACGTCTGGAA GGACCACAAA GCCAGCTATG CGGCGCATTA TCCTGACAGC CCGGAACTCG TCGCGGCCAT GATGGAGCGG GAAACCTTCA AACTGCCTGA GACCGCCAGC TTCGACGCTC TGCATGGACG TCGCCACGAA CCCGGCAACG GAGAACGTAT CGACAAGGCC CTCCACGCCA TCGAAGAGGC CAACGGCTCC AAGCTGCGCG ACGTCTTCCA GGACATCAGC TTCAACTCCA ACAAGCTGGG TGATGAGGAG CAGAAGAACG ACATCCTGCG CCACCTGCTG GAGGACTTCG CCAAGACCGC GCTCGACCTG CGCCCATCGA GAGTGGGTAA CCTCGACATC ATTGGCGGGG CCTACGAATA CCTGATCTCG CGCTTTGCGG CCACAGCGGG CAAGAAGGCG GGCGAGTTCT ACACCCCGGC GGAGGTCTCG GAGCTGATGG CGCGGCTGGT CGATCCACAG CCTGGTGACG ATATCTGCGA CCCCACTTGT GGCTCCGCCT CGCTGTTGAT GAAATGCGGG CGGCTGATCC GCGAGGGCGG TAGCAAAGCC TATGCGCTGT TCGGGCAAGA GGCCATCGGG TCCACCTGGG CGCTGGCCAA GATGAACCTC TTCCTGCATG GCGAGGAGAA CCACCAAATC GAATGGGGCG ACACCATCCG CAACCCCAAG CTGCGGACGT CGGATGACAT GCTGCGCCAT TTTGACGTGG TCGTCGCCAA TCCGCCTTTC AGCCTGGACA AGTGGGGGGT CGAGAGCGCA GAAGCCGACA AGTTTGCCCG CTTCCGCCGT GGCATCCCGC CCAAGACCAA GGGCGATTAT GCCTTCATCT TGCACATGAT CGAGACGCTG AAGCCCAAGA CCGGGCGCAT GGCCGTGGTT GTGCCACATG GGGTGCTGTT CCGCGGATCG AGCGAGGGAA AGATCCGCCA CAAGCTGATT GAGGACAACC TGCTGGACGC CGTCATCGGC CTGCCGGAAA AGCTGTTCTT CGGTACCGGC ATTCCATCGG CGATTTTGGT CTTCCGCAAG GACAAGGCTG ACGACAGCGT GTTGTTCGTC GATGCCAGCC GGGAGTTTGT GGCAGGCACC AACCAGAACG CGCTGGATAT GACACTGATC GAGAAGATCG TGGCCACACA TCAGACGCGG CAGACGGTGG AGAAGTACGC ATACCGTGCC ACACTGGCTG AGATCATCGA GAACGATTTC AACCTCAACA TCCCCCGCTA CGTGGACACA TTCGAGGAAG AGGAAGAAAT CGACCTGATG GCCGTGCGGG CTGAACGGAT GAAGTTGAAG GGCGAAATGG CCGAGCTGGA AGACCGGATG GAAGGCTATC TGCAGGAGCT GGGTTACTGA
|
Protein sequence | MTPITQSEIN KAAWGACDTF RGVVDPSIYK DYVLTMLFLK YVSDVWKDHK ASYAAHYPDS PELVAAMMER ETFKLPETAS FDALHGRRHE PGNGERIDKA LHAIEEANGS KLRDVFQDIS FNSNKLGDEE QKNDILRHLL EDFAKTALDL RPSRVGNLDI IGGAYEYLIS RFAATAGKKA GEFYTPAEVS ELMARLVDPQ PGDDICDPTC GSASLLMKCG RLIREGGSKA YALFGQEAIG STWALAKMNL FLHGEENHQI EWGDTIRNPK LRTSDDMLRH FDVVVANPPF SLDKWGVESA EADKFARFRR GIPPKTKGDY AFILHMIETL KPKTGRMAVV VPHGVLFRGS SEGKIRHKLI EDNLLDAVIG LPEKLFFGTG IPSAILVFRK DKADDSVLFV DASREFVAGT NQNALDMTLI EKIVATHQTR QTVEKYAYRA TLAEIIENDF NLNIPRYVDT FEEEEEIDLM AVRAERMKLK GEMAELEDRM EGYLQELGY
|
| |