Gene Information |
Locus tag | TM1040_3550 |
Symbol | |
ID | 4075226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 593428 |
End bp | 594681 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 638005062 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_611781 |
Protein GI | 99078523 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
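The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The page states neither convention, but both agree with the values listed. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 593428
END_BP = 594681
GENE_LENGTH_BP = 1254
PROTEIN_LENGTH_AA = 417


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not affect these checks, since the length of a feature is the same on either strand.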
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.257959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.256697 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCG TGTCGAAGGC AAGCGCTTTT CCTGCAACTG TCCAGCCCGG AATACCGAAA CTCGGGAAGA CGCCAGAAGG GTGGTTACGC GCTCCATTGT CCCGCTTTCT TGTGGAAGTG CGCAGGCCTA TCAAGATGGC TGACAATGAG GCCTATCGTT TGGTGACGGT CAAACGTGCT CGTGGCGGTG TGGTTGAGCG CGGGACGCTG GATGGCAGAG AAATCTCAGT CAAGTCGCAA TTCATCGTTG AAGGCGGTGA TTTCCTGATC TCCAAGCGTC AGATCGTTCA CGGGGCCTGC GGACTGGTGC CTCAAGAGCT GGCCGGTTCT GTCGTTTCCA ACGAGTACTC GGTTTTGAAC AGCAACGGAA ACATTGACCT TCAGTTCTTG AACTACCTCG CGCACACGGT GTTCTTTCAA CAGACCTGTT TCCATTCAAG CATTGGCGTC CATGTCGAGA AAATGATTTT CAAGCTCGAC CGTTGGCTCA AGTGGGAGTT TGATCTGCCC CCACTCTCTG AGCAACGCAA GATCGTGGAG ATCCTGTCGA CCTGGGACCG GGCGATTGAG GTAGCCGAGG CGCAGCTGGC CAATGCCCGC AAACAGAAAC GCGCCCTCAT GCAGTCCCTG CTGACTGGCA AACGCCGCTT CCCAGAGTTT GAGGGACAGG AGTGGCGTGA AGTGTGGTTG GCGGACCTGG TTTCGGCCAT TCGAGGAGGA GGAACACCAG ACAAGAGTAA CACCGCATAT TGGGGAGGAG AGATTCCTTG GGTGAGCGTC AAGGACCTCA AATCTGATGT TCTTCAGCAG ACTAAAGACA CGATCACTCA ATCAGGCTTG AACAGCAGTG CAGCCAATTA CTTCCCTAAG GGCACTATTG TTGTCGCTAC GAGAATGGCA GTTGGCGCTG CCGTTCAACT GGGCAAAGGG ATGGCTATCA ACCAGGACTT GAAGGCGATC ATCCCCGGGC CAGATGTCCG GAACGATTAT CTTTTCCACT TCATGCAAAT GGTACAGCCA AAGTTGGAAG CTCTCGGCAC TGGAAGTACA GTCAAAGGCA TCACTTTGGG TGATCTGCAT CGCCTTGTCA TTGGGCTTCC CGCGACCTTG GAGGAACAAG ACAAGATCGT TCAAATGCTT GATGTCGCCA GGAAAGATAT CTCGTCTATG TGTGTAAATA TTGGAAAGCT CCGTGCCGAG AAGAAAGCGT TGATGCAACA GCTCCTGACT GGAAAACGCC GGGTCACAGG TTGA
|
Protein sequence | MNAVSKASAF PATVQPGIPK LGKTPEGWLR APLSRFLVEV RRPIKMADNE AYRLVTVKRA RGGVVERGTL DGREISVKSQ FIVEGGDFLI SKRQIVHGAC GLVPQELAGS VVSNEYSVLN SNGNIDLQFL NYLAHTVFFQ QTCFHSSIGV HVEKMIFKLD RWLKWEFDLP PLSEQRKIVE ILSTWDRAIE VAEAQLANAR KQKRALMQSL LTGKRRFPEF EGQEWREVWL ADLVSAIRGG GTPDKSNTAY WGGEIPWVSV KDLKSDVLQQ TKDTITQSGL NSSAANYFPK GTIVVATRMA VGAAVQLGKG MAINQDLKAI IPGPDVRNDY LFHFMQMVQP KLEALGTGST VKGITLGDLH RLVIGLPATL EEQDKIVQML DVARKDISSM CVNIGKLRAE KKALMQQLLT GKRRVTG
|
| |