Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3537 |
Symbol | |
ID | 4075215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 572581 |
End bp | 575973 |
Gene Length | 3393 bp |
Protein Length | 1130 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 638005051 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_611770 |
Protein GI | 99078512 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.244688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAT TCGTTTTCTT GACTGCAGAT TTCCCCGACC TTCTAGCCCA CGCGAAAAAG GCTGAGAGTG CGGCACTATC CGACCCACGA GGCGCATGCT TCTGGGCGCG GTTGACCTTG GAAACAGCGA TAAAATGGAT TTATCGCAAC GACCCGTCGA TGCGCTCGCC CTACCAGGAC ACTCTAGCGG CATTGATTGC AGAGCCGTCG CTCGGCCAAC TCACTGGCCC GGCTATCGTC ACCAAAGCGC GATTTATCAA AGATCATGGC AACCGGGCGG CGCATGACAG CGGTAAGCCG ATCAAGCCTC AGGACGCCGC CGCTGTTGTG CGTGAGCTGT TCCATGTCTG CTATTGGATG GCGCGGACCT ATGCAAAGGA TGCGAAGCCA GACTCTTCGT TGCAGTTCGA TGCCTCAAAG TTGGAAAAAA CCCTGACGAT TAGTGCCAGC ACAGTTGCAC AAATCCAGGC ATTGAAAGAG AAGCACGACG CCCATGCCAA AGCGCTAAAG GACGCTGAAG CCGCCGCGCT CGCATCTGAG GAGGGTCGAA AGAGGCTCGA GGCTGAACTC GCGCAAGTGC GGGCGGAAAT CAAAGAAATT CGCCAGGCCA ACACTGCGGT ACAAGATGAC CACGACTACA ATGAGGCCGC GACGCGCGAT GCTTTCATTG ATCTGCTGCT GAACGAGGCT GGGTGGCCGC TCGACCAGGA GCGCGACCGT GAATTTCCGG TATCAGGTAT GCCTAATGAC AAAGGTGAAG GCCTTGTCGA TTATGTGCTT TGGGGAAACG ATGCCAAGCC ACTCGCCTTG GTCGAAGCGA AACGAACCAT GAAGGACAGC CGGATAGGGC AGCAGCAAGC CAAGCTTTAC GCCGATTGTC TTGAAACGAT GTATGGGCGC CGTCCGGTGA TTTTCACGAC GAACGGCTAT GAACATTGGC TCTGGGACGA CCAGATGTAC CCCCCGCGTC GTGTATCGGG CTTCCTGAAA AGGGATGAAC TCGTCCTGCT TCACCAACGT AGAGGCACCA GAAAATCGTT AGATGGGGTG TCCGTAGACC AGGGTATTGC TGGGCGCTTT TACCAACAGC GAGCCATTCG CCGGGTAGGC GAGGCTTTTG AACGAGACCG ACAGCGGAAG GCCTTGCTTG TAATGGCGAC TGGCAGTGGC AAGACGCGAA CGGTCATCGC GCTTATCGAC CAGCTCATGC GCGCCAATTG GGTGCGGCGT GTGCTCTTCC TCGCGGATCG CGTTGCTCTG GTGAAACAGG CACACAATGC CTTCAAGACA CACTTGCCGA CGGCGGCAAG TGCAAACCTG CTGAAGAATC ATGACCCGGC CAGGAACGAT CATTCTGGTG CTCGTATTTG TCTATCGACC TATCCCACGA TGTTGGGGTT GATCGACGAC GTCAAAGGCG GTGAAAAGCA GTTTGGCCCC GGCCACTTTG ACTTGATCGT GATCGACGAA GCGCACCGTT CTGTCTATCG GAAGTACCGT GCCATCTTCG ACTATTTTGA TGGACTCTTG GTCGGCCTCA CGGCGACGCC GCGCGAAGAG ATTGACCGAG ACACATACTC GCTCTTTGAG CTGGAACGCG GAATACCAAC CGACAGTTAT GACTTAGAGG ATGCGGTGGC GGATGAATAT CTCGTACCGC CTAGATCGAT CTCGGTCCCT TTGAAGTTCC AACGTGACGG TATCGACTAC GACAGCCTTT CTGACGAAGA AAAAGCCGAG TGGGACGCAA TCGAGTGGGA TGATGAGGGC GCGGTTCCTG ATCGGGTGGA GGCGGCTGAC TTGAACCGCT GGCTGTTCAA CAAGGATACA GTGGACAAAG TCCTTGAGCA TCTAATGACC AACGGCATCA AAGTCGCCGG AGGCGACAGG CTTGGAAAGA CAATCATTTT CGCTAAAAAC AGCGATCATG CGAAGTTCAT TGTCGAGCGC TTCGATGCCA ACTACCCACA TTACATGGGT CAGTTTGCAC GTCTCATCGA CTACAGTGTC ACCTATGCGC AATCGTTGAT CGACGATTTC TCAGAAGCGG AGGAAGCCCC CCATATCGCG GTCTCTGTCG ATATGTTGGA CACAGGCATT GATGTGCCCG AGGTCGTGAA TCTCGTTTTT TTCAAGATTG TTCGTTCTAA AACGAAGTTC TGGCAAATGG TTGGTAGAGG CACCCGAACT TGCGAGGATC TGTTCGGGCC AGGGCAAGAC AAACAAGAGT TCATCATCTT TGACTTCTGC CAAAACTTAG AGTTCTTCAA CGAGAACCCG AAAATCACAG ATAGCCCTAC TGCGCGACCA ATTGGCGAAC GGTTGTTTGT GTCGCGTGTC GAGTTGATTA GCGCTTTGGA CGAAGCCGGG AACGAAAGCC CCTTGGCTGC CGACATTAAG GACCGTTTGC ATCAGGAAGT TCTCGGCATG AACTTAGAGA ACTTCATCGT ACGTCAAGCG CGCCGACCGG TCGAACGATT TCAAAGCGCC GAAGCTTGGG ACACTTTGGA CCTCGATGCA CGTCTAGCTT TGATTGAAGA AGTTGCCGGT CTTCCTACAG CTTTCGAAGA CGGCCGACTC GCAGCAAAGC AGTTCGATCT ATTGGTGCTA AATGCCCAAC TACTTCTACT TCGCGGTGAC GCGGCATTCG CCAATCTTCA AAGACGGATC GTGTCCTTTG CTTCCGCCTT GGAAAGCCTT TCGAATGTTC CGGCAGTGAC CCAAGAGCTT GAGCTCGTAT TGGCGATCCA AACTGATGAG TTTTGGCAGG ATATCAATGT TGAAATCCTA GAAGACGTCC GGCGACGCTT CCGCAACTTG GCAGAGCTAA TCCAGCCGAA GGAACGAAAA AACGTTATCA CGGATTTTGA AGACAGCATC GGCACCTCAA CCACTATCGA CTTGCCCGAA GTGGGAAGCG GCGTGGATAA GGTGCGTTTC AAAACCAAGA CACGGAAATT CATCGAAGCC CATTCTGACC ACATCGCTCT CCAAAAAATT CGGCGCGGCG AACAACTTAC GCCCGCCGAT CTTCAGGAGC TAGAACGTAT GCTGATTGAT GAAGGCGTAG CCGACCATGA CGTTCTGGAT GGCCTGCAGA ATGAAGGTGG TCTCGGAGTG TTTCTGCGGT CGCTTACGGG GCTTGATCGA GCCGCAGCGA AAGCCTTGTT CAGCGATTTC GCGGCAGCCA ACCAACTTTC GGCGAACCAA ACAGAATTCA TCGATCTAAT CATCAACAGC CTTTGCGAAA ATGGCGTTTT GGACCCCAAG ACCTTCTATG AGAGTCCGTT CACTGACCTG GATGACATGG GGATTATGGG TGTTTTCAGT GAAACTCAAT CGGCCGAGAT AATCAGACTG GTCAGAGAAA CCAATCAAAC CGCTGCAGCC TAA
|
Protein sequence | MTQFVFLTAD FPDLLAHAKK AESAALSDPR GACFWARLTL ETAIKWIYRN DPSMRSPYQD TLAALIAEPS LGQLTGPAIV TKARFIKDHG NRAAHDSGKP IKPQDAAAVV RELFHVCYWM ARTYAKDAKP DSSLQFDASK LEKTLTISAS TVAQIQALKE KHDAHAKALK DAEAAALASE EGRKRLEAEL AQVRAEIKEI RQANTAVQDD HDYNEAATRD AFIDLLLNEA GWPLDQERDR EFPVSGMPND KGEGLVDYVL WGNDAKPLAL VEAKRTMKDS RIGQQQAKLY ADCLETMYGR RPVIFTTNGY EHWLWDDQMY PPRRVSGFLK RDELVLLHQR RGTRKSLDGV SVDQGIAGRF YQQRAIRRVG EAFERDRQRK ALLVMATGSG KTRTVIALID QLMRANWVRR VLFLADRVAL VKQAHNAFKT HLPTAASANL LKNHDPARND HSGARICLST YPTMLGLIDD VKGGEKQFGP GHFDLIVIDE AHRSVYRKYR AIFDYFDGLL VGLTATPREE IDRDTYSLFE LERGIPTDSY DLEDAVADEY LVPPRSISVP LKFQRDGIDY DSLSDEEKAE WDAIEWDDEG AVPDRVEAAD LNRWLFNKDT VDKVLEHLMT NGIKVAGGDR LGKTIIFAKN SDHAKFIVER FDANYPHYMG QFARLIDYSV TYAQSLIDDF SEAEEAPHIA VSVDMLDTGI DVPEVVNLVF FKIVRSKTKF WQMVGRGTRT CEDLFGPGQD KQEFIIFDFC QNLEFFNENP KITDSPTARP IGERLFVSRV ELISALDEAG NESPLAADIK DRLHQEVLGM NLENFIVRQA RRPVERFQSA EAWDTLDLDA RLALIEEVAG LPTAFEDGRL AAKQFDLLVL NAQLLLLRGD AAFANLQRRI VSFASALESL SNVPAVTQEL ELVLAIQTDE FWQDINVEIL EDVRRRFRNL AELIQPKERK NVITDFEDSI GTSTTIDLPE VGSGVDKVRF KTKTRKFIEA HSDHIALQKI RRGEQLTPAD LQELERMLID EGVADHDVLD GLQNEGGLGV FLRSLTGLDR AAAKALFSDF AAANQLSANQ TEFIDLIINS LCENGVLDPK TFYESPFTDL DDMGIMGVFS ETQSAEIIRL VRETNQTAAA
|
| |