Gene TM1040_3537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3537 
Symbol 
ID4075215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp572581 
End bp575973 
Gene Length3393 bp 
Protein Length1130 aa 
Translation table11 
GC content53% 
IMG OID638005051 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_611770 
Protein GI99078512 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.244688 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAAT TCGTTTTCTT GACTGCAGAT TTCCCCGACC TTCTAGCCCA CGCGAAAAAG 
GCTGAGAGTG CGGCACTATC CGACCCACGA GGCGCATGCT TCTGGGCGCG GTTGACCTTG
GAAACAGCGA TAAAATGGAT TTATCGCAAC GACCCGTCGA TGCGCTCGCC CTACCAGGAC
ACTCTAGCGG CATTGATTGC AGAGCCGTCG CTCGGCCAAC TCACTGGCCC GGCTATCGTC
ACCAAAGCGC GATTTATCAA AGATCATGGC AACCGGGCGG CGCATGACAG CGGTAAGCCG
ATCAAGCCTC AGGACGCCGC CGCTGTTGTG CGTGAGCTGT TCCATGTCTG CTATTGGATG
GCGCGGACCT ATGCAAAGGA TGCGAAGCCA GACTCTTCGT TGCAGTTCGA TGCCTCAAAG
TTGGAAAAAA CCCTGACGAT TAGTGCCAGC ACAGTTGCAC AAATCCAGGC ATTGAAAGAG
AAGCACGACG CCCATGCCAA AGCGCTAAAG GACGCTGAAG CCGCCGCGCT CGCATCTGAG
GAGGGTCGAA AGAGGCTCGA GGCTGAACTC GCGCAAGTGC GGGCGGAAAT CAAAGAAATT
CGCCAGGCCA ACACTGCGGT ACAAGATGAC CACGACTACA ATGAGGCCGC GACGCGCGAT
GCTTTCATTG ATCTGCTGCT GAACGAGGCT GGGTGGCCGC TCGACCAGGA GCGCGACCGT
GAATTTCCGG TATCAGGTAT GCCTAATGAC AAAGGTGAAG GCCTTGTCGA TTATGTGCTT
TGGGGAAACG ATGCCAAGCC ACTCGCCTTG GTCGAAGCGA AACGAACCAT GAAGGACAGC
CGGATAGGGC AGCAGCAAGC CAAGCTTTAC GCCGATTGTC TTGAAACGAT GTATGGGCGC
CGTCCGGTGA TTTTCACGAC GAACGGCTAT GAACATTGGC TCTGGGACGA CCAGATGTAC
CCCCCGCGTC GTGTATCGGG CTTCCTGAAA AGGGATGAAC TCGTCCTGCT TCACCAACGT
AGAGGCACCA GAAAATCGTT AGATGGGGTG TCCGTAGACC AGGGTATTGC TGGGCGCTTT
TACCAACAGC GAGCCATTCG CCGGGTAGGC GAGGCTTTTG AACGAGACCG ACAGCGGAAG
GCCTTGCTTG TAATGGCGAC TGGCAGTGGC AAGACGCGAA CGGTCATCGC GCTTATCGAC
CAGCTCATGC GCGCCAATTG GGTGCGGCGT GTGCTCTTCC TCGCGGATCG CGTTGCTCTG
GTGAAACAGG CACACAATGC CTTCAAGACA CACTTGCCGA CGGCGGCAAG TGCAAACCTG
CTGAAGAATC ATGACCCGGC CAGGAACGAT CATTCTGGTG CTCGTATTTG TCTATCGACC
TATCCCACGA TGTTGGGGTT GATCGACGAC GTCAAAGGCG GTGAAAAGCA GTTTGGCCCC
GGCCACTTTG ACTTGATCGT GATCGACGAA GCGCACCGTT CTGTCTATCG GAAGTACCGT
GCCATCTTCG ACTATTTTGA TGGACTCTTG GTCGGCCTCA CGGCGACGCC GCGCGAAGAG
ATTGACCGAG ACACATACTC GCTCTTTGAG CTGGAACGCG GAATACCAAC CGACAGTTAT
GACTTAGAGG ATGCGGTGGC GGATGAATAT CTCGTACCGC CTAGATCGAT CTCGGTCCCT
TTGAAGTTCC AACGTGACGG TATCGACTAC GACAGCCTTT CTGACGAAGA AAAAGCCGAG
TGGGACGCAA TCGAGTGGGA TGATGAGGGC GCGGTTCCTG ATCGGGTGGA GGCGGCTGAC
TTGAACCGCT GGCTGTTCAA CAAGGATACA GTGGACAAAG TCCTTGAGCA TCTAATGACC
AACGGCATCA AAGTCGCCGG AGGCGACAGG CTTGGAAAGA CAATCATTTT CGCTAAAAAC
AGCGATCATG CGAAGTTCAT TGTCGAGCGC TTCGATGCCA ACTACCCACA TTACATGGGT
CAGTTTGCAC GTCTCATCGA CTACAGTGTC ACCTATGCGC AATCGTTGAT CGACGATTTC
TCAGAAGCGG AGGAAGCCCC CCATATCGCG GTCTCTGTCG ATATGTTGGA CACAGGCATT
GATGTGCCCG AGGTCGTGAA TCTCGTTTTT TTCAAGATTG TTCGTTCTAA AACGAAGTTC
TGGCAAATGG TTGGTAGAGG CACCCGAACT TGCGAGGATC TGTTCGGGCC AGGGCAAGAC
AAACAAGAGT TCATCATCTT TGACTTCTGC CAAAACTTAG AGTTCTTCAA CGAGAACCCG
AAAATCACAG ATAGCCCTAC TGCGCGACCA ATTGGCGAAC GGTTGTTTGT GTCGCGTGTC
GAGTTGATTA GCGCTTTGGA CGAAGCCGGG AACGAAAGCC CCTTGGCTGC CGACATTAAG
GACCGTTTGC ATCAGGAAGT TCTCGGCATG AACTTAGAGA ACTTCATCGT ACGTCAAGCG
CGCCGACCGG TCGAACGATT TCAAAGCGCC GAAGCTTGGG ACACTTTGGA CCTCGATGCA
CGTCTAGCTT TGATTGAAGA AGTTGCCGGT CTTCCTACAG CTTTCGAAGA CGGCCGACTC
GCAGCAAAGC AGTTCGATCT ATTGGTGCTA AATGCCCAAC TACTTCTACT TCGCGGTGAC
GCGGCATTCG CCAATCTTCA AAGACGGATC GTGTCCTTTG CTTCCGCCTT GGAAAGCCTT
TCGAATGTTC CGGCAGTGAC CCAAGAGCTT GAGCTCGTAT TGGCGATCCA AACTGATGAG
TTTTGGCAGG ATATCAATGT TGAAATCCTA GAAGACGTCC GGCGACGCTT CCGCAACTTG
GCAGAGCTAA TCCAGCCGAA GGAACGAAAA AACGTTATCA CGGATTTTGA AGACAGCATC
GGCACCTCAA CCACTATCGA CTTGCCCGAA GTGGGAAGCG GCGTGGATAA GGTGCGTTTC
AAAACCAAGA CACGGAAATT CATCGAAGCC CATTCTGACC ACATCGCTCT CCAAAAAATT
CGGCGCGGCG AACAACTTAC GCCCGCCGAT CTTCAGGAGC TAGAACGTAT GCTGATTGAT
GAAGGCGTAG CCGACCATGA CGTTCTGGAT GGCCTGCAGA ATGAAGGTGG TCTCGGAGTG
TTTCTGCGGT CGCTTACGGG GCTTGATCGA GCCGCAGCGA AAGCCTTGTT CAGCGATTTC
GCGGCAGCCA ACCAACTTTC GGCGAACCAA ACAGAATTCA TCGATCTAAT CATCAACAGC
CTTTGCGAAA ATGGCGTTTT GGACCCCAAG ACCTTCTATG AGAGTCCGTT CACTGACCTG
GATGACATGG GGATTATGGG TGTTTTCAGT GAAACTCAAT CGGCCGAGAT AATCAGACTG
GTCAGAGAAA CCAATCAAAC CGCTGCAGCC TAA
 
Protein sequence
MTQFVFLTAD FPDLLAHAKK AESAALSDPR GACFWARLTL ETAIKWIYRN DPSMRSPYQD 
TLAALIAEPS LGQLTGPAIV TKARFIKDHG NRAAHDSGKP IKPQDAAAVV RELFHVCYWM
ARTYAKDAKP DSSLQFDASK LEKTLTISAS TVAQIQALKE KHDAHAKALK DAEAAALASE
EGRKRLEAEL AQVRAEIKEI RQANTAVQDD HDYNEAATRD AFIDLLLNEA GWPLDQERDR
EFPVSGMPND KGEGLVDYVL WGNDAKPLAL VEAKRTMKDS RIGQQQAKLY ADCLETMYGR
RPVIFTTNGY EHWLWDDQMY PPRRVSGFLK RDELVLLHQR RGTRKSLDGV SVDQGIAGRF
YQQRAIRRVG EAFERDRQRK ALLVMATGSG KTRTVIALID QLMRANWVRR VLFLADRVAL
VKQAHNAFKT HLPTAASANL LKNHDPARND HSGARICLST YPTMLGLIDD VKGGEKQFGP
GHFDLIVIDE AHRSVYRKYR AIFDYFDGLL VGLTATPREE IDRDTYSLFE LERGIPTDSY
DLEDAVADEY LVPPRSISVP LKFQRDGIDY DSLSDEEKAE WDAIEWDDEG AVPDRVEAAD
LNRWLFNKDT VDKVLEHLMT NGIKVAGGDR LGKTIIFAKN SDHAKFIVER FDANYPHYMG
QFARLIDYSV TYAQSLIDDF SEAEEAPHIA VSVDMLDTGI DVPEVVNLVF FKIVRSKTKF
WQMVGRGTRT CEDLFGPGQD KQEFIIFDFC QNLEFFNENP KITDSPTARP IGERLFVSRV
ELISALDEAG NESPLAADIK DRLHQEVLGM NLENFIVRQA RRPVERFQSA EAWDTLDLDA
RLALIEEVAG LPTAFEDGRL AAKQFDLLVL NAQLLLLRGD AAFANLQRRI VSFASALESL
SNVPAVTQEL ELVLAIQTDE FWQDINVEIL EDVRRRFRNL AELIQPKERK NVITDFEDSI
GTSTTIDLPE VGSGVDKVRF KTKTRKFIEA HSDHIALQKI RRGEQLTPAD LQELERMLID
EGVADHDVLD GLQNEGGLGV FLRSLTGLDR AAAKALFSDF AAANQLSANQ TEFIDLIINS
LCENGVLDPK TFYESPFTDL DDMGIMGVFS ETQSAEIIRL VRETNQTAAA