Gene TM1040_3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3549 
Symbol 
ID4075225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp590038 
End bp593316 
Gene Length3279 bp 
Protein Length1092 aa 
Translation table11 
GC content58% 
IMG OID638005061 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_611780 
Protein GI99078522 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.276779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.254191 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAC AGATCCCCGA CACCCGCGAA GAAGCAGCCT CCAAGCTCCC AGCGTTGCAC 
ATGCTGATAG CGATGGGCTG GACCTACCTG TCCCCTGCAT CAGCCTTGGG TCTGCGCGGA
TCGACCCGCG CGGTGTTGCT GGAACCTGTG CTGCGGCAGT ACCTGAAAGC GCATCGATTT
GCGTTCAAGG GTCAACGCTA TCCACTGTCC GACGCGGGCA TCGCTCAGGT CCTGAAGGAG
ATCACGGTCA CCGGTCTCAG CGAGGGGCTG CGCTATGCCA ATGAGGCGAC CTATAAGAAG
CTGACGCTTG GTGTGACAGT TACCGAGTTC GTGGACGGGC AGAAGACATC GGTCACGGTG
CCTCTGATCA ACTGGAACAC GCCGCAGGAC AACAGCTTCT ATGTCAGTGA GGAGTTCAGC
GTCCAACGCG CCCAGGGTAA CGACCACTAC CGGCCTGATC TCGTGTGCTT CGTCAACGGC
ATCCCGCTGG CCGTGATCGA GGCCAAGCGC CCGACCCATG CGACCAAGGA AACGGCCATG
GTCGAGGAAG GCATCAGCCA GCACAATCGT AACCAGAAGG AAGGCGGCAT CCCCGCGCTC
TATGTCTACT CGCAGCTGCT GTTGTCGATA TCAGGCTCCG GCGGGCGGTA TGGCACCACC
GGCACCGACA AGAAGTTCTG GTCGACTTGG CACGAGGAGG AGATCGCCGA GGCTGACATG
CAGGCGCTCA AGTCCAAGCC GCTGACGGCC ACTCAGATTG ATGCACTGTT CGCTGACCGT
CCGGCCTGGG CGAGAAATGA GTTCGAGGAA CTGCACAGCG GCAAGCTGAT GCTGACCGAG
CAGGACCGGC TAATCGTCAG CCTGCTGCGC CCGGATCGTC TGCTGGATCT CACCCGGCGC
TTCATCTTCT TCGACAGCAA GATCGGCAAG ATCGTTGCAC GGTATCAGCA GGTGCAGGGA
ACGAAGGCTA TTCTGGCCCA AGTGGCCAAC AAGAGCGAAG ACGGTAGCCG CAAGGGCGGC
GTGATCTGGC ACACCACCGG CTCCGGCAAG AGCAACCTGA TGGTGTTCCT CGCCAAGGCT
CTGCTGACGG ACCCGGAGCT GGCAGACTGC CGGTTGATCA TCGTCACCGA CCGTGTCGAT
CTGGAGAAAC AGCTTGCCAG GACCTTCCTT ACCGGCGGGG CCTTTGGGTC TGTGGTCGCA
ACCAAAAAGG ACGGGGAAAA CGCCAAGGTT CAGTCCGGAG AAGACCTGGC GCAACGGATT
GGCCACGGTA ACGAGCGTAT CATTTTTACG CTACTGCAGA AGTTCAACTC CGCCACCAAA
TACCCGGAGT GCAAGAACAC CTCCGACAAG CTGATAGTGC TGGTCGATGA AGGACATCGC
AGCCAGGGCG GCGAGAACCA TGAGCGGATG CGGCAGGCGC TGCCCAACGC CGCCTTTATT
GCCTTCACGG GGACCCCCCT ACTGAAGAAA GACAAGACCC GCAACAAATT TGGGCCGATC
CTCCACGCCT ATACCATGAG CGATGCGATT GCAGACGGGG CGGTCACACC GCTGGTCTAC
GAGGAGCGCA AGCCGCTGCT GGACGTGAAT GACGCGGCGA TTGATGCATG GTTCGATACC
TTCACCCAGA GCATGACCGA CAAGCAGAAG TCGGACCTGA AGAAGAAATA TTCCAGCCGG
GGACAGATCT ATGGTGCCGA AGACCGGATA GACCGGATCG CGCTGGATAT CGCCACGCAT
TTCAACGAAC ATTTCAAGCA GCACTGCCCC GGCCTCAAAG CGCAGCTGGC CACGGACAGC
AAACTGTCGG CCATCCGCTA TAAGGAAGCG CTTGACGCGA CGGGCTTGGT GTCCAGCGCA
GTCGTGATTT CGCCACCAGA TACCCGTGAG GGGCATGACG ACACGGACGA CCGCAAGACC
CCGGAAGTGC AGGCGTGGTG GGAAAAAACC GTCGGGAAGG ATGCGGAGGC CTACGAGACT
GCTGTTATCT CCGATTTCGG CACTGATGGT GACCCGGATA TCTTGATCGT GGTCGACAAG
CTGCTGACGG GGTTCGATGA GCCACGCAAT GCGGTTCTGT ACATCGACAA GCTCATCAAG
GGTCACAATC TCTTGCAGGC GATTGCTCGT GTGAACAGGC TGCACGAGGA CAAACAGTTC
GGATATCTGA TCGACTATCG CGGCATCCTG GCTGAGCTGG ACACTTCCAT CAAGGACTAC
CAGAATCTCG CCGCGAAGAC GCAGGCGGGG TATGACATCG ACGATCTAAA AGGCACCTTC
GAGGAAGTAT CCGTTGAGTA CAAGCGCTTG CCTTCACGGC ATGACCGTCT GTGGTCCCTG
TTCGCGGAGG TGGAAAACAA GAAAGACCGG GAGCAGTTCC GCCGTGTGCT GGTTCCCCGC
ATGGAAGATG ACGGCACCGG CCACAGCATC GACCTGAGCC AAAAGATCCG GGAGGACTTC
TACGAAGCGC TCACCGAGTT CGGGATGTGC TTGAAGCTTG CCTTGGCCTC ACGGAGTTTC
TACGAGGACA GCTCTTTCGA CGAGGCAGTG ATCGCCACCT ACAAGAGGGA CCTGAAGTTC
TTCACCGAGA TCCGCACCCA GGCCCGTCAG GATGCAGGCG AGACTGTGGA CTTTTCCCAG
TACGAACAGC AGATCAGGCA GCTCGTGGAC AAGCATGTGA TCGGCCAGGA TATCGTTGAG
CCGGAAGGCT TCATCCGGGT GGTAGACCTT GGACAAGCAG GTTCACCTGA GGACTGGTCT
GACGAGAAGA CACGCACCGA AGCCGATGTC ATCAAGACCC GCATCCGCAA GACGATTGAA
CAGGAGCTGA TCGACGATCC CTATGCTCAG AAGGTTTTTT CGGACTTGCT GAAGGACGCC
ATCAAGGCCG CAGAAGCGAT GTTCGATCAC CCCCACAAGC AGTATGTGAT GTTCAAGGAC
TTGGAGCAAC AGGTGTCCAA CAGGGCAACG CCGGGTCTAC CTGACAGTTT CTCCGGGAAT
TATCGGGCGC AAGCATTTTA TGGCGCGTTG CTGGAGGCCA AAGGTTTGGG AACTGACCAA
CTCCTCGATC AAGAGATGCT GGTGCAGGAA GCCTTTCACA TAGACCAAGT AGTAAGTGAG
GCTGTCCAAG CGCACTCGAT CAACCCAGCC AACATCGAAG CCGCGATTTC TAAGTCGCTG
TTGCCGCGCT ACTTCCAGAA GTTCGGAGGG CTGGACCAAG CGGAAATGCT GGTTGGGCAA
CTTGTTGCCA TCGTCCGTTC GGGCGGTAAC AGGGCCTGA
 
Protein sequence
MSQQIPDTRE EAASKLPALH MLIAMGWTYL SPASALGLRG STRAVLLEPV LRQYLKAHRF 
AFKGQRYPLS DAGIAQVLKE ITVTGLSEGL RYANEATYKK LTLGVTVTEF VDGQKTSVTV
PLINWNTPQD NSFYVSEEFS VQRAQGNDHY RPDLVCFVNG IPLAVIEAKR PTHATKETAM
VEEGISQHNR NQKEGGIPAL YVYSQLLLSI SGSGGRYGTT GTDKKFWSTW HEEEIAEADM
QALKSKPLTA TQIDALFADR PAWARNEFEE LHSGKLMLTE QDRLIVSLLR PDRLLDLTRR
FIFFDSKIGK IVARYQQVQG TKAILAQVAN KSEDGSRKGG VIWHTTGSGK SNLMVFLAKA
LLTDPELADC RLIIVTDRVD LEKQLARTFL TGGAFGSVVA TKKDGENAKV QSGEDLAQRI
GHGNERIIFT LLQKFNSATK YPECKNTSDK LIVLVDEGHR SQGGENHERM RQALPNAAFI
AFTGTPLLKK DKTRNKFGPI LHAYTMSDAI ADGAVTPLVY EERKPLLDVN DAAIDAWFDT
FTQSMTDKQK SDLKKKYSSR GQIYGAEDRI DRIALDIATH FNEHFKQHCP GLKAQLATDS
KLSAIRYKEA LDATGLVSSA VVISPPDTRE GHDDTDDRKT PEVQAWWEKT VGKDAEAYET
AVISDFGTDG DPDILIVVDK LLTGFDEPRN AVLYIDKLIK GHNLLQAIAR VNRLHEDKQF
GYLIDYRGIL AELDTSIKDY QNLAAKTQAG YDIDDLKGTF EEVSVEYKRL PSRHDRLWSL
FAEVENKKDR EQFRRVLVPR MEDDGTGHSI DLSQKIREDF YEALTEFGMC LKLALASRSF
YEDSSFDEAV IATYKRDLKF FTEIRTQARQ DAGETVDFSQ YEQQIRQLVD KHVIGQDIVE
PEGFIRVVDL GQAGSPEDWS DEKTRTEADV IKTRIRKTIE QELIDDPYAQ KVFSDLLKDA
IKAAEAMFDH PHKQYVMFKD LEQQVSNRAT PGLPDSFSGN YRAQAFYGAL LEAKGLGTDQ
LLDQEMLVQE AFHIDQVVSE AVQAHSINPA NIEAAISKSL LPRYFQKFGG LDQAEMLVGQ
LVAIVRSGGN RA