Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3549 |
Symbol | |
ID | 4075225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 590038 |
End bp | 593316 |
Gene Length | 3279 bp |
Protein Length | 1092 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005061 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_611780 |
Protein GI | 99078522 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.276779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.254191 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAAC AGATCCCCGA CACCCGCGAA GAAGCAGCCT CCAAGCTCCC AGCGTTGCAC ATGCTGATAG CGATGGGCTG GACCTACCTG TCCCCTGCAT CAGCCTTGGG TCTGCGCGGA TCGACCCGCG CGGTGTTGCT GGAACCTGTG CTGCGGCAGT ACCTGAAAGC GCATCGATTT GCGTTCAAGG GTCAACGCTA TCCACTGTCC GACGCGGGCA TCGCTCAGGT CCTGAAGGAG ATCACGGTCA CCGGTCTCAG CGAGGGGCTG CGCTATGCCA ATGAGGCGAC CTATAAGAAG CTGACGCTTG GTGTGACAGT TACCGAGTTC GTGGACGGGC AGAAGACATC GGTCACGGTG CCTCTGATCA ACTGGAACAC GCCGCAGGAC AACAGCTTCT ATGTCAGTGA GGAGTTCAGC GTCCAACGCG CCCAGGGTAA CGACCACTAC CGGCCTGATC TCGTGTGCTT CGTCAACGGC ATCCCGCTGG CCGTGATCGA GGCCAAGCGC CCGACCCATG CGACCAAGGA AACGGCCATG GTCGAGGAAG GCATCAGCCA GCACAATCGT AACCAGAAGG AAGGCGGCAT CCCCGCGCTC TATGTCTACT CGCAGCTGCT GTTGTCGATA TCAGGCTCCG GCGGGCGGTA TGGCACCACC GGCACCGACA AGAAGTTCTG GTCGACTTGG CACGAGGAGG AGATCGCCGA GGCTGACATG CAGGCGCTCA AGTCCAAGCC GCTGACGGCC ACTCAGATTG ATGCACTGTT CGCTGACCGT CCGGCCTGGG CGAGAAATGA GTTCGAGGAA CTGCACAGCG GCAAGCTGAT GCTGACCGAG CAGGACCGGC TAATCGTCAG CCTGCTGCGC CCGGATCGTC TGCTGGATCT CACCCGGCGC TTCATCTTCT TCGACAGCAA GATCGGCAAG ATCGTTGCAC GGTATCAGCA GGTGCAGGGA ACGAAGGCTA TTCTGGCCCA AGTGGCCAAC AAGAGCGAAG ACGGTAGCCG CAAGGGCGGC GTGATCTGGC ACACCACCGG CTCCGGCAAG AGCAACCTGA TGGTGTTCCT CGCCAAGGCT CTGCTGACGG ACCCGGAGCT GGCAGACTGC CGGTTGATCA TCGTCACCGA CCGTGTCGAT CTGGAGAAAC AGCTTGCCAG GACCTTCCTT ACCGGCGGGG CCTTTGGGTC TGTGGTCGCA ACCAAAAAGG ACGGGGAAAA CGCCAAGGTT CAGTCCGGAG AAGACCTGGC GCAACGGATT GGCCACGGTA ACGAGCGTAT CATTTTTACG CTACTGCAGA AGTTCAACTC CGCCACCAAA TACCCGGAGT GCAAGAACAC CTCCGACAAG CTGATAGTGC TGGTCGATGA AGGACATCGC AGCCAGGGCG GCGAGAACCA TGAGCGGATG CGGCAGGCGC TGCCCAACGC CGCCTTTATT GCCTTCACGG GGACCCCCCT ACTGAAGAAA GACAAGACCC GCAACAAATT TGGGCCGATC CTCCACGCCT ATACCATGAG CGATGCGATT GCAGACGGGG CGGTCACACC GCTGGTCTAC GAGGAGCGCA AGCCGCTGCT GGACGTGAAT GACGCGGCGA TTGATGCATG GTTCGATACC TTCACCCAGA GCATGACCGA CAAGCAGAAG TCGGACCTGA AGAAGAAATA TTCCAGCCGG GGACAGATCT ATGGTGCCGA AGACCGGATA GACCGGATCG CGCTGGATAT CGCCACGCAT TTCAACGAAC ATTTCAAGCA GCACTGCCCC GGCCTCAAAG CGCAGCTGGC CACGGACAGC AAACTGTCGG CCATCCGCTA TAAGGAAGCG CTTGACGCGA CGGGCTTGGT GTCCAGCGCA GTCGTGATTT CGCCACCAGA TACCCGTGAG GGGCATGACG ACACGGACGA CCGCAAGACC CCGGAAGTGC AGGCGTGGTG GGAAAAAACC GTCGGGAAGG ATGCGGAGGC CTACGAGACT GCTGTTATCT CCGATTTCGG CACTGATGGT GACCCGGATA TCTTGATCGT GGTCGACAAG CTGCTGACGG GGTTCGATGA GCCACGCAAT GCGGTTCTGT ACATCGACAA GCTCATCAAG GGTCACAATC TCTTGCAGGC GATTGCTCGT GTGAACAGGC TGCACGAGGA CAAACAGTTC GGATATCTGA TCGACTATCG CGGCATCCTG GCTGAGCTGG ACACTTCCAT CAAGGACTAC CAGAATCTCG CCGCGAAGAC GCAGGCGGGG TATGACATCG ACGATCTAAA AGGCACCTTC GAGGAAGTAT CCGTTGAGTA CAAGCGCTTG CCTTCACGGC ATGACCGTCT GTGGTCCCTG TTCGCGGAGG TGGAAAACAA GAAAGACCGG GAGCAGTTCC GCCGTGTGCT GGTTCCCCGC ATGGAAGATG ACGGCACCGG CCACAGCATC GACCTGAGCC AAAAGATCCG GGAGGACTTC TACGAAGCGC TCACCGAGTT CGGGATGTGC TTGAAGCTTG CCTTGGCCTC ACGGAGTTTC TACGAGGACA GCTCTTTCGA CGAGGCAGTG ATCGCCACCT ACAAGAGGGA CCTGAAGTTC TTCACCGAGA TCCGCACCCA GGCCCGTCAG GATGCAGGCG AGACTGTGGA CTTTTCCCAG TACGAACAGC AGATCAGGCA GCTCGTGGAC AAGCATGTGA TCGGCCAGGA TATCGTTGAG CCGGAAGGCT TCATCCGGGT GGTAGACCTT GGACAAGCAG GTTCACCTGA GGACTGGTCT GACGAGAAGA CACGCACCGA AGCCGATGTC ATCAAGACCC GCATCCGCAA GACGATTGAA CAGGAGCTGA TCGACGATCC CTATGCTCAG AAGGTTTTTT CGGACTTGCT GAAGGACGCC ATCAAGGCCG CAGAAGCGAT GTTCGATCAC CCCCACAAGC AGTATGTGAT GTTCAAGGAC TTGGAGCAAC AGGTGTCCAA CAGGGCAACG CCGGGTCTAC CTGACAGTTT CTCCGGGAAT TATCGGGCGC AAGCATTTTA TGGCGCGTTG CTGGAGGCCA AAGGTTTGGG AACTGACCAA CTCCTCGATC AAGAGATGCT GGTGCAGGAA GCCTTTCACA TAGACCAAGT AGTAAGTGAG GCTGTCCAAG CGCACTCGAT CAACCCAGCC AACATCGAAG CCGCGATTTC TAAGTCGCTG TTGCCGCGCT ACTTCCAGAA GTTCGGAGGG CTGGACCAAG CGGAAATGCT GGTTGGGCAA CTTGTTGCCA TCGTCCGTTC GGGCGGTAAC AGGGCCTGA
|
Protein sequence | MSQQIPDTRE EAASKLPALH MLIAMGWTYL SPASALGLRG STRAVLLEPV LRQYLKAHRF AFKGQRYPLS DAGIAQVLKE ITVTGLSEGL RYANEATYKK LTLGVTVTEF VDGQKTSVTV PLINWNTPQD NSFYVSEEFS VQRAQGNDHY RPDLVCFVNG IPLAVIEAKR PTHATKETAM VEEGISQHNR NQKEGGIPAL YVYSQLLLSI SGSGGRYGTT GTDKKFWSTW HEEEIAEADM QALKSKPLTA TQIDALFADR PAWARNEFEE LHSGKLMLTE QDRLIVSLLR PDRLLDLTRR FIFFDSKIGK IVARYQQVQG TKAILAQVAN KSEDGSRKGG VIWHTTGSGK SNLMVFLAKA LLTDPELADC RLIIVTDRVD LEKQLARTFL TGGAFGSVVA TKKDGENAKV QSGEDLAQRI GHGNERIIFT LLQKFNSATK YPECKNTSDK LIVLVDEGHR SQGGENHERM RQALPNAAFI AFTGTPLLKK DKTRNKFGPI LHAYTMSDAI ADGAVTPLVY EERKPLLDVN DAAIDAWFDT FTQSMTDKQK SDLKKKYSSR GQIYGAEDRI DRIALDIATH FNEHFKQHCP GLKAQLATDS KLSAIRYKEA LDATGLVSSA VVISPPDTRE GHDDTDDRKT PEVQAWWEKT VGKDAEAYET AVISDFGTDG DPDILIVVDK LLTGFDEPRN AVLYIDKLIK GHNLLQAIAR VNRLHEDKQF GYLIDYRGIL AELDTSIKDY QNLAAKTQAG YDIDDLKGTF EEVSVEYKRL PSRHDRLWSL FAEVENKKDR EQFRRVLVPR MEDDGTGHSI DLSQKIREDF YEALTEFGMC LKLALASRSF YEDSSFDEAV IATYKRDLKF FTEIRTQARQ DAGETVDFSQ YEQQIRQLVD KHVIGQDIVE PEGFIRVVDL GQAGSPEDWS DEKTRTEADV IKTRIRKTIE QELIDDPYAQ KVFSDLLKDA IKAAEAMFDH PHKQYVMFKD LEQQVSNRAT PGLPDSFSGN YRAQAFYGAL LEAKGLGTDQ LLDQEMLVQE AFHIDQVVSE AVQAHSINPA NIEAAISKSL LPRYFQKFGG LDQAEMLVGQ LVAIVRSGGN RA
|
| |