Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shel_17680 |
Symbol | |
ID | 8395658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | - |
Start bp | 1995629 |
End bp | 1998736 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644986522 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_003144136 |
Protein GI | 257064464 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGACG ACAAAGCCTC CTTCCGCTAC TCTGCCGCCA CGATGCCGAT GAACGAGGAC TTCTTCGAGC AGGTCATCAT CGAGCACCTT CGGGACGAGC ACGGCTACGA GTTCCTGCAC GGCCCCGACG TGCCGCGCAC CACCCCCGAC TACCGCGACG TTTTCCTGCC GGCCGTCCTG CCGGACAGTC TAAGGCGCAT CAACCCGGGC CTTCCCACAG CCGCCATCGA GCAGGCGGTC CTCAAGATCT CGAACATCGA GGCCGGCGGG CTCTACCAGA AGAACGAGGT GTTCAACGAC TACCTGCAGT CGGGCGTTGA GGTGCACTTC TACGACGGAA AGGAGGAGCG GGACGACATC GTCTACCTGC TGGACTTCGA TGACCCCGAG AACAACTCCT TCCACGTGGT CAACCAGTGG ACCTTCGTGG AGTACTCGCA GAAGCGCCCC GACGTCGTCG TGTTCGTGAA CGGCATGCCG CTCGTCGTCT TCGAGCTCAA GAGCCCGTCC CGCGAGGAGA CCGACGCGTC GGATGCCTAC CTGCAGCTGC GGCAGTACAT GAAACAGATT CCCAGCATGT TCGTGCCGAA CGTGTTCTGC GTGATGAGCG ACATGAGCGA GACGCGCGTG GGAACCATCA CCGCCGACGA GGACCGCTAC GTCTCCTGGA AGAGCGTGGA CGGCGACTAC TCCGGCACGA AGGGGGCCAC ATGGAGGACG ATGCTCGACG GCATGATGCC GAAGGCGCGG CTGCTCGACA TCGTCAGGAA CTTCGTATGC TTCAACGACG ACTCGTCCAG GGTCGTCAAG ATCCTCGCGG GCTACCACCA GTATTTCGGC GTGACGAAGG CGGCCGACCG GGCCGTGGAG GCGGTGGCGG GCGACGGCAA GATCGGCGTG TTCTGGCACA CGCAGGGCTC GGGCAAATCG CTCTCGATGG TCTTCTTCGC CCACCTGCTG CAGGAGCGGC TGGAGAGCCC CACCATCGTG GTCATCACCG ACCGAAACGA CCTGGACGAT CAGCTCTACG GGCAGTTCTG CCGCTGCGCG CCGTTCCTTC GCCAGACGGC CGTGCAGGCG ACGAGCCGGG ACGGAGCGAA CGACAGCACC AGCCTGAAGC GGCTGCTCGT CGGGCGCGAG GCGAACGGCA TCGTCTTCAC GACGATGCAG AAGTTCATGG ACGGCGAGGA GCCTTTGTGC GACCGCAGCA ACGTGGTGGT GATGGTTGAC GAGGCGCACC GAGGGCAGTA CGGGCTCACC GAGCGCATGG ACGCAGAGGG CAACGTGCGA GTCGGCGCCG CCCGCGTCGT GCGCAAGGCG CTACCGAACG CGAGCTACAT CGGCTTCACC GGCACCCCGA TATCGACCGA GGACAAGAAC ACCCGCGAGA TCTTCGGCGA CTACATCGAC GTGTACGACA TGACCCAGGC GGTTGAGGAC AACGCGACCC GCCCCGTCTA CTACGAGAGC CGCGTGGTGG CGCTGAAGCT GGACGAGGCC AAGCTCGCGC AACTCGACGC CGCCTACGCC GAGTTCGCAT CCGAGGCCAA CGCGGTCAGC GTGGAGAAGG CCAAACGCGA CACGGGAGGA CTCGACGCAA TATTCGGCGC GCCCGAGACC ATCGACGCGC TCTGCCGCGA CATCATAGAC CACTACGAGA ACAACCGCGC CGACATCCTG GCGGGAAAGG CCCTCATAGT CGCGTACAGC AGGCCCGTGG CCATGAAGAT CTACTACCGC ATGATGGAGC TGCGTCCCGA GTGGAAAGAC AAGCTCGGCG TCGTGATGAC GATGGGCAAC CAGGACCCCG AGGAGTGGTT CGACGTCTGC GGCGGCAAGA CGCACAAGAA GGAGATGGAG CGCCGGTTCA AGGACGACTG CGACCCGCTC AAGATCGCCA TCGTCGTGGA CATGTGGCTC ACCGGCTTCG ACGTGCCGAG CCTCGCCACC ATGTACGTGT TCAAGCCTAT GAGGGGCCAC AACCTCATGC AGGCCATCGC CCGCGTTAAC CGAGTGTGCA AGGGCAAGGA GGGCGGCCTG GTCGTGGACT ACATCGGCAT AGCCCGCGCC CTCAAGCAGG CGATGAAGGA CTACACCAAC CGCGACCAGT CCAACTACGG CGACATGGAC GTAGCGGCCA CCGCCTACCC GAAGTTCCTG GAGAAGCTGG ACGTGTGCCG CGACCTCATG TACGGCTTCG ACTACCGCAA GGCCATCTTC ACCGACAGTA AGGCGCAGCT CGCGTCCGCC ATCGCAGAGG GCACCGACTG GCTGCTGGAG CCCTCGCGCG AGGAGGACCG CGAGGACTTC ATCAAGCAGT GCCAGCTGAT GAACCAGGCG CTCAGCCTAT GCAAGAGCCT GGTATCCGAG GAGGACCAGC ACGAGGCGGC CTACCTCTCC GTCCTGCGCG TGCAGGTGCT GCGGCTGACG GGCCGGCAGG GCTCCGGCGG AGGCGGCATG ACCTACGCCG AGTTCAACAA GCGCGTGACC GCCATTCTCG AGCAGACAGT GCAGGCCGAC GGCGTCATCG ACCTGTTCGA GAAGGACAGC GTGGAGATAT CACTCTTCGA CGAGGCGTTC TTGCAGGAGC TCGCCAACAT GAAGCAGAAG AACATCGCCG TCGAGAGCCT CAAACGCCTC ATCAAGGAGC GCGTGCGCGC CTATCAGCGC ACGAGCGTGG TGAAGGCGGA GAAGTTCAGC GACATGCTGC AGGGCACGCT CAACGCCTAC CTGAACGGCA TGCTCACCAA CGCGCAGGTC ATCGAAGAGC TCGTGAACAT GGCCAAGGAG ATGATGAAGG ACCGCACCGA CGCCGAGAAG CTGGGGCTAT CCGACGAGGA GATGGCCTTC TACGACGCGA TAACCAAGCC GCGGGCCGTG AAGGACTTCT ACGACAACGA CCAGCTCGTC GCCATCACGC GGGAGCTGAC CGAGGCGATG CGCACGAACG CCACCATCGA CTGGCAGCGC AAGGAGTCGG CGCGCGCCGG CATGCGCCGC GCCATCAAGC GGCTTCTGAG GAAGTACAAG TATCCGCCCG AGGGCGCCGA TGAGGCCATG ACCACCGTCA TGGCTCAGTG CGAGCTCTGG GCAGACACCA AGATCTAA
|
Protein sequence | MFDDKASFRY SAATMPMNED FFEQVIIEHL RDEHGYEFLH GPDVPRTTPD YRDVFLPAVL PDSLRRINPG LPTAAIEQAV LKISNIEAGG LYQKNEVFND YLQSGVEVHF YDGKEERDDI VYLLDFDDPE NNSFHVVNQW TFVEYSQKRP DVVVFVNGMP LVVFELKSPS REETDASDAY LQLRQYMKQI PSMFVPNVFC VMSDMSETRV GTITADEDRY VSWKSVDGDY SGTKGATWRT MLDGMMPKAR LLDIVRNFVC FNDDSSRVVK ILAGYHQYFG VTKAADRAVE AVAGDGKIGV FWHTQGSGKS LSMVFFAHLL QERLESPTIV VITDRNDLDD QLYGQFCRCA PFLRQTAVQA TSRDGANDST SLKRLLVGRE ANGIVFTTMQ KFMDGEEPLC DRSNVVVMVD EAHRGQYGLT ERMDAEGNVR VGAARVVRKA LPNASYIGFT GTPISTEDKN TREIFGDYID VYDMTQAVED NATRPVYYES RVVALKLDEA KLAQLDAAYA EFASEANAVS VEKAKRDTGG LDAIFGAPET IDALCRDIID HYENNRADIL AGKALIVAYS RPVAMKIYYR MMELRPEWKD KLGVVMTMGN QDPEEWFDVC GGKTHKKEME RRFKDDCDPL KIAIVVDMWL TGFDVPSLAT MYVFKPMRGH NLMQAIARVN RVCKGKEGGL VVDYIGIARA LKQAMKDYTN RDQSNYGDMD VAATAYPKFL EKLDVCRDLM YGFDYRKAIF TDSKAQLASA IAEGTDWLLE PSREEDREDF IKQCQLMNQA LSLCKSLVSE EDQHEAAYLS VLRVQVLRLT GRQGSGGGGM TYAEFNKRVT AILEQTVQAD GVIDLFEKDS VEISLFDEAF LQELANMKQK NIAVESLKRL IKERVRAYQR TSVVKAEKFS DMLQGTLNAY LNGMLTNAQV IEELVNMAKE MMKDRTDAEK LGLSDEEMAF YDAITKPRAV KDFYDNDQLV AITRELTEAM RTNATIDWQR KESARAGMRR AIKRLLRKYK YPPEGADEAM TTVMAQCELW ADTKI
|
| |