Gene Information |
Locus tag | Ksed_22820 |
Symbol | |
ID | 8373785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Kytococcus sedentarius DSM 20547 |
Kingdom | Bacteria |
Replicon accession | NC_013169 |
Strand | + |
Start bp | 2361880 |
End bp | 2364900 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644992518 |
Product | type I restriction enzyme R protein |
Protein accession | YP_003150024 |
Protein GI | 256826064 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
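The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon excluded from Protein Length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2361880
end_bp = 2364900

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3021
print(protein_length)  # 1006
```

Both values match the Gene Length (3021 bp) and Protein Length (1006 aa) fields listed in the record.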
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.324704 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAACA CGCGCGAGAA GTCGTTCGAG GCCAATGTCG AGGCCCACCT GACCGGCCAC GGCTGGCAGG CCCTCGCCCC CGAGGGGTAC GACCGGGCCC TCGGGCTGTT CCCCGACGAG GTGGTGGCCT GGCTGGCCGA GAGCCAGCCG AAGGCGTGGG AGCAGCTGGT GGCCCGCAAC GGCGGCGAGG GGCTGGCCCG GCAGCGGTTC CTCACGGCGC TGGCCGGCGA CATCGACCAC CGCGGCACCC TGGCGGTGCT GCGCAGCGGC ACCAAGCACG CCGGCATCAC GGTGGACATG GCCACCTTCC GCCCCGCCAG CGGGCTCAAC GAGACCATCG CCGCGCGGTA CGCCGCCAAC CGGCTGGCCG TGGTGCGCCA GCTGCACCAC TCCGAGAGTC GCCCCGCCGA CTCGGTGGAC GTCACTCTGG TGCTCAACGG CATCCCGGTG GCCACGGCCG AGCTGAAGAA CCCCTTCACG CACCAGACCA TCGAGCACGC CAAGACGCAG TACCGGGAGG ACCGGCACCC CAACGACCTG ATCTTCCGCA GCCGGGCGCT GGTGCACTAT GCGGTGGACC CGCACCACGT GGCCATGACC ACCCGGCTGG CCGGCCAGGA CACGCGCTTC CTGCCCTTCG ACCAGGGCAG CAACGGCCCC GGCCACCCCG GCACGGCGGG CAACCCGCAG CGCGACGGCT ACCAGACGGC CTACCTGTGG GAGGAGGTCT GGCAGCGCGA CGCGTGGCTG GAGCTGCTGG GCTCGTTCAT CCAGCAGCGC AAGGAGGGCA GCAGTTGGCA GGTGCTCTTC CCCCGCTACC ACCAGTGGCA CGCGGTACGC AGCATCCTGG CCGCCACCTA CGAGGACGGC GCCGGGGTGG ACCGCCTCAT CCAGCACTCC GCCGGCTCGG GCAAGTCCAA CACCATCGCC TGGAGCGCCC ACCTGCTCTC TCGGTTGCAC GGCAAGGACG AGGCGCCGGT CTTCGACAAG GTGGTGGTGC TCACCGACCG CAAGGTGCTG GACAAGCAGC TGCAGAACAC GGTCGCTGGG CTGGAGCACA CCGAGGGGAC GATCGTCCGC ATCGACAAGG ACTCCAAGCA GCTCAAGGCC TCCCTGGAGG GCAACGCGGC CCGCATCATC ATCACCACGC TGCAGAAGTT CCCGGTCGTG ATGCAGCTGG CCAAAGAGCA GGCGAAGGAG ACCGGTGATG GCGGCCGCGT GATGGGGCAG CGCTTCGCGG TCATCGTGGA CGAGGCGCAC TCCTCCACCT CCGGCTCGGC CTTCAGCGGC ATGAAACGGG TGCTGAAGGG TGGCGTTGAC GAGGCGCTGG ACGACGCCGA GGCAGCCGAG ACCACCGATG GCAACGTGCC GGGCGCCGAG GAGGACTCCC TGCTGGAGAG CGCGCAGCGG CGCGGCAAGC AGAGCAATCT GTCGTTCTTC GCCTTCACCG CGACGCCCAA GCCCAAGACG CTCAACATCT TTGGCCAGAC CGGGCCCGAT GGGGTGAAGC GCCCGTTCCA CACGTACTCC ATGCGGCAGG CGATCGCCGA GGGGTTCATC CTGGACGTGC TGGCCAGCTA CACGACCTAC GACGTGTATT ACAAGCTGAT CAACACCGAG CCGAGCGATC CCAGCATCGA CACCCGCAAG GGGAAGGCTG CGCTGGCCCG CCACGCCTCT CTGCACGACC ACATGATGGA GGGCAAGGCC GAGGTGATCG TGGAGCACTT CCGCGAGAAG ACAGCCCACA AGGTCGGTGG GCGCGCCAAA GCGATGGTGG TGACGCGGTC GCGGCTGCAC 
GCACTGAAGA CCCACCAGGC GATCGAGCGG TACATCAAGA AGAACGGGTA CGACAAGGGG CCGGGTGCGC TGTCGGCGCT GGTGGCGTTC TCCGGCTCGG TGACTGACCC CGACACCCCG GAGGTGGCGC TGACCGAGGG GGCGGTGAAC GGGTTCCCCG AGAGTGAGCT GCCGCGGCGC TTCCACGACG AGCACCAGGT GCTGGTGGTG GCCGAGAAGT ACCAGACCGG TTTCGACGAG CCGCTGCTGC ACACGATGTA CGTGGACAAG AAGCTTTCGG GCGTGGCGGC GGTGCAGACG CTGAGCCGGT TGAACCGAAC CATGCCGGGC AAGCACGACA CGTTCGTGCT GGACTTCGCG AACTCCGCCG AGGAGATCCA GGCGGCGTTC GAGCCGTTCT ACGAGCAGAG CCTGGCCGAG CAGGTGGACC CGCAGGCGCT GACCACCATG GAGCAGGAAC TGAAGACGTT TGGCGTGCTG GTGGCAGAGG AGATGCAGCA GGCCGTGGCT CTCGTGCTGG ACGGGGATGA CACAGACCAA GGGCGCATCT ACCAGCTGGT GGGGCAGGCA GCCGGGCGCT GGGTGATGCT CAACGAGCGC GATGAGGACG CCGCGGAGAC GTTCCGCGGG ACGCTTGACG CGTTCTGCCG TGCGTACACC TTCATTGCCC AGGTGATGCC GTGGGCCGAC CCCGAGCAGG AGCGGCTGTT CCTGTTCGGG CGGTTGTTGC TCGCGGACCT ACCGCAGCGC GAGGGTGATT CGATGCCCCA AATCAGCAGG TCGGTGCAGC TTTCGCACTT GCGGATTGCG GCGCAGGGCG AGACGGTTAT TGAGCTGGAG GGCTCCGATG AGCCCGGTGA GGCGCTGCCC GGTGAAGGCA AGGGTGCTGA GGCAGAGCCG GTGGCGGACA AGCTGTCGGC GCTCATTGCG GTGCTGAACG AGAAGTTCGG CGCCGAGCTG GGCGATGCCG ACAGGTTGTG GTTCGAGCAG CAGACGATGG TGGTGGCCCA CGACGTGGAC ATGCAGGATG TGGCGCGTCA CAATGACCGG AGGGCCTACA GGCTGGTGGT GGAGGAACGC GTGGAGGACA TGCTGCTGGA CCGGGACCAG CAGAACGGGA AGTTGTTCAG CATGTTCTTC GAGAACTTGG ACTTCCGCCG GATGATGGTG GAGCACATCG TGGGCCAGAC CTACGACAAG GCAAGGCGCC AGCCGGCATG A
|
Protein sequence | MANTREKSFE ANVEAHLTGH GWQALAPEGY DRALGLFPDE VVAWLAESQP KAWEQLVARN GGEGLARQRF LTALAGDIDH RGTLAVLRSG TKHAGITVDM ATFRPASGLN ETIAARYAAN RLAVVRQLHH SESRPADSVD VTLVLNGIPV ATAELKNPFT HQTIEHAKTQ YREDRHPNDL IFRSRALVHY AVDPHHVAMT TRLAGQDTRF LPFDQGSNGP GHPGTAGNPQ RDGYQTAYLW EEVWQRDAWL ELLGSFIQQR KEGSSWQVLF PRYHQWHAVR SILAATYEDG AGVDRLIQHS AGSGKSNTIA WSAHLLSRLH GKDEAPVFDK VVVLTDRKVL DKQLQNTVAG LEHTEGTIVR IDKDSKQLKA SLEGNAARII ITTLQKFPVV MQLAKEQAKE TGDGGRVMGQ RFAVIVDEAH SSTSGSAFSG MKRVLKGGVD EALDDAEAAE TTDGNVPGAE EDSLLESAQR RGKQSNLSFF AFTATPKPKT LNIFGQTGPD GVKRPFHTYS MRQAIAEGFI LDVLASYTTY DVYYKLINTE PSDPSIDTRK GKAALARHAS LHDHMMEGKA EVIVEHFREK TAHKVGGRAK AMVVTRSRLH ALKTHQAIER YIKKNGYDKG PGALSALVAF SGSVTDPDTP EVALTEGAVN GFPESELPRR FHDEHQVLVV AEKYQTGFDE PLLHTMYVDK KLSGVAAVQT LSRLNRTMPG KHDTFVLDFA NSAEEIQAAF EPFYEQSLAE QVDPQALTTM EQELKTFGVL VAEEMQQAVA LVLDGDDTDQ GRIYQLVGQA AGRWVMLNER DEDAAETFRG TLDAFCRAYT FIAQVMPWAD PEQERLFLFG RLLLADLPQR EGDSMPQISR SVQLSHLRIA AQGETVIELE GSDEPGEALP GEGKGAEAEP VADKLSALIA VLNEKFGAEL GDADRLWFEQ QTMVVAHDVD MQDVARHNDR RAYRLVVEER VEDMLLDRDQ QNGKLFSMFF ENLDFRRMMV EHIVGQTYDK ARRQPA
|
| |
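The protein sequence can be spot-checked against the gene sequence by translating a few codons. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the compact amino-acid string used for NCBI translation table 11 (whose codon assignments match the standard code; only the set of permitted start codons differs) and translates the first 30 and last 21 nucleotides of the gene. The function name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    Alternative start codons are not handled; this gene starts with ATG.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups and the final 21 bases of the gene sequence.
first_30 = "ATGGCGAACA CGCGCGAGAA GTCGTTCGAG"
last_21 = "GCAAGGCGCC AGCCGGCATG A"

print(translate(first_30))  # MANTREKSFE
print(translate(last_21))   # ARRQPA
```

The two results agree with the first ten and last six residues of the protein sequence in the record. The last 21 bases are in frame because the 3000 bases preceding them are a multiple of three.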