Gene Ksed_22820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_22820 
Symbol 
ID8373785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp2361880 
End bp2364900 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content67% 
IMG OID644992518 
Producttype I restriction enzyme R protein 
Protein accessionYP_003150024 
Protein GI256826064 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.324704 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAACA CGCGCGAGAA GTCGTTCGAG GCCAATGTCG AGGCCCACCT GACCGGCCAC 
GGCTGGCAGG CCCTCGCCCC CGAGGGGTAC GACCGGGCCC TCGGGCTGTT CCCCGACGAG
GTGGTGGCCT GGCTGGCCGA GAGCCAGCCG AAGGCGTGGG AGCAGCTGGT GGCCCGCAAC
GGCGGCGAGG GGCTGGCCCG GCAGCGGTTC CTCACGGCGC TGGCCGGCGA CATCGACCAC
CGCGGCACCC TGGCGGTGCT GCGCAGCGGC ACCAAGCACG CCGGCATCAC GGTGGACATG
GCCACCTTCC GCCCCGCCAG CGGGCTCAAC GAGACCATCG CCGCGCGGTA CGCCGCCAAC
CGGCTGGCCG TGGTGCGCCA GCTGCACCAC TCCGAGAGTC GCCCCGCCGA CTCGGTGGAC
GTCACTCTGG TGCTCAACGG CATCCCGGTG GCCACGGCCG AGCTGAAGAA CCCCTTCACG
CACCAGACCA TCGAGCACGC CAAGACGCAG TACCGGGAGG ACCGGCACCC CAACGACCTG
ATCTTCCGCA GCCGGGCGCT GGTGCACTAT GCGGTGGACC CGCACCACGT GGCCATGACC
ACCCGGCTGG CCGGCCAGGA CACGCGCTTC CTGCCCTTCG ACCAGGGCAG CAACGGCCCC
GGCCACCCCG GCACGGCGGG CAACCCGCAG CGCGACGGCT ACCAGACGGC CTACCTGTGG
GAGGAGGTCT GGCAGCGCGA CGCGTGGCTG GAGCTGCTGG GCTCGTTCAT CCAGCAGCGC
AAGGAGGGCA GCAGTTGGCA GGTGCTCTTC CCCCGCTACC ACCAGTGGCA CGCGGTACGC
AGCATCCTGG CCGCCACCTA CGAGGACGGC GCCGGGGTGG ACCGCCTCAT CCAGCACTCC
GCCGGCTCGG GCAAGTCCAA CACCATCGCC TGGAGCGCCC ACCTGCTCTC TCGGTTGCAC
GGCAAGGACG AGGCGCCGGT CTTCGACAAG GTGGTGGTGC TCACCGACCG CAAGGTGCTG
GACAAGCAGC TGCAGAACAC GGTCGCTGGG CTGGAGCACA CCGAGGGGAC GATCGTCCGC
ATCGACAAGG ACTCCAAGCA GCTCAAGGCC TCCCTGGAGG GCAACGCGGC CCGCATCATC
ATCACCACGC TGCAGAAGTT CCCGGTCGTG ATGCAGCTGG CCAAAGAGCA GGCGAAGGAG
ACCGGTGATG GCGGCCGCGT GATGGGGCAG CGCTTCGCGG TCATCGTGGA CGAGGCGCAC
TCCTCCACCT CCGGCTCGGC CTTCAGCGGC ATGAAACGGG TGCTGAAGGG TGGCGTTGAC
GAGGCGCTGG ACGACGCCGA GGCAGCCGAG ACCACCGATG GCAACGTGCC GGGCGCCGAG
GAGGACTCCC TGCTGGAGAG CGCGCAGCGG CGCGGCAAGC AGAGCAATCT GTCGTTCTTC
GCCTTCACCG CGACGCCCAA GCCCAAGACG CTCAACATCT TTGGCCAGAC CGGGCCCGAT
GGGGTGAAGC GCCCGTTCCA CACGTACTCC ATGCGGCAGG CGATCGCCGA GGGGTTCATC
CTGGACGTGC TGGCCAGCTA CACGACCTAC GACGTGTATT ACAAGCTGAT CAACACCGAG
CCGAGCGATC CCAGCATCGA CACCCGCAAG GGGAAGGCTG CGCTGGCCCG CCACGCCTCT
CTGCACGACC ACATGATGGA GGGCAAGGCC GAGGTGATCG TGGAGCACTT CCGCGAGAAG
ACAGCCCACA AGGTCGGTGG GCGCGCCAAA GCGATGGTGG TGACGCGGTC GCGGCTGCAC
GCACTGAAGA CCCACCAGGC GATCGAGCGG TACATCAAGA AGAACGGGTA CGACAAGGGG
CCGGGTGCGC TGTCGGCGCT GGTGGCGTTC TCCGGCTCGG TGACTGACCC CGACACCCCG
GAGGTGGCGC TGACCGAGGG GGCGGTGAAC GGGTTCCCCG AGAGTGAGCT GCCGCGGCGC
TTCCACGACG AGCACCAGGT GCTGGTGGTG GCCGAGAAGT ACCAGACCGG TTTCGACGAG
CCGCTGCTGC ACACGATGTA CGTGGACAAG AAGCTTTCGG GCGTGGCGGC GGTGCAGACG
CTGAGCCGGT TGAACCGAAC CATGCCGGGC AAGCACGACA CGTTCGTGCT GGACTTCGCG
AACTCCGCCG AGGAGATCCA GGCGGCGTTC GAGCCGTTCT ACGAGCAGAG CCTGGCCGAG
CAGGTGGACC CGCAGGCGCT GACCACCATG GAGCAGGAAC TGAAGACGTT TGGCGTGCTG
GTGGCAGAGG AGATGCAGCA GGCCGTGGCT CTCGTGCTGG ACGGGGATGA CACAGACCAA
GGGCGCATCT ACCAGCTGGT GGGGCAGGCA GCCGGGCGCT GGGTGATGCT CAACGAGCGC
GATGAGGACG CCGCGGAGAC GTTCCGCGGG ACGCTTGACG CGTTCTGCCG TGCGTACACC
TTCATTGCCC AGGTGATGCC GTGGGCCGAC CCCGAGCAGG AGCGGCTGTT CCTGTTCGGG
CGGTTGTTGC TCGCGGACCT ACCGCAGCGC GAGGGTGATT CGATGCCCCA AATCAGCAGG
TCGGTGCAGC TTTCGCACTT GCGGATTGCG GCGCAGGGCG AGACGGTTAT TGAGCTGGAG
GGCTCCGATG AGCCCGGTGA GGCGCTGCCC GGTGAAGGCA AGGGTGCTGA GGCAGAGCCG
GTGGCGGACA AGCTGTCGGC GCTCATTGCG GTGCTGAACG AGAAGTTCGG CGCCGAGCTG
GGCGATGCCG ACAGGTTGTG GTTCGAGCAG CAGACGATGG TGGTGGCCCA CGACGTGGAC
ATGCAGGATG TGGCGCGTCA CAATGACCGG AGGGCCTACA GGCTGGTGGT GGAGGAACGC
GTGGAGGACA TGCTGCTGGA CCGGGACCAG CAGAACGGGA AGTTGTTCAG CATGTTCTTC
GAGAACTTGG ACTTCCGCCG GATGATGGTG GAGCACATCG TGGGCCAGAC CTACGACAAG
GCAAGGCGCC AGCCGGCATG A
 
Protein sequence
MANTREKSFE ANVEAHLTGH GWQALAPEGY DRALGLFPDE VVAWLAESQP KAWEQLVARN 
GGEGLARQRF LTALAGDIDH RGTLAVLRSG TKHAGITVDM ATFRPASGLN ETIAARYAAN
RLAVVRQLHH SESRPADSVD VTLVLNGIPV ATAELKNPFT HQTIEHAKTQ YREDRHPNDL
IFRSRALVHY AVDPHHVAMT TRLAGQDTRF LPFDQGSNGP GHPGTAGNPQ RDGYQTAYLW
EEVWQRDAWL ELLGSFIQQR KEGSSWQVLF PRYHQWHAVR SILAATYEDG AGVDRLIQHS
AGSGKSNTIA WSAHLLSRLH GKDEAPVFDK VVVLTDRKVL DKQLQNTVAG LEHTEGTIVR
IDKDSKQLKA SLEGNAARII ITTLQKFPVV MQLAKEQAKE TGDGGRVMGQ RFAVIVDEAH
SSTSGSAFSG MKRVLKGGVD EALDDAEAAE TTDGNVPGAE EDSLLESAQR RGKQSNLSFF
AFTATPKPKT LNIFGQTGPD GVKRPFHTYS MRQAIAEGFI LDVLASYTTY DVYYKLINTE
PSDPSIDTRK GKAALARHAS LHDHMMEGKA EVIVEHFREK TAHKVGGRAK AMVVTRSRLH
ALKTHQAIER YIKKNGYDKG PGALSALVAF SGSVTDPDTP EVALTEGAVN GFPESELPRR
FHDEHQVLVV AEKYQTGFDE PLLHTMYVDK KLSGVAAVQT LSRLNRTMPG KHDTFVLDFA
NSAEEIQAAF EPFYEQSLAE QVDPQALTTM EQELKTFGVL VAEEMQQAVA LVLDGDDTDQ
GRIYQLVGQA AGRWVMLNER DEDAAETFRG TLDAFCRAYT FIAQVMPWAD PEQERLFLFG
RLLLADLPQR EGDSMPQISR SVQLSHLRIA AQGETVIELE GSDEPGEALP GEGKGAEAEP
VADKLSALIA VLNEKFGAEL GDADRLWFEQ QTMVVAHDVD MQDVARHNDR RAYRLVVEER
VEDMLLDRDQ QNGKLFSMFF ENLDFRRMMV EHIVGQTYDK ARRQPA