Gene CPF_2596 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2596 
SymbolhsdR 
ID4202108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2860476 
End bp2863655 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content29% 
IMG OID638083463 
Producttype I restriction-modification system, R subunit 
Protein accessionYP_696986 
Protein GI110800925 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATACT TAGGTAATGA AGAGACTTTG GTTGAATTAC CAGCTGTTGA TTATTTAGAG 
AAAAATTTAG GCTATTCTTT TATTCATGGT AAGGATTTAA CTCCTGAAAG TGGGGAAAGA
GATTCTTTAT CTGATGTTGT TTTGATTAAT AGATTAAGGG ATGCTTTAAA AAGATTAAAT
CCATGGATGA ATGAAGAAAA TTTAGATAGA GCAATTAGAT ATATTTCTAG AGCAGATAAT
CTAGGAAGTG GTTTATTAGA GATAAATGAA AAGATATATG ATGCATTAGT AGATTTAACT
TTTACTGTTG AACAGGATTT ATTTGGGAAT GGTCAGAAAA AACCACAGAC AGTTCATTTT
ATAGATTGGA ATGATGTTGA TAATAATGAT TTTCTTGTCG TTAGACAGTT TGAAGTTCAA
ACTCTTTCAG GAAAGTCTAT ATTTCCTGAC ATAGTTATTT TTATTAATGG TATTCCAGTA
GTTGTTTTAG AAGCTAAATC ACCTTTTTTA GAAAAAGGAA ACAATGAATG TATTGGAAAA
AAACAAGCTT ATGAACAGTT AAGAAGATAT ATGAATGCTA GAGATGAGTC TTTAGGAGAA
GGAGCTCCAA AGCTATTCTA TACTAACTTT TTTACTGGAA TTTTAAATAG ATATAATGCT
TATGTTGGCA CTATAAGTTC AAGTTATAAT TACTATTTAG AATGGAAAGA CCCATATCCA
TTTAAGCTTG AAGAAGTTGA AGATTATAAA AATTGTGGAC AAAATATACT CTTACAAGGA
TTTTTAGAAA AGAAAAATCT TTTAGATTTA ATGAGAAATT TTATAGTTTT TGAGGCAGAA
GATGGAGTTG TAATTAAAAA GGTTTGTAGA TATCAACAAT TTAGAGCTGT GAACAAAGCT
CTTAATAAAA TTAAAAATGG AAAAGATAAG ATTTCAAGAG GTGGAGTTGT TTGGCATACT
CAGGGGTCAG GTAAATCTTT AACTATGGTT TTCTTAGCTA GAAAGATTAA AAGAACACAA
GGATTAACAG ATTCTACAAT AGTTATAGTA ACAGATAGAA TTGACCTTGA TAAGCAAATA
GCAGGAACTT TTGAAAGAAC TTTAGGTAAA ATAACTACTC CTGTTAGAGC TGATACTATT
GATAAAATGA AAAAATTATT ATCAAATCCT CAACCACAAA TTATAATGAC TACTATTCAG
AAATTTCAAA GTGAAACTGA AGAAAAGGAA GTTATGCTTG ATGGAGAAAA TTTAACTCAA
AAATATGCTG TTGAGTACCC TGTTTTAAGC ACAAAACAAA ATATTATTGT ATTGGCTGAT
GAGGCTCATA GAAGTCAATA TAAGGACACC GCAGCTAATA TGAGAAAGGC ATTGCCTAAT
GCTGTTTTTA TCGGATTTAC AGGTACCCCA ATAGATAAAG AGGATAAATC AACACCAAGA
ACTTTTGGAG GATATATAGA TAAATATTCT ATTAAACAAG CAGTAGATGA TGGTGCTACT
GTTAAGATTG TTTATGAAGG TAGAAGACCA GACTTACAAG TAATAGGAGA ATCTTTAGAA
GAATTATTTG ATGAAGCTTT TTCAGATAGA ACTGATGAAG AAAAAGAAGC TATTAAACAA
AAATATGCCA ATAAAAAAAC TGTTGTTGAA TCAGAAGATA GGATTGATGA AATTGCTAAA
GATTTATTAA AACACTATAA AGAACAAATA CTTCCTAATG GATTTAAAGC ACAGATTGTA
TGTGTATCAA GAGAAGCTTG TGTAAAATAT TATGATGCTT TAAATAGACA TATGAAAGAA
ATATTAGGTG AAGGATTTGA AGCTCAAGTG ATATTCTCAG GAGATAATAA TGATAAACCT
CACTTAAAAA AGCATTTTAC TACAAAATCA GAGCAAGAAA AGTTAATAAA AAGATTTAAG
AAACCTATAA ATAAGGACAA GTTAACATTT TTAATAGTCA AAGATATGTT GTTAACAGGA
TTTGACGCAC CTATTGAACA AGTTATGTAT CTTGATAGAC CACTAAAAGA ACATACTTTA
CTTCAAGCTA TTGCAAGGGT TAATAGAACA TCAACAAGAG AAGTTGAGAG ACAACTTGAA
AATGGAGAAA TAGTAAAAGA AAATATTACA AAGCAATGTG GATACATAGT AGATTATTAT
GGTATTTCAA ATTACTTAGA AGATGCTTTA GCTATATTTG ATAAAGAAGA GCTGGGTAAA
CCTATGGAAT CTTTAGATGA TTTATATAAT GAAATGTTAT CTTATAGAGA ATCTATAATG
TCTATGTTTA GAGGTGTGGA TAAAAATAAT TTAGATGCTC TTGTGAATAA AATTGAGCCA
GAAGATAAAA GAGCTGAATT TGAATTAATG TATAGAAAAT TCTCAGGGGC AGTTGAAAGT
CTTTTACCAT CACATGTTGA TACAGAAATA TTAAATGATT TGAAATGGCT ATCATATATT
AGAGCAGCTG CTAAAGCTAA GTTTAGTCCA AGTGAATCAA TAGATATAGC TGATTGTGGA
GAAAAAGTTA GAGAAATAAT AGAAACTCAT TTAAAATCAC TAGGAGTTAG AAGTTGGATT
GAACCAATAA CATTATTTGA AAAAGATTTT AAAGATAAGA TAAACACTCT AAAATCAGAT
GAAGCTCAAG CATCTGCTAT GGAACATGCA ATTAAGCATA CAATTAATAT TAGAAGAAAA
GAAAATCCAG TTTATTATGC TTCTCTATTG GAAAGACTTC AAAAGATATT AGATGAAACA
AAAAATGATT GGATTGAAAG AAAAATAAGA TTAAATAAGT TTATTAAGAA TCATGTTGAA
GATGGAGCTG CTAATGAGGC TAGTGATTTA GGACTAGATG AGAAAGAGTT TGCGTTCTTT
AAAGTAGTAA AAAAATACTT AGAAGATGGT GGAGAAGAAT TCATAGCTAA GGAAGAAAAA
GCATGTTATA TATCAGATGA AACAGTAGAA CTTTCAAAGC AAATTGCTAA AGAAGTTAAA
GAAATTGCTG AAAATGCAGG AATCGATTGG GTTACAACTC CGTATAAAAC TAATAATGTT
GAAAGAGAAA TAAAGTTAAT GCTAATTAGA AAATATGCTA AGAAAATTCC TAGAAATATT
AGAGAAAAAT TAATGGAACC ACTTCTAAAT TTAGCCAAGA TACATTTTGA TATAGTTTAA
 
Protein sequence
MSYLGNEETL VELPAVDYLE KNLGYSFIHG KDLTPESGER DSLSDVVLIN RLRDALKRLN 
PWMNEENLDR AIRYISRADN LGSGLLEINE KIYDALVDLT FTVEQDLFGN GQKKPQTVHF
IDWNDVDNND FLVVRQFEVQ TLSGKSIFPD IVIFINGIPV VVLEAKSPFL EKGNNECIGK
KQAYEQLRRY MNARDESLGE GAPKLFYTNF FTGILNRYNA YVGTISSSYN YYLEWKDPYP
FKLEEVEDYK NCGQNILLQG FLEKKNLLDL MRNFIVFEAE DGVVIKKVCR YQQFRAVNKA
LNKIKNGKDK ISRGGVVWHT QGSGKSLTMV FLARKIKRTQ GLTDSTIVIV TDRIDLDKQI
AGTFERTLGK ITTPVRADTI DKMKKLLSNP QPQIIMTTIQ KFQSETEEKE VMLDGENLTQ
KYAVEYPVLS TKQNIIVLAD EAHRSQYKDT AANMRKALPN AVFIGFTGTP IDKEDKSTPR
TFGGYIDKYS IKQAVDDGAT VKIVYEGRRP DLQVIGESLE ELFDEAFSDR TDEEKEAIKQ
KYANKKTVVE SEDRIDEIAK DLLKHYKEQI LPNGFKAQIV CVSREACVKY YDALNRHMKE
ILGEGFEAQV IFSGDNNDKP HLKKHFTTKS EQEKLIKRFK KPINKDKLTF LIVKDMLLTG
FDAPIEQVMY LDRPLKEHTL LQAIARVNRT STREVERQLE NGEIVKENIT KQCGYIVDYY
GISNYLEDAL AIFDKEELGK PMESLDDLYN EMLSYRESIM SMFRGVDKNN LDALVNKIEP
EDKRAEFELM YRKFSGAVES LLPSHVDTEI LNDLKWLSYI RAAAKAKFSP SESIDIADCG
EKVREIIETH LKSLGVRSWI EPITLFEKDF KDKINTLKSD EAQASAMEHA IKHTINIRRK
ENPVYYASLL ERLQKILDET KNDWIERKIR LNKFIKNHVE DGAANEASDL GLDEKEFAFF
KVVKKYLEDG GEEFIAKEEK ACYISDETVE LSKQIAKEVK EIAENAGIDW VTTPYKTNNV
EREIKLMLIR KYAKKIPRNI REKLMEPLLN LAKIHFDIV