Gene CPF_0362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0362 
Symbol 
ID4201617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp434536 
End bp437700 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content27% 
IMG OID638081246 
Producttype III restriction-modification system, res subunit 
Protein accessionYP_694819 
Protein GI110800765 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.262363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGA GTGTTAAGTT TAAGTTTGAA AAGAATTTGC CTCATCAAGA TAAAGCAATT 
GATTCAGTAG TTTCTCTTTT TAGTGATGTT GATAAAAGTA TAGATAATTC TGTTTATTCT
GGAACTATAA ATGCAATTAG AGAAAAGTTG CAAGGAGAAA TTGTTGATAG AAGAAATAAT
CAAATTTCTA ATGGGAAAAA GTTAAGAGAT AATCTAAGAA AGATTCAATC TAATAATGGA
TTAATCTTAA ATGATGAGGT AGAAACAGAT AATAATACAT TAACCTTCAC CATAGAAATG
GAAACAGGTA CAGGAAAAAC CTATGTTTAT TTAAAGACTA TTTTAGAGTT ATATAAGAAG
TATGATGGAA AGTTTAAAAA GTTTATTATA GTAGTTCCTA CTGTTCCTAT TAGAATGGGA
GTAGAAAAGA GTATAGAAAT GCTTAATGAA CATTTTTCAG AACAATTTGA TGGACTTGAT
ATATCAAAAC ATGTTTTCAT TTATAATTCT AAACTTAAAG ATGTTGCAGA AAGTGTTAGA
AGGAAATTTA TTGATAGTCA GGATTTAAGT ATTTTAGTTA TGAATACTCA AGCTTTTAAT
AAGGATACTA ATATTCTTAG AAATACAGAT CATGAGAAAA ATACTGAAAA TATAAGTGTT
TGGGATGAAA TAAGACAACT TCATCCAGTT GTTATAATTG ATGAACCACA AAAATTTGAT
GGGGGTAAGA ATAGTAAGAA AAGAACATCT TCTTTATCAG CTATAGATGA GATTCAACCT
ACATTTACAC TAAGATATTC AGCTACTCAT AATGAGATTA TAAATCCCAC TTATAAATTG
GATTCATATG CTGCATATAG AGACAAGCTT GTTAAAAGAA TAAAAGTTAA AACTGTTAAT
AAACTTGTTC CAACAGATTT TCCTTATATA AGATATTTAG AATTTACCAA GGATTTATAT
GCAAAAATTG AAATTTTTTA TGCTGAGCAG GGAAAAGAAA TTACTAAGAA AACTTTTAAG
GTTAGACAAG ATAGTAATGG AAATATATTT GAGTTATCAG GAGATTTACC TCAATATAGA
AATTTTAGAA TAGCTGAGAA TCCTTTTAAA GGAAAGAGTT TAAAAATAGA AACTTCAGAT
AGACTGGTTG AAATAAATGA AGGGGATTGT TTAAGTCCAT TTTCAGAAAA GGAAAATGTA
AGAATTCAAA TGAAAATAGC CATACAATCT CATTTAGAAC AACAATTTAA TCTTCTTGAA
AGTGGGCAGA AAATAAAAGC TTTAAGTTTA TTCTTTATAG ATAGGGTAAG TAAGGTAAGA
GGAGAAGATG GAGAGGATGG AGAATATTTA AAAATATTTG ATTCAGTTTA TAATGAAGTA
ATTTCTGACC CTACTTACAA TAAGATTTTT GAAAAGTATC CAGATTATTT TAAAGAATAT
AAGGACACTA AGAAAGTAAG ACAAGGATAT TTTGCTATTG ATAAGAAAAA AGGAAGTACA
ATAGTTAAAG AAATTGAGGG ATGGAATGAG GATGGAAATG ATTCTTCAAT TTCTTTAAAA
GCTAAGGATA AAGAATATAT TGAAAGAGGT ATTGAATTGA TTTTAGAGAA AAAGGATGGA
TTAATTTCTT TTGAAGAGCC TCTTGCATTT ATATTTTCTC ACTCAGCACT TAGAGAGGGG
TGGGATAATC CTAATGTATT TACTCTTTGT ACCCTAAAAA ATAGTAGCAA TTCAATAGCT
AAGAAGCAAG AAATAGGAAG GGGATTAAGA CTTCCTGTTG ATACAGAGGG GAATAGATGC
AAGGATGAAA GTTTAAATGT TCTAACTGTG GTGGCTAATG ATAGTTATGA TCACTTTAGT
GAAAAACTTC AACAAAGCTA TGATGAAGAA TCAGGATTTA ACAAGGATGA AGTAACTAAT
GATGTTATAA ATATAGTATT CAAAAAAGCA GGGATTCCAA TAGAAAAAAT AACATCAGAA
TTAACAACTG CTTTTAGAAA TGAATTAAGA GAAAATGGAC TTATAGATAA AAATGATATT
CTTAAAAAGG ATGCTGAAGA AAAGTTTAGT GAAATTCAGT TTAAAAATGA AACTTTAAAA
GAACACTCAG CTAATATTAG AAAAGAATTT GTAAACCAAA TGAAAAGCAA AGGCTCTAAG
AAAATTAATA TTGAAAATGG TGATTTAACC CCTGTTAAAA ATAAAATGCA ATCATATGTT
AAGGAAGCTG ATTTTAAAAG GCTATTAAAA GGAATTAATG AAAGATTAGC TCAAAAAACT
ATATATAAAA TTAATATTAA TAAAGATAAT TTCATAGAAG ATTGCATTAA AGAAATAAAT
AATACTCTTC ATTGTAAGGA AATAAGCAAT ATAGTTGAGC ATAGTGAAGG TATGCATTAT
ATTGATGAAA ATAATAAAGC TAAATTTGAA AAGTCTACTG AATTTATAAC AGAAGAAGTA
GTTGAAAATA TCCAAGTTAA AAGTGATTTT GAAATCATAA ATTATATAAT GGTTGGTACA
GAATTGCCGA GAATGACTAT AAGTAAAATA TTAAAAGAAA TTAAAAATAG AAAAATATTA
AATAGTCAAG ATTATTTAGA TGATGTAATG ATGATAATAA AAGAAATATT CATAAGACAC
CAAGTAAAAG ATAAAATATC TTATGAACTT TTAAATGGGT ATAAGTTTGA TAGTAAGACT
ATATTTTCAA TAGATGAGAT TAATGCAACA GAATTAAATG ACCCCAAAAT TATAACTTAT
AAAGCAAATA GCTTAAAAAG AAAGGCTATT AATGAGTATT ATAAATTTGA TAATGAAAAA
GAATTAGAAT TTGCACAATC CTTAGAAAAT GATCCTAATA TATTACTCTT TACTAAGATT
AAAAAGGGAG GGTTTGTAAT ATTAACTCCT TATGGAAACT ATTCTCCAGA CTGGGCTGTA
GTCTACAAAA ATAATTATGG TAAAGCAGAA ATTTATTTTA TAGTTGAAAC AAAATTTGAT
AAAAAGGAAG CAGATTTATC GGAAGTAGAA AAATTCAAAA TACATTGTGG AAAAGAGCAT
TTTAAAGTAG TATCAAAAAA TAGTGAAGAC TATGTTAAGT TTGATTGGGC AAATTCATAT
GAACAATTTA AAGAAGAATC TTCAAAGGTT AAAGTAATGC AGTAG
 
Protein sequence
MSKSVKFKFE KNLPHQDKAI DSVVSLFSDV DKSIDNSVYS GTINAIREKL QGEIVDRRNN 
QISNGKKLRD NLRKIQSNNG LILNDEVETD NNTLTFTIEM ETGTGKTYVY LKTILELYKK
YDGKFKKFII VVPTVPIRMG VEKSIEMLNE HFSEQFDGLD ISKHVFIYNS KLKDVAESVR
RKFIDSQDLS ILVMNTQAFN KDTNILRNTD HEKNTENISV WDEIRQLHPV VIIDEPQKFD
GGKNSKKRTS SLSAIDEIQP TFTLRYSATH NEIINPTYKL DSYAAYRDKL VKRIKVKTVN
KLVPTDFPYI RYLEFTKDLY AKIEIFYAEQ GKEITKKTFK VRQDSNGNIF ELSGDLPQYR
NFRIAENPFK GKSLKIETSD RLVEINEGDC LSPFSEKENV RIQMKIAIQS HLEQQFNLLE
SGQKIKALSL FFIDRVSKVR GEDGEDGEYL KIFDSVYNEV ISDPTYNKIF EKYPDYFKEY
KDTKKVRQGY FAIDKKKGST IVKEIEGWNE DGNDSSISLK AKDKEYIERG IELILEKKDG
LISFEEPLAF IFSHSALREG WDNPNVFTLC TLKNSSNSIA KKQEIGRGLR LPVDTEGNRC
KDESLNVLTV VANDSYDHFS EKLQQSYDEE SGFNKDEVTN DVINIVFKKA GIPIEKITSE
LTTAFRNELR ENGLIDKNDI LKKDAEEKFS EIQFKNETLK EHSANIRKEF VNQMKSKGSK
KINIENGDLT PVKNKMQSYV KEADFKRLLK GINERLAQKT IYKININKDN FIEDCIKEIN
NTLHCKEISN IVEHSEGMHY IDENNKAKFE KSTEFITEEV VENIQVKSDF EIINYIMVGT
ELPRMTISKI LKEIKNRKIL NSQDYLDDVM MIIKEIFIRH QVKDKISYEL LNGYKFDSKT
IFSIDEINAT ELNDPKIITY KANSLKRKAI NEYYKFDNEK ELEFAQSLEN DPNILLFTKI
KKGGFVILTP YGNYSPDWAV VYKNNYGKAE IYFIVETKFD KKEADLSEVE KFKIHCGKEH
FKVVSKNSED YVKFDWANSY EQFKEESSKV KVMQ