Gene CPF_1016 details


Gene Information

Locus tag: CPF_1016
Symbol:
ID: 4201602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1166971
End bp: 1168458
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 29%
IMG OID: 638081897
Product: putative type II restriction endonuclease
Protein accession: YP_695462
Protein GI: 110800750
COG category: [L] Replication, recombination and repair
COG ID: [COG3066] DNA mismatch repair protein
TIGRFAM ID:
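
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, which the short Python sketch below checks. It assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1166971, 1168458)
print(length)                     # 1488
print(protein_length_aa(length))  # 495
```

Both values agree with the Gene Length and Protein Length fields listed in the record.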


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000561814
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAACAA AATTATATGA TGAAACCAAT CCTATAAGTA TAGAAAATTA TGCTCAAAAA 
CTTATAGGCA AAACATTTAA TGATGTTTTA AACGATTATA CAAAATATGA ATCTGAACTT
TTAATCAGTG AAGAAAAAGA AGAATATGCA GAAAATCATG AAAACAAAAA AAGAAAAGGT
GGATTAGGTG ACCTTATAGA AGAATGTTAC TTCCATTATA AATGTAATAA TGATTCTAGA
CCTGATTTTC CTGATGCAGG GGTAGAACTT AAAGTAACAC CATATAAAAT AAACAAAAAT
AAAACTCTTT CAGCTAAGGA ACGTTTAATA ATAACTATGA TTGATTACTT CAAAGTTATT
GAAGAAAGCT TTGAAGATAG TCACCTTTGG AATAAGTCTC AACTTATATT ACTTATTTAC
TACCTATATT CTAAGGATAT AGGAAATAGA TTAGATTATA AAATAAACTA TGCTAAGCTA
TTTACTCCAC CTGAAGAAGA TTTAGAAATA ATAAAAAATG ACTTCAAAAT AATTGTAGAT
AAAATAAAAG ATGGTAAAGC TCATGAACTT TCAGAAGGAG ATACAATGTA TCTTGGTGCT
GCTACTAAAG CATCCTCTTC TTCTGATAGA CGTGAACAAC CTTTTTCAAA TATTCTTGCT
AAACCTAGAG CCTTTTCATT TAAAGCCTCT TATATGACTT ATGTTCTTAA TAACTTCATA
GTTCCAAATA AGACAACCTA TGAACCAATT ATTAAAGATG CTAATGAACT TAAATTTAAT
ACTTTTGAAG AAATAATAAT AGATAAAATC AACTCCTATG CAGGAAAAAC TGATGGGGAG
CTTTGCAAAA TATTTGATAA AGAATATAAA AACAATAAGG CTCAATGGAT TGACCTAGCC
TACCGTATGC TAGGAATAAA ATCTAACAAA GCAGAAGAAT TTGAAAAAGC TAATATAACC
GTAAAAGCAC TGCGCATTGA GTCTAAAGAT AAAATAGTTG AAAGCAGTCC CCTACCTACA
TTTAAATTTA AAAAACTTGT AGAAGAAACT TGGGAAGAAT CAAAATTATT TAACTATCTT
GATCAGCAAA AATTCTTATT TGTAGTTTAT AAAAAAGATG GTGATAAATA TGTTCTAAAA
GGAGCTCAGC TTTGGAACAT ACCTTATGAT GATTTAAACA CTACTGTACG CGAAGGTTGG
GAAAACATAA GAAACGTAAT AATAGATGGA GTAAAATTCA CTCCTAAAAC TGATAAAAAT
GGCAAAGTAA TTTATAGCAA TAACTTACCT AATAAAGAAT CAAATAGAGT TATTCATATA
AGACCTCACG CGCAAAAAAG TGCTTATAGA TTTGAAGATG GTACTGAAAT TGGAAATGTT
TCAAGGGATG CTAATGAGTT ACCAGATGGA AGATATATGA CAAACCAAAG CTTTTGGCTA
AATAATACTT ATGTTTTAAG TCAGTTGAAT AAAAATTTAC TGGATTAA
 
Protein sequence
MSTKLYDETN PISIENYAQK LIGKTFNDVL NDYTKYESEL LISEEKEEYA ENHENKKRKG 
GLGDLIEECY FHYKCNNDSR PDFPDAGVEL KVTPYKINKN KTLSAKERLI ITMIDYFKVI
EESFEDSHLW NKSQLILLIY YLYSKDIGNR LDYKINYAKL FTPPEEDLEI IKNDFKIIVD
KIKDGKAHEL SEGDTMYLGA ATKASSSSDR REQPFSNILA KPRAFSFKAS YMTYVLNNFI
VPNKTTYEPI IKDANELKFN TFEEIIIDKI NSYAGKTDGE LCKIFDKEYK NNKAQWIDLA
YRMLGIKSNK AEEFEKANIT VKALRIESKD KIVESSPLPT FKFKKLVEET WEESKLFNYL
DQQKFLFVVY KKDGDKYVLK GAQLWNIPYD DLNTTVREGW ENIRNVIIDG VKFTPKTDKN
GKVIYSNNLP NKESNRVIHI RPHAQKSAYR FEDGTEIGNV SRDANELPDG RYMTNQSFWL
NNTYVLSQLN KNLLD
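
The gene and protein sequences are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following Python sketch shows one way to strip the layout, compute GC content, and translate the coding sequence. It assumes the gene sequence is given in the coding orientation starting at the first codon, and it uses the fact that NCBI translation table 11 shares its codon-to-amino-acid assignments with the standard code (the tables differ in permitted start codons, which this sketch does not handle). All function names are hypothetical helpers, not taken from the source database.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (first, second, third base); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = (
    "ATGTCAACAA AATTATATGA TGAAACCAAT "
    "CCTATAAGTA TAGAAAATTA TGCTCAAAAA"
)
print(translate(clean_sequence(first_line)))  # MSTKLYDETNPISIENYAQK
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.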