Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1016 |
Symbol | |
ID | 4201602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 1166971 |
End bp | 1168458 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 638081897 |
Product | putative type II restriction endonuclease |
Protein accession | YP_695462 |
Protein GI | 110800750 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3066] DNA mismatch repair protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000561814 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAACAA AATTATATGA TGAAACCAAT CCTATAAGTA TAGAAAATTA TGCTCAAAAA CTTATAGGCA AAACATTTAA TGATGTTTTA AACGATTATA CAAAATATGA ATCTGAACTT TTAATCAGTG AAGAAAAAGA AGAATATGCA GAAAATCATG AAAACAAAAA AAGAAAAGGT GGATTAGGTG ACCTTATAGA AGAATGTTAC TTCCATTATA AATGTAATAA TGATTCTAGA CCTGATTTTC CTGATGCAGG GGTAGAACTT AAAGTAACAC CATATAAAAT AAACAAAAAT AAAACTCTTT CAGCTAAGGA ACGTTTAATA ATAACTATGA TTGATTACTT CAAAGTTATT GAAGAAAGCT TTGAAGATAG TCACCTTTGG AATAAGTCTC AACTTATATT ACTTATTTAC TACCTATATT CTAAGGATAT AGGAAATAGA TTAGATTATA AAATAAACTA TGCTAAGCTA TTTACTCCAC CTGAAGAAGA TTTAGAAATA ATAAAAAATG ACTTCAAAAT AATTGTAGAT AAAATAAAAG ATGGTAAAGC TCATGAACTT TCAGAAGGAG ATACAATGTA TCTTGGTGCT GCTACTAAAG CATCCTCTTC TTCTGATAGA CGTGAACAAC CTTTTTCAAA TATTCTTGCT AAACCTAGAG CCTTTTCATT TAAAGCCTCT TATATGACTT ATGTTCTTAA TAACTTCATA GTTCCAAATA AGACAACCTA TGAACCAATT ATTAAAGATG CTAATGAACT TAAATTTAAT ACTTTTGAAG AAATAATAAT AGATAAAATC AACTCCTATG CAGGAAAAAC TGATGGGGAG CTTTGCAAAA TATTTGATAA AGAATATAAA AACAATAAGG CTCAATGGAT TGACCTAGCC TACCGTATGC TAGGAATAAA ATCTAACAAA GCAGAAGAAT TTGAAAAAGC TAATATAACC GTAAAAGCAC TGCGCATTGA GTCTAAAGAT AAAATAGTTG AAAGCAGTCC CCTACCTACA TTTAAATTTA AAAAACTTGT AGAAGAAACT TGGGAAGAAT CAAAATTATT TAACTATCTT GATCAGCAAA AATTCTTATT TGTAGTTTAT AAAAAAGATG GTGATAAATA TGTTCTAAAA GGAGCTCAGC TTTGGAACAT ACCTTATGAT GATTTAAACA CTACTGTACG CGAAGGTTGG GAAAACATAA GAAACGTAAT AATAGATGGA GTAAAATTCA CTCCTAAAAC TGATAAAAAT GGCAAAGTAA TTTATAGCAA TAACTTACCT AATAAAGAAT CAAATAGAGT TATTCATATA AGACCTCACG CGCAAAAAAG TGCTTATAGA TTTGAAGATG GTACTGAAAT TGGAAATGTT TCAAGGGATG CTAATGAGTT ACCAGATGGA AGATATATGA CAAACCAAAG CTTTTGGCTA AATAATACTT ATGTTTTAAG TCAGTTGAAT AAAAATTTAC TGGATTAA
|
Protein sequence | MSTKLYDETN PISIENYAQK LIGKTFNDVL NDYTKYESEL LISEEKEEYA ENHENKKRKG GLGDLIEECY FHYKCNNDSR PDFPDAGVEL KVTPYKINKN KTLSAKERLI ITMIDYFKVI EESFEDSHLW NKSQLILLIY YLYSKDIGNR LDYKINYAKL FTPPEEDLEI IKNDFKIIVD KIKDGKAHEL SEGDTMYLGA ATKASSSSDR REQPFSNILA KPRAFSFKAS YMTYVLNNFI VPNKTTYEPI IKDANELKFN TFEEIIIDKI NSYAGKTDGE LCKIFDKEYK NNKAQWIDLA YRMLGIKSNK AEEFEKANIT VKALRIESKD KIVESSPLPT FKFKKLVEET WEESKLFNYL DQQKFLFVVY KKDGDKYVLK GAQLWNIPYD DLNTTVREGW ENIRNVIIDG VKFTPKTDKN GKVIYSNNLP NKESNRVIHI RPHAQKSAYR FEDGTEIGNV SRDANELPDG RYMTNQSFWL NNTYVLSQLN KNLLD
|
| |