Gene CPF_0840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0840 
SymbolcloSI 
ID4201384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp993403 
End bp994971 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content30% 
IMG OID638081724 
Productalpha-clostripain 
Protein accessionYP_695291 
Protein GI110798971 
COG category 
COG ID 
TIGRFAM ID[TIGR02806] clostripain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.832986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAGA AAAAGTTATC ATTATTGATG GCTACTATTA CCATTGGTAG CGTTCTATTA 
GGAGGAGTTT CAACAGTAAG TGCTGCTCCA AGACAAAAGC ATAAAACTGT ATCAGAGAAA
ATTAAAGAGG CCGAAAAGAC TGAAGGTGAT AAAAAACTAA CGGTTATGGT ATATGCTGAT
TGTGATAATA ACCTAGAAGA GTACATACTT AATGATATTG AAGAAATGAA AGAGGGTTAT
AAAAATAATC CTAATCTAAA TATTATAGTA TTAGTTGATA GAATTCCTGG TTATTCTAAT
GATTCAAAAG TGTTAGGTTC TAATTTTGAA GACACAAGAC TTTATAAAAT AGGTGAAAAT
TCTGCTGAAA GAATTAGTGG GAAAAGTGAA TTTCCAGAGA TAACTACTAC AAGTAATTAT
GAAGCTAACA TGGGAGATGC TAATACTTTA AAAAAATTCA TTAAATTCTG TAAAAAGAAC
TATGAAGCTG ATAAGTATAT GTTAATCATG TCTAACCATG GTGGTGGGGC TAAGGATGAT
AAAGATAGAG CTAGTACTGT AAATAAGGCA ATATGCTGGG ATGATAGTAA CAACAAAGAT
TGTCTTTATA CTGGTGAAAT TTCAGATGTT TTAACTAAGG ATGAATCTGT TGATGTTTTA
GTATTTGATG CTTGTCTAAT GGGAAATTCA GAGGTAGCTT ATCAATATAG ACCTAATAAT
GGAAGTTTTG AAGCTAAAAC ACTTGTAGCA TCAGCACCAG TTGTTTGGGG TTTTGGATAT
CCTTATGACA AAATATTTTC TCGTTTAAGA TCTACTAAAG GTGACAATGG AGAAGTTGAT
TCTACTTTAG GAGGAAAAGA AAAAATATTT GATCCATCTA CAGTTACAAA TAATGAACTT
GGTGCCTTAT TTGTAGAAGA GCAGCGTGAT TCTGTAAATA GCTGTGGAGT AACAGATCAA
CAACTAAGTT GCTACGATCT TTCAAAAATT GAAAAAGTTA AAAAATCAGT TGATACTTTA
GCTAGAAATT TAAGTAAAAA TAATAAAAAA GATGCTATTG AAAGCTTAAG AGGTACAGGA
AAAAATGCAC CAACTATGCA CTATTTTAAG AATTATGATG AATATGAATG GATTGAATAT
CCATACTTTG ATTTATATGA TTTATGTGAA AAAATTAGTT TAAGTGATGA ATTTAATGAA
ACTACTAAAA AATTATCAAA GAATGTTATG AAAAATGTTG ACCAATTAAT TTTATATTCA
TTTGCAGGTA ATGACTTTAA AGGCTTCAAA GAAGGGAAAA ATGGTATAAG TATTTTCCTT
CCTGATGGTA ATAGAAATTA TTATGATCAA TATTCTGGAC AAGCGATACC ACATTGGGCT
ATTCAAAGAT GGTACAATCC TTTAGACACT AATGCTTATA GATTAAGAAG TGGATATGGA
AAACTATCAT GGTGTAAAGA TGGATTAGAT CCAAAAATAA ATAAAGTTGG TAACTGGTTT
GAACTTTTAG ATTCTTGGTT CGATAAAGAT AATACTTCAC TTGGTGGATA TAATAGATAT
AGATATTAA
 
Protein sequence
MFKKKLSLLM ATITIGSVLL GGVSTVSAAP RQKHKTVSEK IKEAEKTEGD KKLTVMVYAD 
CDNNLEEYIL NDIEEMKEGY KNNPNLNIIV LVDRIPGYSN DSKVLGSNFE DTRLYKIGEN
SAERISGKSE FPEITTTSNY EANMGDANTL KKFIKFCKKN YEADKYMLIM SNHGGGAKDD
KDRASTVNKA ICWDDSNNKD CLYTGEISDV LTKDESVDVL VFDACLMGNS EVAYQYRPNN
GSFEAKTLVA SAPVVWGFGY PYDKIFSRLR STKGDNGEVD STLGGKEKIF DPSTVTNNEL
GALFVEEQRD SVNSCGVTDQ QLSCYDLSKI EKVKKSVDTL ARNLSKNNKK DAIESLRGTG
KNAPTMHYFK NYDEYEWIEY PYFDLYDLCE KISLSDEFNE TTKKLSKNVM KNVDQLILYS
FAGNDFKGFK EGKNGISIFL PDGNRNYYDQ YSGQAIPHWA IQRWYNPLDT NAYRLRSGYG
KLSWCKDGLD PKINKVGNWF ELLDSWFDKD NTSLGGYNRY RY