Gene CPR_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1231 
Symbol 
ID4204323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1383054 
End bp1384451 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content29% 
IMG OID642565787 
Productdipeptidase PepV 
Protein accessionYP_698553 
Protein GI110801474 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01887] dipeptidase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00676632 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTAA TAAGTAGCAA AATTGATGAA ATGAAAGATG ATTTAATTCA ATCAGTACAA 
AATATTATTA GAATCAAAAG TGTTGAGGAT GAACCAAAAG AAGGTATGCC TTTTGGAGAA
GGAGTTTCTA AATCTTTAGA ATATGCCTTG GAAGTTTCAA AAAAATTAGG ATTTAAAGTA
GTTAATTTAA ATGGACATGT AGGATATGCT GAATATGGTG ATGGAGAAGA ATATGTAGCA
GCTTTAGGAC ATTTAGATGT AGTTCCAGAA GGTGATGATT GGATTTATCC ACCATATGGA
GCAGAAATAC ATGATGGAAA AATATATGGA AGAGGTACAA CTGATGACAA AGGTCCAATA
ATGGCATCAT TATATGGATT AAAAGCTATA AAAGAATTAG GATTACCAAT ATCAAAGAAA
ATAAGAATAA TATTTGGTAC TAATGAAGAG ACTGGAAGCA AGGATATAGA ATACTATTTA
GAACATGAAA AGGCTCCAGT TTTAGGGTTT ACTCCTGACG CAGAATTCCC TATAATAAAT
GGAGAAAAGG GTATCACTAT TTTTGATATT GTTAAAAATT TTGGAGAAAA AACAACTGAT
GGAGATGTAT TAGTTGAGAG TATAAAGGGT GGAATAGCAT CAAATGTAGT TGCTAGTCTT
TGTGAAACAA AATTAAGAGC TAAGGAAGCA AATAAGGTTT GTGAAGAAAT ATCAAAATTT
GCTAAAGAAA GTAATATAAA GTTTGAAGTA TCACATAAAA ATGATGAAAT AGAGTTAAAA
GTATTTGGTG TCTCAGCTCA TGGTAGTACT CCAGAAAAGG GTATAAACGC TATAATGCAA
ACTATTAAAA TTTTATCAGA GTTAAATTTA GCACAAGAAG ATATAAAAGA TTTTATTAAA
TTCTTAAATG ATAACATAGG TGAAGATGTC TATGGAGAAA AATTCGGAAT TTTATTACAA
GATGAAGCAT CAGGAAAATT AAGCTTTAAT GTTGGAGTAA TTGACTTAAA TGATAAAGTA
GGTAGGCTTA CTTTAAACTT AAGATATCCT GTAACTAAGA CTTTAGATGA TATGATGATT
CCTTTTAATG AAAGAATAAA AGGTACTTGT ATAGAAATAG AAAACTTTAA ACATCAAAAA
CCATTATATT TCTCTCCAGA TCATCCTTTA ATAAAGACTT TAAAGAGTGT ATATAAAGAA
GAGACAGGAA CAGAAGGAGA ATTAATTTCT ATTGGAGGAG GAACATATGC TAAGGAAATG
CCTAATATAG TAGCATTTGG ACCAATATTC CCTGGAGAAC CAGATGTTAT TCATAAACCA
AATGAATATA TTAAAATAGA CGACTTAATT TTAATAAGTA AAATATATGC AAAAGCTTTA
TATCAATTAG CTAAATAA
 
Protein sequence
MNLISSKIDE MKDDLIQSVQ NIIRIKSVED EPKEGMPFGE GVSKSLEYAL EVSKKLGFKV 
VNLNGHVGYA EYGDGEEYVA ALGHLDVVPE GDDWIYPPYG AEIHDGKIYG RGTTDDKGPI
MASLYGLKAI KELGLPISKK IRIIFGTNEE TGSKDIEYYL EHEKAPVLGF TPDAEFPIIN
GEKGITIFDI VKNFGEKTTD GDVLVESIKG GIASNVVASL CETKLRAKEA NKVCEEISKF
AKESNIKFEV SHKNDEIELK VFGVSAHGST PEKGINAIMQ TIKILSELNL AQEDIKDFIK
FLNDNIGEDV YGEKFGILLQ DEASGKLSFN VGVIDLNDKV GRLTLNLRYP VTKTLDDMMI
PFNERIKGTC IEIENFKHQK PLYFSPDHPL IKTLKSVYKE ETGTEGELIS IGGGTYAKEM
PNIVAFGPIF PGEPDVIHKP NEYIKIDDLI LISKIYAKAL YQLAK