Gene CPR_2216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2216 
SymbolpepD 
ID4205672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2448639 
End bp2450090 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content30% 
IMG OID642566768 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_699513 
Protein GI110802619 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0150704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTAT TAGAAAACCT AGAGCCTAAA AGTGTATTTA GATTTTTTGA AGATTTAACA 
AGAATACCAC ATGATTCTGG AAATGAAAAA GAACTTAGTG ATTATCTTGT TAAGTTTGCA
AAGGATAGAA ATCTTGAAGT TATTCAGGAT GAGGCACTAA ATGTAATAAT TAAAAAGCCT
GCGACTAAGG GGTATGAAAA TGTACCAGGA GTTATAATTC AAGGGCATAT GGATATGGTA
TGTGAAAAAT TAAAGAGTAG TAATCACGAT TTTAAAAAAG ATCCTTTAAA ATTAAGAATT
ATTGATGATA AATTTGTTTA TGCTACAGAT ACTACTTTAG GAGCAGATGA TGGTATTTCA
TTAGCTTATG GATTAGCTAT TTTAGATTCT AACAATATAG AACATCCAGC AATAGAGTTT
GTAGCAACTA CAGAGGAAGA AACAGTTATG GGGGGAGCTA CTGCTTTAGA TACTTCACTT
TTAAAAGGAA AGGTTTTATT AAACATAGAT GCTGAAGAAG AAGGAGTATT TATTGTAGGG
TGTGCTGGAG GAATTATGGT TTATCCAGAA ATAAATGCTG AATTTGAAGA TTTTAATGGA
GAAGCTTTAA AATTAGAGAT TTCAGGTTTT AAGGGTGGAC ACTCAGGAAT GGAAATCCAT
AAACAAAGAG GAAATGCCAA TAAGTTAATG GGAAGAATCT TATATGCTTT AAGCAAAGAA
GTTGACTTTA ATATTGCATC AATTAAGGGT GGATCAAAAC ATAATGCAAT ACCACAATAT
TGTCAAAGCA TAATTGCAGT TAAGAAAGAA GATAGAGAAA AGGTTAAAGA AATTTGTACT
GCCTTAGAAA AAGATTTAAA GGCAGAATAC AGAATTGGAG AGCCAGATGT GAATCTTTCT
GTTAAAAGCA TTGGAGGAGT AGAAAAACAA TTAACTAAAA AGGTTACAGA TGATATAACT
AGATTTTTAG TTTTAGTTCC AGATGGATTA CAATCTATGA GTCAAGAGAT AAATGGATTA
GTTGAAAGTA GCTTAAATCT TGGAATAGTA GAAATGGTAG AGGATAAAAT TAAATTTATT
ATTGATATAA GAAGTGCAGT TAAGAGTAAA AAGATAGAAA TCACAAATAG AGTAGAGGCT
CTTTGTAAAG TTATAGGAGC TAATATGACT AAAGATGGAG ATTATCCAGA GTGGGAATAT
GAAGCAGAAT CAAAAATAAA AGATTTAAGT ATTAAAACTT ACAGTGACTT ATTTGGAATT
GAGCCTAAAA TAACAGCTTT ACATGCAGGG CTTGAGTGTG GAATCTTTAA AGAAAAGATG
GGAAAAGAAG TGGAAATTAT AAGCTTTGGT CCAGATATAT TTGATGTGCA TACAGCAAAT
GAGCATTTTA AAATAGAATC TGTTGAAAGA TGTTATAGAT TCTTAATTGA ATTATTAAAG
AACATGAAAT AA
 
Protein sequence
MRVLENLEPK SVFRFFEDLT RIPHDSGNEK ELSDYLVKFA KDRNLEVIQD EALNVIIKKP 
ATKGYENVPG VIIQGHMDMV CEKLKSSNHD FKKDPLKLRI IDDKFVYATD TTLGADDGIS
LAYGLAILDS NNIEHPAIEF VATTEEETVM GGATALDTSL LKGKVLLNID AEEEGVFIVG
CAGGIMVYPE INAEFEDFNG EALKLEISGF KGGHSGMEIH KQRGNANKLM GRILYALSKE
VDFNIASIKG GSKHNAIPQY CQSIIAVKKE DREKVKEICT ALEKDLKAEY RIGEPDVNLS
VKSIGGVEKQ LTKKVTDDIT RFLVLVPDGL QSMSQEINGL VESSLNLGIV EMVEDKIKFI
IDIRSAVKSK KIEITNRVEA LCKVIGANMT KDGDYPEWEY EAESKIKDLS IKTYSDLFGI
EPKITALHAG LECGIFKEKM GKEVEIISFG PDIFDVHTAN EHFKIESVER CYRFLIELLK
NMK