Gene CPR_2115 details

Gene Information

Locus tag: CPR_2115
Symbol: pepD
ID: 4205972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2342192
End bp: 2343643
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 29%
IMG OID: 642566665
Product: aminoacyl-histidine dipeptidase
Protein accession: YP_699424
Protein GI: 110803954
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
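
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for CPR_2115
length = gene_length_bp(2342192, 2343643)
print(length)                              # 1452
print(expected_protein_length_aa(length))  # 483
```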


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATAT TAAAAGGTTT AGAACCACAA AGTGTTTTAA AATATTTTGA AGAAATATCA 
CAAATTCCAA GAGGATCAGG TAATGAAAAG GGAATAAGTG ATTTTCTAGT TAACTTTGGA
AAAAATTTAG GACTTGAAAC AATACAAGAT GAATCATTAA ATGTAATAAT AAGAAAACCT
GCAACTCCAG GATATGAAAA TGCACCAGGA GTAATAATAC AAGGTCATAT GGATATGGTA
TGTGAAAAAA ATAAAGATAC TATACATGAT TTTGAAAAAG ATCCTATTAA ACTTAGAGTA
GATGGAGATT ATATATACGC TACAGGAACT ACATTAGGAG CAGATAATGG TATAGCAGTA
GCTTATGGAA TGGCTGTTTT AGCTTCAAAT GATATAGCAC ACCCTGCTAT AGAACTTTTA
GTTACAACTG ATGAAGAAGT TGGAATGGGT GGAGCTATTG CTTTAGATGG AACTTTATTA
AAAGGTAAAT ATCTTTTAAA CATAGATTCA GAGGAAGAAG GAAAACTTTT AGTAAGCTGT
GCAGGTGGAG CTAGAAGTGA AGTTACTTTA CCAATAAACT TTGAAGAAAT GGAAAAAGAT
TTTGAAGTTT ATGAAATCAT GCTAAGAGGA CTAAAGGGTG GTCACTCTGG AATGGAAATA
GATAAACAAA GAGGAAACTC TAATAAGTTA ATGGGAAGAG TATTAAATGA TATTAATGCT
AACTGTGATA TTAGATTAAT ATCAATTAAT GGTGGATCTA AGGTAAATGC TATTCCAAGA
GAATGTGATA CTTTACTAGC TGTTAAAAAA GAAGATGTTA AAAAATTAGA AGAATTAATT
CAAAAATGGG ATTCAATATT AAAGGATGAA TATCATGCTA ATGATAGTGG AGTTAATGTA
ACTTTAGTTA AAAAAGAAGA AAATCATAAA GTATTTTCTA AAGACACTAC ATTTAAAGCT
ATAAAAATAA TGAACTTAAT TCCTGATGGA GTTGATACTT ATAGTATAGA AATGAAAGGA
TTAGTTCAAA GTTCAACAAA CCTAGGTGTT GTTACTACAG AAGGAGATAA AATTGTCTTT
GCTAGTTCAA CAAGAAGTTC AGTTGAAACT TTAAAAACTA AACTTTTAGA TGAAATAGCT
GATGTTGCAG AAATATTAGG TGGAGAATTT GAAATACAAG CACCATACCC AGCTTGGCAA
TATAATCCAG ATTCAAAAAT AAGAGAACTT TGCAGCAATG TATATAAAAA TATGACAGGA
AAAGATCCTG AAATAATAGC TATACATGCT GGATTAGAAT GTGGATTATT AGGAGAAAAA
ATAGAAGGAT TAGATATGAT TTCATTTGGT CCTAATATGT ATGATGTTCA TACTCCAAAT
GAACATGTTA GCATATCTTC AGTAAAAAAT GTTTGGGATT TCTTAGTTGA AATATTAAAA
GCTATAAAAT AA
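
The gene sequence is printed in space-separated groups of ten bases, so the whitespace has to be removed before computing anything such as the GC content reported in the record. A minimal sketch follows, using only the first row of the sequence as input; the helper names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups and lines."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence only; paste the full block to check the
# whole-gene value against the "GC content" field.
first_row = "ATGAATATAT TAAAAGGTTT AGAACCACAA AGTGTTTTAA AATATTTTGA AGAAATATCA"
seq = clean_sequence(first_row)
print(len(seq))          # 60
print(gc_fraction(seq))
```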
 
Protein sequence
MNILKGLEPQ SVLKYFEEIS QIPRGSGNEK GISDFLVNFG KNLGLETIQD ESLNVIIRKP 
ATPGYENAPG VIIQGHMDMV CEKNKDTIHD FEKDPIKLRV DGDYIYATGT TLGADNGIAV
AYGMAVLASN DIAHPAIELL VTTDEEVGMG GAIALDGTLL KGKYLLNIDS EEEGKLLVSC
AGGARSEVTL PINFEEMEKD FEVYEIMLRG LKGGHSGMEI DKQRGNSNKL MGRVLNDINA
NCDIRLISIN GGSKVNAIPR ECDTLLAVKK EDVKKLEELI QKWDSILKDE YHANDSGVNV
TLVKKEENHK VFSKDTTFKA IKIMNLIPDG VDTYSIEMKG LVQSSTNLGV VTTEGDKIVF
ASSTRSSVET LKTKLLDEIA DVAEILGGEF EIQAPYPAWQ YNPDSKIREL CSNVYKNMTG
KDPEIIAIHA GLECGLLGEK IEGLDMISFG PNMYDVHTPN EHVSISSVKN VWDFLVEILK
AIK
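
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. Alternative start codons such as GTG or TTG are not handled here, which is an assumption that holds for this gene because it begins with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, matching the NCBI genetic code layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon and drop one trailing stop, if present."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 12 bases of the gene sequence
print(translate("ATGAATATAT TAAAAGGTTT AGAACCACAA"))  # MNILKGLEPQ
print(translate("GCTATAAAAT AA"))                     # AIK
```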