Gene CPF_2403 details


Gene Information

Locus tag: CPF_2403
Symbol: pepD
ID: 4203488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2673232
End bp: 2674683
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 30%
IMG OID: 638083268
Product: aminoacyl-histidine dipeptidase
Protein accession: YP_696826
Protein GI: 110801407
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
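
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene span includes the stop codon; neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2673232
end_bp = 2674683

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the span ends with a stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported gene length (1452 bp) and protein length (483 aa).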


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGTAT TAAAAGGTTT AGAACCACAA AGTGTTTTAA AATATTTTGA AGAAATATCA 
CAAATTCCAA GAGGATCAGG TAATGAAAAG GGAATAAGTG ATTTCCTAGT TAACTTTGGA
AAAAGTTTAG GACTTGAAAC AATACAAGAT GAATCATTAA ATGTAATAAT AAGAAAACCT
GCAACTCCAG GATATGAAAA TGCACCAGGA GTAATAATAC AAGGTCATAT GGATATGGTA
TGTGAAAAAA ATAAAGATAC TATACATGAT TTTGAAAAAG ATCCTATTGA ACTTAGAGTA
GATGGAGATT ATATATATGC TACAGGAACT ACATTAGGAG CAGATAATGG TATAGCAGTA
GCTTATGGAA TGGCTGTTTT AGCTTCAAAT GATATAGCGC ACCCTGCTAT AGAGCTTTTA
GTTACAACTG ATGAAGAAGT TGGAATGGGT GGAGCAATTG CCTTAGATGG AACTTTATTA
AAAGGTAAAT ATCTTTTAAA CATAGATTCA GAGGAAGAAG GAAAACTTTT AGTAAGCTGT
GCAGGTGGAG CTAGAAGTGA AGTTACTTTA CCAATAAACT TTGAAGAAAT GGAAAAAGAT
TTTGAAGTTT ATGAAATCAT GCTAAGAGGA CTAAAGGGTG GTCACTCTGG AATGGAAATA
GATAAACAAA GAGGAAACTC TAATAAGTTA ATGGGAAGAG TATTAAATGA TATTAATGCT
AACTGTGATA TTAGATTAAT ATCAATTAAT GGTGGATCTA AGGTAAATGC TATTCCAAGA
GAATGTGATA CTTTACTAGC TGTTAAAAAA GAAGATGTTA AAAAATTAGA AGAATTAATT
CAAAAATGGG ATTCAATATT AAAGGATGAA TACCATACTA ATGATAGTGG AGTTAATGTA
ACTTTAGTTA AAAAAGAAGA AAATCATAAA GTATTCTCTA AAGACACTAC ATTTAAGGCT
ATAAAAATAA TGAACTTAAT TCCTGATGGA GTTGATACTT ATAGTATAGA AATGAAAGGA
TTAGTTCAAA GTTCAACAAA CCTAGGTGTT GTTACTACAG AAGGAGATAA AATTGTCTTT
GCTAGTTCAA CAAGAAGTTC AGTTGAAACT TTAAAAACTA AACTTTTAGA TGAAATAGCT
GATGTTGCAG AAGTATTAGG TGGAGAATTT GAAATACAAG CACCATACCC AGCTTGGCAA
TATAATCCTG ATTCAAAAAT AAGAGAACTT TGCAGCAATG TATATAAAAA TATGACAGGA
AAAGATCCTG AAATAATAGC TATACATGCT GGATTAGAAT GTGGATTATT AGGAGAAAAA
ATAGAAGGAT TAGATATGAT TTCATTTGGT CCTAATATGT ATGATGTTCA TACTCCAAAT
GAACATGTTA GCATATCTTC AGTAAAAAAT GTTTGGGATT TCTTAGTTGA AATATTAAAA
GCTATAAAAT AA
 
Protein sequence
MNVLKGLEPQ SVLKYFEEIS QIPRGSGNEK GISDFLVNFG KSLGLETIQD ESLNVIIRKP 
ATPGYENAPG VIIQGHMDMV CEKNKDTIHD FEKDPIELRV DGDYIYATGT TLGADNGIAV
AYGMAVLASN DIAHPAIELL VTTDEEVGMG GAIALDGTLL KGKYLLNIDS EEEGKLLVSC
AGGARSEVTL PINFEEMEKD FEVYEIMLRG LKGGHSGMEI DKQRGNSNKL MGRVLNDINA
NCDIRLISIN GGSKVNAIPR ECDTLLAVKK EDVKKLEELI QKWDSILKDE YHTNDSGVNV
TLVKKEENHK VFSKDTTFKA IKIMNLIPDG VDTYSIEMKG LVQSSTNLGV VTTEGDKIVF
ASSTRSSVET LKTKLLDEIA DVAEVLGGEF EIQAPYPAWQ YNPDSKIREL CSNVYKNMTG
KDPEIIAIHA GLECGLLGEK IEGLDMISFG PNMYDVHTPN EHVSISSVKN VWDFLVEILK
AIK
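
The gene and protein sequences above can be related to each other by translating the nucleotide sequence with translation table 11. The following is a minimal sketch, not the method used by the database: it builds the codon table from the standard NCBI amino acid ordering, strips the spacing used in the 10-base blocks, and stops at the first stop codon. The function names (clean, gc_fraction, translate) are hypothetical helpers introduced here. Alternative start codons, which table 11 permits, are not handled; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional T, C, A, G codon order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final block of the gene sequence.
first_blocks = "ATGAATGTAT TAAAAGGTTT AGAACCACAA"
last_block = "GCTATAAAAT AA"

print(translate(first_blocks))  # start of the protein
print(translate(last_block))    # end of the protein, before the stop codon
```

Passing the full gene sequence to translate and gc_fraction would allow a comparison with the protein sequence and the GC content listed in the record.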