Gene CPR_1776 details

Gene Information

Locus tag: CPR_1776
Symbol: pdp
ID: 4204660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1975353
End bp: 1976654
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 32%
IMG OID: 642566326
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_699091
Protein GI: 110802351
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
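
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1975353, 1976654)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values from this record, the results can be compared against the Gene Length and Protein Length fields.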


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGT ATGATTTAAT ATTAAAAAAG AGAAATGGTG GGGAATTAAG CACTGAAGAA 
ATAAACTTTT TTGTTGATAA ATATACAAAT GGTGAAATAC CTGATTATCA AGTAGCAGCC
TTATTAATGG CTATATATTT CCAAAAAATG AATAAGAGAG AAACTTCGGA TTTAACTATG
GCTATGGTTA ATTCAGGAGA TATATTAGAT TTAAGTGAAA TACACGGTAT AAAAGTAGAC
AAGCATTCAA CTGGTGGGGT TGGTGATACT ACTACTTTAG TTTTGGGACC TATGGTTTCA
GCTTTAGGAA TTCCTGTAGC TAAAATGTCA GGAAGAGGTC TTGGACATAC TGGTGGAACA
ATAGATAAAT TAGAAAGTTT TGATGGATTC TCAGTAGAAA TGACTAAGGA TCAATTTATA
AATAATGTCA ATAATATAAA ATTAGCAGTT GGTGGACAAA CAGGAGATTT AGCTCCAGCT
GATAAAAAAC TTTATGCTTT AAGAGATGTT ACTGGAACTG TTGATAATGT ATCATTAATA
GCTTCAAGTA TTATGAGTAA AAAAATTGCT GCAGGCGCTG ACGCCATAGT ATTAGACGTT
AAGGTTGGAG ATGGAGCATT TATGAAAACT CCAGAGGCAG CTAGAGAACT TGCAACTGAA
ATGGTTGGAA TAGGTAAACA TGTTGGAAGA AATACTGTTG CCATAATTTC AGATATGGAT
CAACCTTTAG GATTTGCTAT AGGTAATGCT TTAGAAGTTA AAGAAGCCAT AGAAACTTTA
AGAGGAAATG GTCCTAAGGA TCTTTTAGAG CTTTGTTTAA CACTTGGAAG CAACATGGTT
GTCTTAGCAG GTGCTGCTAA AGATACTGAT GAAGCAAGAA AAATGTTAAT GGAAACAATA
ACTTCAGGAA AAGCAATTGA AAAATTAAAA GAATTTGTAA AGGCTCAAGG TGGAGATGCA
TCAGTTATTG ATAATATATC AAATTTCCAT AATGCTAAAT ATGTAATTCC AGTTAAAGCT
AATAAATCAG GGGTTGTAAG TAAAATACAT GCTGAAAACA TAGGATTAGT TGCTATGGAA
CTTGGAGCAG GAAGAGCTAC TAAAGAAAGT ATTATAGATT TAGCTGTTGG TATTGTTCTT
CAAAAGAAAA GAGGAGACAA AGTTAATGAC GGAGATATAA TAGCATATAT TCATGCTAAT
GATGAAGAAA AAGGTAATAA AGCAATAGAT GGAATATTAG CTAATTACGA AATATCAGAT
TCAGTAAAAG ATATTCCATT AATATATGAT ATAGTTAAAT AG
 
Protein sequence
MRMYDLILKK RNGGELSTEE INFFVDKYTN GEIPDYQVAA LLMAIYFQKM NKRETSDLTM 
AMVNSGDILD LSEIHGIKVD KHSTGGVGDT TTLVLGPMVS ALGIPVAKMS GRGLGHTGGT
IDKLESFDGF SVEMTKDQFI NNVNNIKLAV GGQTGDLAPA DKKLYALRDV TGTVDNVSLI
ASSIMSKKIA AGADAIVLDV KVGDGAFMKT PEAARELATE MVGIGKHVGR NTVAIISDMD
QPLGFAIGNA LEVKEAIETL RGNGPKDLLE LCLTLGSNMV VLAGAAKDTD EARKMLMETI
TSGKAIEKLK EFVKAQGGDA SVIDNISNFH NAKYVIPVKA NKSGVVSKIH AENIGLVAME
LGAGRATKES IIDLAVGIVL QKKRGDKVND GDIIAYIHAN DEEKGNKAID GILANYEISD
SVKDIPLIYD IVK
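
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any calculation. The following is a minimal sketch, assuming plain string input copied from this page, that strips the grouping and computes a GC percentage comparable to the GC content field. The helper names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a 10-character-grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above; paste the full block to check the whole gene.
first_line = "ATGAGAATGT ATGATTTAAT ATTAAAAAAG AGAAATGGTG GGGAATTAAG CACTGAAGAA"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Running the same two functions on the complete gene sequence gives a value that can be compared with the reported GC content.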