Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1776 |
Symbol | pdp |
ID | 4204660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1975353 |
End bp | 1976654 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642566326 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_699091 |
Protein GI | 110802351 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGT ATGATTTAAT ATTAAAAAAG AGAAATGGTG GGGAATTAAG CACTGAAGAA ATAAACTTTT TTGTTGATAA ATATACAAAT GGTGAAATAC CTGATTATCA AGTAGCAGCC TTATTAATGG CTATATATTT CCAAAAAATG AATAAGAGAG AAACTTCGGA TTTAACTATG GCTATGGTTA ATTCAGGAGA TATATTAGAT TTAAGTGAAA TACACGGTAT AAAAGTAGAC AAGCATTCAA CTGGTGGGGT TGGTGATACT ACTACTTTAG TTTTGGGACC TATGGTTTCA GCTTTAGGAA TTCCTGTAGC TAAAATGTCA GGAAGAGGTC TTGGACATAC TGGTGGAACA ATAGATAAAT TAGAAAGTTT TGATGGATTC TCAGTAGAAA TGACTAAGGA TCAATTTATA AATAATGTCA ATAATATAAA ATTAGCAGTT GGTGGACAAA CAGGAGATTT AGCTCCAGCT GATAAAAAAC TTTATGCTTT AAGAGATGTT ACTGGAACTG TTGATAATGT ATCATTAATA GCTTCAAGTA TTATGAGTAA AAAAATTGCT GCAGGCGCTG ACGCCATAGT ATTAGACGTT AAGGTTGGAG ATGGAGCATT TATGAAAACT CCAGAGGCAG CTAGAGAACT TGCAACTGAA ATGGTTGGAA TAGGTAAACA TGTTGGAAGA AATACTGTTG CCATAATTTC AGATATGGAT CAACCTTTAG GATTTGCTAT AGGTAATGCT TTAGAAGTTA AAGAAGCCAT AGAAACTTTA AGAGGAAATG GTCCTAAGGA TCTTTTAGAG CTTTGTTTAA CACTTGGAAG CAACATGGTT GTCTTAGCAG GTGCTGCTAA AGATACTGAT GAAGCAAGAA AAATGTTAAT GGAAACAATA ACTTCAGGAA AAGCAATTGA AAAATTAAAA GAATTTGTAA AGGCTCAAGG TGGAGATGCA TCAGTTATTG ATAATATATC AAATTTCCAT AATGCTAAAT ATGTAATTCC AGTTAAAGCT AATAAATCAG GGGTTGTAAG TAAAATACAT GCTGAAAACA TAGGATTAGT TGCTATGGAA CTTGGAGCAG GAAGAGCTAC TAAAGAAAGT ATTATAGATT TAGCTGTTGG TATTGTTCTT CAAAAGAAAA GAGGAGACAA AGTTAATGAC GGAGATATAA TAGCATATAT TCATGCTAAT GATGAAGAAA AAGGTAATAA AGCAATAGAT GGAATATTAG CTAATTACGA AATATCAGAT TCAGTAAAAG ATATTCCATT AATATATGAT ATAGTTAAAT AG
|
Protein sequence | MRMYDLILKK RNGGELSTEE INFFVDKYTN GEIPDYQVAA LLMAIYFQKM NKRETSDLTM AMVNSGDILD LSEIHGIKVD KHSTGGVGDT TTLVLGPMVS ALGIPVAKMS GRGLGHTGGT IDKLESFDGF SVEMTKDQFI NNVNNIKLAV GGQTGDLAPA DKKLYALRDV TGTVDNVSLI ASSIMSKKIA AGADAIVLDV KVGDGAFMKT PEAARELATE MVGIGKHVGR NTVAIISDMD QPLGFAIGNA LEVKEAIETL RGNGPKDLLE LCLTLGSNMV VLAGAAKDTD EARKMLMETI TSGKAIEKLK EFVKAQGGDA SVIDNISNFH NAKYVIPVKA NKSGVVSKIH AENIGLVAME LGAGRATKES IIDLAVGIVL QKKRGDKVND GDIIAYIHAN DEEKGNKAID GILANYEISD SVKDIPLIYD IVK
|
| |