Gene Information |
Locus tag | CPF_2062 |
Symbol | pdp |
ID | 4203206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 2301785 |
End bp | 2303086 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638082927 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_696491 |
Protein GI | 110799911 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
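The coordinate and length fields above are mutually consistent, which the short Python sketch below checks. It assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the protein length does not count the stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end positions."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2301785, 2303086)
print(length)                  # 1302, matching "Gene Length"
print(protein_length(length))  # 433, matching "Protein Length"
```

Because the gene is not spliced, a single contiguous interval is enough; a spliced gene would need the exon lengths summed instead.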
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGT ATGATTTAAT ATTAAAAAAG AGAAATGGTG GGGAATTAAG CACTGAAGAA ATAAACTTTT TTGTTGATAA ATATACAAAT GGTGAAATAC CAGATTATCA AGTAGCAGCC TTATTAATGG CTATATATTT CCAAAAAATG AATAAGAGAG AAACTTCAGA TTTAACTATG GCTATGGTTA ATTCAGGAGA TATATTAGAT TTAAGTGAAA TACACGGTAT AAAAGTAGAT AAGCATTCAA CTGGTGGGGT TGGTGATACT ACTACTTTAG TTTTAGGACC TATGGTTGCA GCTTTAGGAA TTCCTGTAGC TAAAATGTCA GGAAGAGGTC TTGGACATAC TGGTGGAACA ATAGATAAAT TAGAAAGTTT TGATGGATTC TCAGTAGAAA TGACTAAGGA TCAATTTATA AATAATGTTA ATAATATAAA ATTAGCAGTT GGTGGACAAA CAGGAGATTT AGCTCCAGCT GATAAAAAAC TTTATGCTTT AAGAGATGTT ACTGGAACTG TTGATAATGT ATCATTAATA GCTTCAAGTA TTATGAGTAA AAAAATTGCT GCAGGCGCTG ATGCCATAGT ATTAGACGTT AAGGTTGGAG ATGGAGCATT TATGAAAACT CCAGAGGCAG CTAGAGAACT TGCAACTGAA ATGGTTGGAA TAGGTAAACA TGTTGGAAGA AATACTGTTG CCATAATTTC AGATATGGAT CAACCTTTAG GATTTGCTAT AGGTAATGCA TTAGAAGTTA AAGAAGCCAT AGAAACTTTA AGAGGAAATG GTCCTAAGGA TCTTTTAGAG CTTTGCTTAA CACTTGGAAG CAACATGGTT GTTTTAGCAG GTGCTGCTAA AGATACTGAT GAAGCAAGAA AAATGTTAAT GGAAACAATA ACTTCAGGAA AAGCAATTGA AAAATTAAAA GAATTTGTAA AAGCTCAAGG TGGAGATGCA TCAGTTATTG ATGATATATC AAACTTCCAT AATGCTAAAT ATGTAATTCC AGTTAAAGCT AATAAATCAG GGGTTGTAAG CAAAATACAT GCTGAAAACA TAGGATTAGT TGCTATGGAA CTTGGAGCAG GAAGAGCTAC TAAAGAAAGT ATTATAGATT TAGCTGTTGG TATTGTTCTT CAAAAGAAAA GAGGAGACAA AGTTAATGAA GGAGATATAA TAGCATATAT TCATGCTGAT GATGAAGAAA AAGGTAAGAA AGCAGTAGAT GGAATATTAG CTAATTACGA AATATCAGAT TCAGTAAAAG ATATTCCATT AATATATGAT ATAGTTAAAT AG
|
Protein sequence | MRMYDLILKK RNGGELSTEE INFFVDKYTN GEIPDYQVAA LLMAIYFQKM NKRETSDLTM AMVNSGDILD LSEIHGIKVD KHSTGGVGDT TTLVLGPMVA ALGIPVAKMS GRGLGHTGGT IDKLESFDGF SVEMTKDQFI NNVNNIKLAV GGQTGDLAPA DKKLYALRDV TGTVDNVSLI ASSIMSKKIA AGADAIVLDV KVGDGAFMKT PEAARELATE MVGIGKHVGR NTVAIISDMD QPLGFAIGNA LEVKEAIETL RGNGPKDLLE LCLTLGSNMV VLAGAAKDTD EARKMLMETI TSGKAIEKLK EFVKAQGGDA SVIDDISNFH NAKYVIPVKA NKSGVVSKIH AENIGLVAME LGAGRATKES IIDLAVGIVL QKKRGDKVNE GDIIAYIHAD DEEKGKKAVD GILANYEISD SVKDIPLIYD IVK
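The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that in Python, assuming the sequence is pasted as the space-separated 10-base blocks shown above; the function name is hypothetical, and how the record rounds its percentage is not stated.

```python
# Minimal GC-content helper for sequences formatted in 10-base blocks.
# Assumption: whitespace is only a display separator and carries no meaning.

def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input for illustration, not taken from the record.
print(gc_percent("ATGC GGCC AT"))  # 60.0
```

Passing the full Gene sequence field to `gc_percent` and rounding the result allows a comparison with the GC content value listed in the Gene Information table.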