Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TDE2191 |
Symbol | pdp |
ID | 2740213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | - |
Start bp | 2226410 |
End bp | 2227741 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637161081 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | NP_972792 |
Protein GI | 42527694 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGCAA CGGATATTAT TATGAAAAAA CGCGGAATAA AGGGGCAGGC TATAGAGCCC TTAAACCGCA AAGAGATAGA ATTTATTGTA AATTCTTATG TAAGGGGTGA AATTCCCGAA TATCAGATTT CGGCATGGCT TATGGCCGTG TATTTTAACG GAATGACCTT TGAAGAAACT GCTATTCTTA CGGATGTTAT GCTTCATTCC GGGAAGGTGA TGGACCTTTC AAGCCTTGAA GGCCCCTTTG TCGATAAACA TTCTACCGGA GGGGTAGGGG ATAAGCTCTC GCTTCCTCTT GCTCCCATTG TTGCGGCAAA CGGTGTTAAG GTTCCAATGA TGAGCGGCCG GGCACTTGGC CACACGGGAG GAACTCTCGA TAAGCTTGAG GCAGTTACGG GCTACCGCAC CAACTTGACC GAAGCCGAAT TTAGGAATTT TATAGAAAAA ACAGGCTTTG CCATGACGGG GCAGACAAAG GAAATCGTTC CTGCAGACCG CCTTCTATAT GCGATGCGGG ATGTTACGGC CACAGTTGAA TCCGTTCCGC TTATAACTTC GAGTATTCTT TCAAAAAAAG TTGCAGAAGG TTCCGAAGCC CTCGTTTTTG ATGTAAAATG CGGAAAGGGT GCCTTTATGA AAACCTTGAG CGATGCGAAA GCCTTGGCGG TAAGCCTTGT AGGTACGGCT AAGGCTATGG GTAAAAAGGC GCGGGCTCTT ATAACCAATA TGAATGAACC TCTCGGCACG ATGGCCGGAA ACTTTTTGGA AATAGAAGAA ACGATAGATA TTTTAAAAGG ACAAGGACCC GCGGACAGTA CTGAGCTTAC CTTGCAGCTG GCCGCTCACA TGCTCGTTCT GGGAGGCAAG GCTAAGACGG AAGAAGAGGG GCTTTCTCTT GCAAAAGAAG CCGTAAGCTC AGGAAAAGCC TTGGATCTTT TTATAAAAAA TATAGAACTT CAGGGAGGCA ATCCCAAGAC CCTTATGGCG GAATATAAAA CCCGCCGCAG CAAATTCTTT GAAGAATTGA AGGCAGAAAG GGACGGCTTT ATCGAGAGCA TTAATGCTTT TGAGGTCGGT ATGGCCGGTG TAAACCTTGG AGTCGGAAGA AATAAAACCA CCGATCCCGT ATGCCCCGAT GCCGGAGTTG AAATTTTAAA ACACAAGGGC GATTCCGTAA AAAAAGGAGA CCTTATAATG AGGGTCTACG GAAAGGATTC GGCTTCGGTT TCCGCTTCAA TGCCCTTATT GAAAAACGCT ATAGAATATT CCGATAAGGC TCCGCAAAAA AATAAGCTTA TTTTTAAAAT TATCAAACAA GAAGAACTTT AA
|
Protein sequence | MRATDIIMKK RGIKGQAIEP LNRKEIEFIV NSYVRGEIPE YQISAWLMAV YFNGMTFEET AILTDVMLHS GKVMDLSSLE GPFVDKHSTG GVGDKLSLPL APIVAANGVK VPMMSGRALG HTGGTLDKLE AVTGYRTNLT EAEFRNFIEK TGFAMTGQTK EIVPADRLLY AMRDVTATVE SVPLITSSIL SKKVAEGSEA LVFDVKCGKG AFMKTLSDAK ALAVSLVGTA KAMGKKARAL ITNMNEPLGT MAGNFLEIEE TIDILKGQGP ADSTELTLQL AAHMLVLGGK AKTEEEGLSL AKEAVSSGKA LDLFIKNIEL QGGNPKTLMA EYKTRRSKFF EELKAERDGF IESINAFEVG MAGVNLGVGR NKTTDPVCPD AGVEILKHKG DSVKKGDLIM RVYGKDSASV SASMPLLKNA IEYSDKAPQK NKLIFKIIKQ EEL
|
| |