Gene Information |
Locus tag | Ccel_1750 |
Symbol | |
ID | 7312276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2096302 |
End bp | 2097624 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643608679 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_002506081 |
Protein GI | 220929172 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
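
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as below. This is only an illustrative check, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_bp, protein_aa = cds_lengths(2096302, 2097624)
print(gene_bp, protein_aa)  # 1323 440
```

With the Start bp and End bp values of this record, the result matches the listed Gene Length (1323 bp) and Protein Length (440 aa).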
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGT ACGAAATAAT TGAGAAAAAA CGAGATGGCT TCGAACTTAG CTTGGATGAA ATAAATTTCT TTATTACGGA ATATTGTAAT AATAGTATTC CTGACTATCA GGCGGCTGCA CTTTTAATGG CTATTTTCCT TAGGGGTATG AATGAAAGGG AAACGGCTGA TTTAACAAAC GTAATGGCAA ATTCAGGTGA CAGGATAGAT CTTTCTTCAA TTGCGGGTAT AAAAGTTGAT AAACATAGTA CAGGCGGTGT AGGTGACAAG ACCACGCTGA TTCTCGCACC TATTGTAGCT GCTTGCGGAA TACCTGTTGC CAAAATGTCA GGAAGAGGAC TTGGGCACAC AGGCGGAACA ATAGACAAGC TTGAGTCAAT ACCCGGATTT AATACAAGCC TTTCAACAGA ACAGTTTATA GATAATGTAA AAAGCATAGG CATATCAATA GCCGGACAAA CAGGTAATCT GGCTCCTGCT GATAAAAAAC TATATGCTTT GAGAGATGTT ACTGCAACTG TAAATAATAC ATCACTTATT GCAAGCAGTA TTATGAGTAA GAAGCTTGCT TCGGGTGCCG ACAGGATAGT ACTGGACGTA AAAACCGGAA GCGGAGCATT TATGAAAACT TTTGAGGATT CTGTGGAACT TGCCAAAACA ATGGTTAAAA TAGGTAAAAA CACAGGAAGA AAAACCATTG CGGTAGTTAC CGACATGGAC ATACCACTTG GTTTTGCAGT AGGAAATTCA CTCGAAATAA TTGAAGTCAT TGATACTCTG AAAAGTAAAG GGCCAGAGGA TTTGAAAGTT GTTTCATTTG AACTGGCTGC CAGAATGCTT GAGCTTAGCG GGATAGGTAA TATTGATGAG TGCAGGGCCA GAGTTTCCAA CGCAGTTGAA AGCGGTAAAG CACTATCTAA ATTTGCCGAG CTGATAGAAA ATCAGGGAGG AAATAAAGAC GTTATAAATG ACAGTACTTT GTTCCCACAG CCTGCTTATA AGCTGGACTT TATATGTGAA AAGCCTGGTT ATATACAATT TATGAAAACC GACCAAATAG GTATAGCTTC CCTGGTGCTT GGTGCAGGAA GAGAAACAAA GGAAAGTAAA ATCGACTATA GTGCTGGTAT TATGTTTAAT AAAAAAACCG GAGACAGGGT CGAAGCCGGA GAGAGTGTTG CAGTGTTGTA TACAAATAGG GAAGAGACGC TTTTACATGC TGTTTCCCTT TTAAAAGAAG CGGTTGTAAT AAGTGAACTG CCTCATGAAA GAAAACCGCT GATTCTGGCG TATATCGACA GTGAAGGAAA CATAAAAAAA TAA
|
Protein sequence | MRMYEIIEKK RDGFELSLDE INFFITEYCN NSIPDYQAAA LLMAIFLRGM NERETADLTN VMANSGDRID LSSIAGIKVD KHSTGGVGDK TTLILAPIVA ACGIPVAKMS GRGLGHTGGT IDKLESIPGF NTSLSTEQFI DNVKSIGISI AGQTGNLAPA DKKLYALRDV TATVNNTSLI ASSIMSKKLA SGADRIVLDV KTGSGAFMKT FEDSVELAKT MVKIGKNTGR KTIAVVTDMD IPLGFAVGNS LEIIEVIDTL KSKGPEDLKV VSFELAARML ELSGIGNIDE CRARVSNAVE SGKALSKFAE LIENQGGNKD VINDSTLFPQ PAYKLDFICE KPGYIQFMKT DQIGIASLVL GAGRETKESK IDYSAGIMFN KKTGDRVEAG ESVAVLYTNR EETLLHAVSL LKEAVVISEL PHERKPLILA YIDSEGNIKK