Gene Information |
Locus tag | Cthe_0678 |
Symbol | |
ID | 4810296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 833900 |
End bp | 835201 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106095 |
Product | thymidine phosphorylase |
Protein accession | YP_001037106 |
Protein GI | 125973196 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
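
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length counts the terminal stop codon,
    which is not translated into an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(833900, 835201)
print(length)                  # 1302
print(protein_length(length))  # 433
```

Under these assumptions the computed values agree with the Gene Length (1302 bp) and Protein Length (433 aa) fields.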
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGG TTGATCTTAT AAACAAAAAA AAGCGGGGAG AAGCTCTGTC CGCCGCTGAA ATTGATTATA TTGTCCAAGG CTATACAAAG GGCGAAATAC CGGACTATCA GATGTCGGCA TTTTTGATGG CTGTATATTT CAAAGGAATG AACAGAGAAG AGACGGCAAA CCTTACTTTA TCCATGGTCA ATTCCGGTGA AACGGTTGAC CTTTCGATGA TTGAAGGAAT AAAGGTTGAC AAGCATTCTT CCGGTGGCGT TGGAGACAAA ATCAGTCTTG TAATAGTTCC GCTTTGTGCC TGTGTTGGGA TACCGGTTGC AAAAATGTCC GGAAGAGGGC TTGGGCACAC CGGCGGAACA ATTGATAAGC TGGAATCCAT AGAAGGATTT AGAACCGAGC TTACGAAAGA GGAGTTTGTG AACAACGTAA ACAAATATAA AATGGCCATA GTAGGCCAAT CGCCAAATCT CACTCCTGCG GACAAAAAAA TATATGCTCT GAGAGATGTT ACCGGTACGG TGGACAGCAT ACCGCTTATA GCAAGCTCAA TAATGAGTAA AAAAATCGCC TCCGGGTGCG ATTGCATTGT CCTGGATGTC AAGGTGGGAT CCGGAGCCTT CATGAAGTCC GTGGACGAAG CCGTGATTTT GGCAAAAACC ATGGTGGAAA TAGGCAAAGC TTTGGGAAGA AGAACTGTTG CGGTTGTAAC AGACATGAGC CAGCCTTTGG GATATGAAGT GGGAAACGCC AACGAAGTTA AAGAAGCAAT AGAAATATTG AAGGGCCACG GTGCCGAGGA CGAGACAACG GTGGCACTCA CAATTGCATC CCATATGGCG GTATTGGGCG GTGCTTTTTC AGATTATGAA TCGGCTTACA ACCATATGCG CAAATTGATA GAATCCGGCA AGGCAGTGGA AAAATTAAAG GAATTAATCA GAATACAGGG GGGAAATACC GATGTGGTGG ACAACCCAAA TCTTTTACCC CAGGCCGAAA AACACATAGA AGTTAAATCC TCAACGGCAG GTTATATAAA TTCTGTCAAT GCCGAGGACA TAGGAGTTTC GGCAATGCTT CTTGGAGCAG GAAGAAAGAC CAAAAACGAC AGCATAGATT TTTCAGCGGG CATCACAATG GTAAAAAAGA TTGGGGATTG GGTTGATGAA GGTGATACTT TGTGCATACT TCACACAAAC AAGTCCGACT TTCAAGAGGC AGAAAGGCTT TCCAAAAATG CTTTTGTCAT AAAAAACACG AAACCTGAAC CGATTAAATA TGTTCACTGT GTTATTGACT GA
|
Protein sequence | MRMVDLINKK KRGEALSAAE IDYIVQGYTK GEIPDYQMSA FLMAVYFKGM NREETANLTL SMVNSGETVD LSMIEGIKVD KHSSGGVGDK ISLVIVPLCA CVGIPVAKMS GRGLGHTGGT IDKLESIEGF RTELTKEEFV NNVNKYKMAI VGQSPNLTPA DKKIYALRDV TGTVDSIPLI ASSIMSKKIA SGCDCIVLDV KVGSGAFMKS VDEAVILAKT MVEIGKALGR RTVAVVTDMS QPLGYEVGNA NEVKEAIEIL KGHGAEDETT VALTIASHMA VLGGAFSDYE SAYNHMRKLI ESGKAVEKLK ELIRIQGGNT DVVDNPNLLP QAEKHIEVKS STAGYINSVN AEDIGVSAML LGAGRKTKND SIDFSAGITM VKKIGDWVDE GDTLCILHTN KSDFQEAERL SKNAFVIKNT KPEPIKYVHC VID
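
As a rough illustration of how the GC content and protein sequence relate to the gene sequence, the following sketch computes the GC percentage and translates a nucleotide string. It is a minimal stand-alone version, not the method used to build this record. It assumes the sequence is given 5' to 3' in reading frame, strips the spaces between 10-base blocks, and uses the amino acid assignments of translation table 11 (which match the standard code). Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the gaps between 10-base blocks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    gc = sum(base in "GC" for base in seq)
    return 100 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
fragment = "ATGAGAATGG TTGATCTTAT"
print(translate(fragment))   # MRMVDL
print(gc_content(fragment))  # 30.0
```

The translated fragment matches the first six residues of the protein sequence. Passing the full gene sequence to the same functions would allow the GC content and Protein sequence fields to be checked against the record.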
|
| |