Gene Information |
Locus tag | Caci_0787 |
Symbol | deoA |
ID | 8332117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 919156 |
End bp | 920445 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644953939 |
Product | thymidine phosphorylase |
Protein accession | YP_003111563 |
Protein GI | 256389999 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
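The coordinate, length, and protein-length fields in this table can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 919156
end_bp = 920445
gene_length_bp = 1290
protein_length_aa = 429

# Assumption: coordinates are 1-based and inclusive on the replicon.
span = end_bp - start_bp + 1

# Assumption: a complete CDS ends in one stop codon that is not translated.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.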
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCCA TCACCGTCAT CCGCGCCAAG CGCGACCGCG CGGAGCTCAC CGACGAGCAG ATCGACTGGA TCCTCGCCGC CTACACCGAC GGCCGGGTGG CCGAGGAGCA GATGTCCGCG CTGGCGATGG CGATCCTGCT CAACGGCATG AACCGGCGCG AGATCTCACG CTGGACGCAG GCGATGATCG ACTCCGGCGA GCGGATGGAC TTCTCCGCGC TCACGCGCCC GACCGTGGAC AAGCACTCCA CCGGCGGCGT CGGCGACAAG ATCACGCTGC CGCTGGCGCC GCTGGTGGCG GCGTGCGGCG CGGCGGTCCC GCAGCTGTCC GGCCGCGGTC TGGGCCACAC CGGCGGCACG CTGGACAAGC TGGAGTCGAT CCCGGGCTGG CGCGCCTCGC TGTCCACCGC CGAGATGCTC GCGATCCTGG CCGACGCCGG CGCCGTGGTC TGCGCGGCCG GCGACGGTCT GGCGCCGGCG GACAAGAAGC TCTACGCGCT GCGCGACGTC ACCGGGACCG TCGAGGCGAT CCCGCTGATC GCGTCCTCGA TCATGTCCAA GAAGATCGCG GAGGGCACCG GCGCGCTGGT GCTGGACGTG AAGTGCGGCA GCGGCGCGTT CATGAAGGAC TTCGCCGACG CCCGCGAACT GGCCGAGACC ATGGTCGCCC TCGGCACCGA CCACGGCGTC CGCACCACCG CGCTGATCAC CGCGATGGAC ACCCCGCTGG GCCGCACCGC GGGCAACGCC CTGGAGGTGC GCGAGTCGGT CGAGGTGCTC TCCGGCGGCG GTCCGGCCGA CATCAGGGAG CTGACACTCG CGCTCGCCCG CGAGATGCTC GACGCCGCCG GCCTGGCCGG CGTGGACCCG GCGGCCAAGC TCGACGACGG CTCGGCGCTG GACGCCTGGA AGCGCATGAT CCGCGCCCAG GGCGGCGATC CCGACGCCCC GCTGCCGACG GCGAAGGAGA CCCACGTGGT CGCCGCCGAC CGCGACGGCG TCCTGGTCGC CATGGACTCC TTCAAGGTCG GCGTGGCCGC CTGGCGCCTC GGCGCGGGCC GGGCGCGCAA GGAGGACGCG GTGCAGGCCG GCGCCGGCGT GGAGTGGCAC GCGGTCCCCG GCGACACCGT GCGCGCCGGA CAGCCGCTGT TCACCCTGCA CACCGACACC CCGGAGACCT TCGACACCGC GCTGGAAAGC CTGGCCGGCG CCTGCGACAT CGCCGCGCCG GGGACGGAGT TCACCCGGTC GCCGCTTGTC CTGGACCGCG TCGCCGCCTC GAACCGGTAG
Protein sequence | MDAITVIRAK RDRAELTDEQ IDWILAAYTD GRVAEEQMSA LAMAILLNGM NRREISRWTQ AMIDSGERMD FSALTRPTVD KHSTGGVGDK ITLPLAPLVA ACGAAVPQLS GRGLGHTGGT LDKLESIPGW RASLSTAEML AILADAGAVV CAAGDGLAPA DKKLYALRDV TGTVEAIPLI ASSIMSKKIA EGTGALVLDV KCGSGAFMKD FADARELAET MVALGTDHGV RTTALITAMD TPLGRTAGNA LEVRESVEVL SGGGPADIRE LTLALAREML DAAGLAGVDP AAKLDDGSAL DAWKRMIRAQ GGDPDAPLPT AKETHVVAAD RDGVLVAMDS FKVGVAAWRL GAGRARKEDA VQAGAGVEWH AVPGDTVRAG QPLFTLHTDT PETFDTALES LAGACDIAAP GTEFTRSPLV LDRVAASNR
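Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any downstream use. The following is a hedged sketch of that cleanup plus a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example string is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a (possibly blocked) DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three display blocks of the gene sequence (a prefix, not the whole gene).
prefix = "ATGGACGCCA TCACCGTCAT CCGCGCCAAG"

print(len(clean_sequence(prefix)))
print(round(gc_fraction(prefix), 2))
```

Applying `gc_fraction` to the complete gene sequence would give a value to compare with the GC content field of the record.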