Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_1595 |
Symbol | deoA |
ID | 3718585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 189717 |
End bp | 191024 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640069746 |
Product | thymidine phosphorylase |
Protein accession | YP_351640 |
Protein GI | 77462136 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACGCGC GATCGATCAA CGCAAAGCTG CGCCGGGGCG AAGTGCCCTC GGCCGCTGAA CTGGGCTGGT TCGCGGAAGG TCTGGCCTCG GGGCATGTCA CCGATGCGCA GGCCGGCGCC TTCGCCATGG CGGTCTGCCT GCAGGGTCTG GGCGAGGAGG GGCGTGTGGC CCTGACGCGC GCCATGCGCG ACTCCGGCCG GGTGCTCGCC TGGGACCTGC CGGGGCCGGT GCTCGACAAA CATTCGACGG GCGGCATCGG AGACTGCACC TCGCTGCTCC TCGCGCCGGC TCTGGCTGCC TGCGGGGCCT ATGTGCCGAT GATCTCGGGT CGGGGCCTCG GCCATACGGG GGGCACGCTC GACAAGCTGG AAGCGATCCC CGGCTTCCGC GTGGCCCTGG GCGAAGAGCG GTTGCGCGCG CAGATCGAGG ATGTCCGCTG CGCCATCGTC GCCGCCGACG AGGGCATGGC GCCCGCGGAC CGGCGGCTCT ACCTCATCCG CGACGTGACC GGCACGGTCG AATCGATCGA CCTCATCACC GCGTCGATCC TGTCGAAGAA GCTGGCGGCC GGGCTCGAGG GGCTGGTGCT CGATGTGAAA GTGGGTTCGG GCGCCTTCAT GAAGTCGATG GACGAGGCCG AGGCGCTGGC GCGTGCGCTG GTGGGCACGG CGCAGGGGGC GGGCTGCATG ACATCGGCCC TCATCACCGA CATGAGCCAG CCGCTCGCGA CCGCGGCCGG CAATGCGCTC GAGGTGATCG AGGTGATGGA GACGCTGACC GGCACCTCGA TCAATGCCGC GCTCTGGGAC GTGACGGTGG CGCTCGGCGG CGAGGCCCTG GCGCTGGGCG GCCTTGCCGC CGACGCCGAG GACGGGGCGC ACCGGATCGA GCAGGCGCTG GAGAGCGGGC ATGCGGCCGA ATTCTTCGCC CGCATGGTGG CGGCGCAGGG CGGCCCGGTC GATTTCGTCG AGCGCTGGCC CGACCGGCTG CCCTCGGCCC CCGTGATGCG CGAGGTGCCG AGCCTGCGCA CGGGCTTCGT GCTGCGCATC GACACGGCGG CGCTCGGTCA GGCGGTGGTG CATCTGGGTG GCGGGCGGCT GCGCGAGACC GACCGGGTGA ATCCCTCGGT GGGTCTGGCC GATATCGCCG GGATCGGCGA GGAAGTGTCC GAGGATCTGC CGCTCGCCAT GATCCATGCC GCGACCGAGG CCGATGCCGA TGCTGCGGTG GCCGCGATTC AGGCGGCCTA TGTGATCTCG GATCAGGAAC CGGCCGAGCC GCCGCTCATC CATGCGAGGA TCGCCTGA
|
Protein sequence | MDARSINAKL RRGEVPSAAE LGWFAEGLAS GHVTDAQAGA FAMAVCLQGL GEEGRVALTR AMRDSGRVLA WDLPGPVLDK HSTGGIGDCT SLLLAPALAA CGAYVPMISG RGLGHTGGTL DKLEAIPGFR VALGEERLRA QIEDVRCAIV AADEGMAPAD RRLYLIRDVT GTVESIDLIT ASILSKKLAA GLEGLVLDVK VGSGAFMKSM DEAEALARAL VGTAQGAGCM TSALITDMSQ PLATAAGNAL EVIEVMETLT GTSINAALWD VTVALGGEAL ALGGLAADAE DGAHRIEQAL ESGHAAEFFA RMVAAQGGPV DFVERWPDRL PSAPVMREVP SLRTGFVLRI DTAALGQAVV HLGGGRLRET DRVNPSVGLA DIAGIGEEVS EDLPLAMIHA ATEADADAAV AAIQAAYVIS DQEPAEPPLI HARIA
|
| |