Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0828 |
Symbol | deoA |
ID | 5711454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 835752 |
End bp | 837059 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641266737 |
Product | thymidine phosphorylase |
Protein accession | YP_001532174 |
Protein GI | 159043380 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.61021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCGC GGACGGTGAT CGCGGCGGTG CGGGACGGCG CGCCGGTGGT GGCGGAGGAC CTGCGCGCCT TCGCCCGGGG CCTCGCCTCC GGCACGGTCA GCGATGCCCA GGCCGGGGCC TTCGCCATGG CGGTCCTGTT GCGGGGCTTG GGCGATGCGG GGCGCGTGGC GCTGACGCAA GCCATGCGCG ACAGTGGCGA TGTGCTGGCC TGGGACCTGC CCGGGCCGGT GCTCGACAAG CATTCCACGG GCGGGATCGG GGATACGGTG TCGCTCCTGC TGGCCCCGGC GCTGGCGGCC TGCGGGGCTT ACGTGCCGAT GATCTCGGGC CGGGGGCTGG GTCATACGGG CGGCACGCTC GACAAACTCG AGGCGCTGGA GGGCTACCGC ACCGACATTT CCGAGGATGC GCTGAGGGCG GTGGTGGCCG AGACCGGCTG CGCCATCGTC GGGGCCACGG GCCAGATCGC GCCCGCGGAC CGGCGGCTCT ATGCCATCCG CGACGTCACC GCCACGGTCG AAAGCATCGA CCTGATCACC GCCTCGATCC TGTCCAAGAA GCTCGCGGCG GGGCTGGGCG GGCTGGTGCT GGATGTCAAA TGCGGCTCGG GCGCGTTTCT GACCGACCCC GCAGCGGCCC GGGATCTGGC GCAGGCGCTG GTGCGCACGG CCAATGGCGC GGGCTGTCCC ACCTGGGCGC TGCTGACGGA TATGGATGAG AGCCTCGCCA GTGCCGCGGG CAATGCGCTG GAGGTGGCGG AGGTGCTGCG TATCCTGCGG GGCGACGCGA CGAGCCCGCG CCTGCGCGCG GCGACCGTGG CCCTGGGCGG GGCGCTGCTG GCCCTGGGCG GGCTGGCGGA CGACCCCGCG GACGGGGCTG CACGGATCGA CGCCACCCTG CGCGATGGCC GCGCGGCGGA GGTTTTCGCC CGCATGGTCG CTGCCCTTGG CGGGCCGCGC GACGTGGTGG CCACGGGCCT TGCCGGTTTG CCCGAGGCGC CGGTGGTGCG CGAAGTGCCC GCACCCGCGG CGGGCGTGCT GACCGCCTGC GATGCCAAGG CGCTGGGCTG GGCGGTGGTG TCCCTGGGCG GCGGACGGCA GGTGGAAACC GACCCGGTCG ACCCGGCGGT GGGCCTGTCG GATATCGCCC CGCTCGGCAC CCGCGTCGTC CGGGGCGAGC CCATCGCCCG GGTCCATGCC GCCCGCGAAG AGGCCGCCGA CCGCGCCGTG GCCGAGGTTC AGGCGGCCTT CACCCTCGGC ACGAGTTTCG ACGCGAAGCC GCTTGTCCTC GGAGAGGTCC GCCCATGA
|
Protein sequence | MTARTVIAAV RDGAPVVAED LRAFARGLAS GTVSDAQAGA FAMAVLLRGL GDAGRVALTQ AMRDSGDVLA WDLPGPVLDK HSTGGIGDTV SLLLAPALAA CGAYVPMISG RGLGHTGGTL DKLEALEGYR TDISEDALRA VVAETGCAIV GATGQIAPAD RRLYAIRDVT ATVESIDLIT ASILSKKLAA GLGGLVLDVK CGSGAFLTDP AAARDLAQAL VRTANGAGCP TWALLTDMDE SLASAAGNAL EVAEVLRILR GDATSPRLRA ATVALGGALL ALGGLADDPA DGAARIDATL RDGRAAEVFA RMVAALGGPR DVVATGLAGL PEAPVVREVP APAAGVLTAC DAKALGWAVV SLGGGRQVET DPVDPAVGLS DIAPLGTRVV RGEPIARVHA AREEAADRAV AEVQAAFTLG TSFDAKPLVL GEVRP
|
| |