Gene Information |
Locus tag | Sbal223_1141 |
Symbol | deoA |
ID | 7087594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 1342473 |
End bp | 1343804 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643460052 |
Product | thymidine phosphorylase |
Protein accession | YP_002357079 |
Protein GI | 217972328 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
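The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (neither is stated in the record itself). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1342473, 1343804)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Both results match the Gene Length (1332 bp) and Protein Length (443 aa) fields listed in the record.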
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000000198917 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTCTAG CTCAAGAGAT TATTCGTAAA AAACGTAATG GTTTAGCGCT AAGTTCCGAA GAAATACAGT TCTTTGTTCA AGGTATTACC ACCAACTCCG TATCTGAAGG TCAGATCGCC GCATTAGGCA TGGCGGTGTA TTTTAATGAC ATGAATATGG ATGAAAGAAT CGCTTTAACG ACAGCAATGC GTGATTCTGG CACTGTGCTT AATTGGCAAT CATTGGGACT CAACGGTCCA GTTATCGACA AACACAGCAC TGGTGGTGTC GGCGATGTGA TTAGTCTCAT GCTTGGCCCT ATGGCTGCGG CTTGTGGCGG TTATGTGCCT ATGATTTCTG GGCGCGGCCT AGGTCATACT GGCGGTACAC TCGATAAGTT TGATGCCATT CCGGGTTACC AAACAGAGCC TTCAAGTGAA TTGTTCCGCA AAGTGGTTAA AGATGTCGGG GTGGCGATTA TTGGCCAGAC GGGCGATCTC GTTCCAGCCG ATAAACGCTT CTATTCTATT CGCGATAATA CCGCCACCGT TGAATCCATT TCCCTCATCA CAGCATCGAT TTTGTCTAAG AAATTAGCCT GTAATTTAGA TGCGTTGGCG ATGGACGTAA AAGTCGGTAG CGGCGCTTTC ATGCCAACCT ATGAGGCATC TGAAGAATTA GCTCGCAGTA TTGCAGCTGT TGCTAATGGT GCGGGTACTA AAACGACGGC TTTACTTACC GACATGAATC AAGTGCTTGC ATCTTGTGCA GGTAACGCGG TTGAAGTGAA AGAAGCCATC GACTTTTTAA CGGGTGCTTA CCGTAACCCG CGTTTATATG AAGTCACTAT GGGTCTTTGT GCTGAGATGC TGCTCCTTGG CGGTCTTGCA AGCAATGAAG CCGATGCTCG CGCTAAACTG AATCGTGTAC TCGACAATGG TCGCGCTGCA GAACTCTTTG GCAAGATGGT GTCGGGTCTT GGTGGTCCGG TTGATTTTGT TGAAAACTAC AGTAAATACC TGCCGCAGTC ACAAATTATT CGTCCCGTCT TTGCCGATAT GCAAGGTTAT GCCTATAGCA TGGATACCCG TGAGTTAGGT TTAGCGGTTG TGACCTTAGG TGGCGGCCGC CGTAAGCCCG GCGATGCACT AGACTATAGT GTTGGCTTAA CCCAAGTGTG TGCCCTAGGC GATAAAGTGG ATTCATCGAC GCCGATTGCC GTTATCCATG CACAATCTGA AGCCGCGTTC GCAGAAGCTG AACTTGCGGT GAAAAAAGCG ATTCACATTG GTGAAACCGC TCCAGAAAAA ACACCTGAGA TCTATGCCTA TATTCGTGCA TCGGATCTTT AA
|
Protein sequence | MFLAQEIIRK KRNGLALSSE EIQFFVQGIT TNSVSEGQIA ALGMAVYFND MNMDERIALT TAMRDSGTVL NWQSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT GGTLDKFDAI PGYQTEPSSE LFRKVVKDVG VAIIGQTGDL VPADKRFYSI RDNTATVESI SLITASILSK KLACNLDALA MDVKVGSGAF MPTYEASEEL ARSIAAVANG AGTKTTALLT DMNQVLASCA GNAVEVKEAI DFLTGAYRNP RLYEVTMGLC AEMLLLGGLA SNEADARAKL NRVLDNGRAA ELFGKMVSGL GGPVDFVENY SKYLPQSQII RPVFADMQGY AYSMDTRELG LAVVTLGGGR RKPGDALDYS VGLTQVCALG DKVDSSTPIA VIHAQSEAAF AEAELAVKKA IHIGETAPEK TPEIYAYIRA SDL
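The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a rough sketch, not the pipeline that produced the record: it assumes the space-separated 10-base groups shown above are purely cosmetic, uses the codon assignments of translation table 11 (which are the same as the standard code), and ignores table 11's alternative start codons, which do not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
first_30 = "ATGTTTCTAG CTCAAGAGAT TATTCGTAAA"
print(translate(first_30))  # MFLAQEIIRK
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the GC content field to be compared in the same way.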