Gene Information |
Locus tag | Sbal195_3365 |
Symbol | deoA |
ID | 5755169 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 3973675 |
End bp | 3975006 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641289698 |
Product | thymidine phosphorylase |
Protein accession | YP_001555787 |
Protein GI | 160876471 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase; [TIGR02644] pyrimidine-nucleoside phosphorylase |
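
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, although both agree with the listed values. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3973675
END_BP = 3975006
STOP_CODON_LENGTH = 3  # the terminal stop codon is not translated


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len - STOP_CODON_LENGTH) // 3


length = gene_length(START_BP, END_BP)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Both results match the Gene Length (1332 bp) and Protein Length (443 aa) fields. The strand does not affect this check, because the length of a feature is the same on either strand.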
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.604066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00212197 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTCTAG CTCAAGAGAT TATTCGTAAA AAACGTAATG GTTTAGCGCT AAGTTCCGAA GAAATACAGT TCTTTGTTCA AGGTATCACC ACCAACTCTG TATCTGAAGG TCAGATCGCC GCATTAGGTA TGGCGGTGTA TTTTAATGAC ATGAATATGG ATGAAAGAAT CGCTTTAACG ACAGCAATGC GTGATTCTGG CACTGTGCTT AATTGGCAAT CATTGGGACT CAACGGTCCT GTCATCGACA AACACAGCAC TGGTGGTGTC GGCGATGTGA TTAGTCTCAT GCTTGGCCCT ATGGCTGCGG CGTGTGGCGG TTATGTGCCT ATGATTTCTG GGCGCGGCCT AGGTCATACT GGCGGTACAC TCGATAAGTT TGATGCCATT CAGGGTTACC AAACTGAGCC TTCAAGTGAA TTGTTCCGCA AAGTGGTTAA AGAAGTCGGG GTGGCGATTA TTGGCCAAAC GGGCGATCTC GTCCCTGCCG ATAAACGCTT CTATTCTATT CGCGATAATA CCGCCACCGT TGAATCCATT TCCCTCATCA CTGCGTCGAT TCTGTCTAAG AAATTAGCCT GTAATTTAGA TGCGTTGGCG ATGGACGTAA AAGTTGGTAG CGGCGCTTTC ATGCCAACCT ATGAGGCATC GGAAGAATTA GCTCGCAGTA TCGCCGCCGT TGCAAATGGC GCGGGTACTA AAACGACGGC TTTACTTACC GATATGAATC AAGTGCTTGC ATCTTGTGCA GGTAACGCGG TTGAAGTGAA AGAAGCCATC GACTTTTTAA CGGGTGCTTA CCGTAACCCG CGTTTATATG AAGTCACTAT GGGTCTTTGC GCTGAGATGT TGCTTCTGGG CGGTCTTGCA AGCAATGAAG CCGATGCTCG CGCTAAACTG AATCGTGTAC TCGATAATGG TCGCGCTGCA GAACTCTTTG GCAAGATGGT GTCGGGTCTT GGTGGTCCGG TTGATTTTGT TGAAAACTAC AGTAAATACC TACCACAGTC ACAAATTATT CGCCCAGTTT TTGCCGATAT GCAAGGTTAT GCCTACAGCA TGGATACCCG TGAGTTAGGT TTAGCGGTTG TGACCTTAGG TGGCGGCCGC CGTAAGCCCG GCGATACACT AGACTATAGT GTTGGCTTAA CCCAAGTGTG TGCCCTAGGC GATAAAGTGG ATTCATCGAC GCCGATTGCC GTTATCCATG CACAGTCTGA AGCCGCCTTC GCGGAAGCGG AACTTGCGGT GAAAAAAGCG ATTCACATTG GTGAAACCGC TCCAGAAAAA ACACCTGAGA TCTATGCCTA TATTCGTGCA TCGGATCTTT AA
|
Protein sequence | MFLAQEIIRK KRNGLALSSE EIQFFVQGIT TNSVSEGQIA ALGMAVYFND MNMDERIALT TAMRDSGTVL NWQSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT GGTLDKFDAI QGYQTEPSSE LFRKVVKEVG VAIIGQTGDL VPADKRFYSI RDNTATVESI SLITASILSK KLACNLDALA MDVKVGSGAF MPTYEASEEL ARSIAAVANG AGTKTTALLT DMNQVLASCA GNAVEVKEAI DFLTGAYRNP RLYEVTMGLC AEMLLLGGLA SNEADARAKL NRVLDNGRAA ELFGKMVSGL GGPVDFVENY SKYLPQSQII RPVFADMQGY AYSMDTRELG LAVVTLGGGR RKPGDTLDYS VGLTQVCALG DKVDSSTPIA VIHAQSEAAF AEAELAVKKA IHIGETAPEK TPEIYAYIRA SDL
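
The relationship between the two sequences above can be checked with a short script. This is a minimal sketch, not the method used by the database: it strips the spaces that split the sequence into blocks of ten, computes GC content, and translates codons until the first stop. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in its set of permitted start codons. Since this gene starts with ATG, alternative start codons are not handled. All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon assignments and differs only in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
gene_start = "ATGTTTCTAG CTCAAGAGAT TATTCGTAAA"
print(translate(gene_start))  # MFLAQEIIRK
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content. The gene sequence is already given 5' to 3' in coding orientation (it begins with ATG and ends with TAA), so no reverse complement is needed even though the gene lies on the minus strand.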