Gene Information |
Locus tag | Shewana3_1042 |
Symbol | deoA |
ID | 4479387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | + |
Start bp | 1218945 |
End bp | 1220276 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639725585 |
Product | thymidine phosphorylase |
Protein accession | YP_868683 |
Protein GI | 117919491 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase; [TIGR02644] pyrimidine-nucleoside phosphorylase |
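The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (both assumptions are consistent with the values in this record, but the record itself does not state them). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1218945
END_BP = 1220276
GENE_LENGTH_BP = 1332
PROTEIN_LENGTH_AA = 443


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1332 bp span corresponds to 444 codons, of which 443 encode residues and one is the stop codon.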
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.429173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000000248089 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTCTAG CTCAGGAAAT TATACGTAAG AAACGCAATG GGTTAGCCTT AAGTACAGAA GAGATCCAGT TCTTCGTTAA GGGCATAACC ACTAATGCAG TGTCGGAAGG TCAGATCGCC GCACTAGGCA TGGCTGTGTA TTTTAATGAC ATGAATATGG ATGAAAGAAT CGCTTTGACC ACGGCAATGC GCGATTCTGG CACTGTACTC AACTGGCAAT CACTTGGTCT GAATGGCCCT GTCATCGATA AACACAGTAC AGGTGGTGTC GGTGATGTGA TTAGTCTCAT GCTCGGCCCC ATGGCTGCGG CTTGCGGTGG TTATGTGCCG ATGATTTCGG GTCGCGGACT CGGACACACA GGCGGTACGC TCGATAAGTT CGACGCTATT CCCGGTTATC AAACCGAACC TTCGAGTGAA TTGTTCCGCA AAGTAGTTAA AGACGTTGGT GTGGCGATTA TCGGCCAAAC TGGCGATCTG GTTCCCGCCG ATAAACGTTT TTATTCCATC CGTGACAACA CTGCGACCGT CGAATCCATC TCCCTCATCA CCGCCTCAAT TCTCTCTAAG AAATTAGCTT GTAGTCTCGA TGCATTGGCG ATGGACGTCA AAGTCGGTAG CGGCGCATTT ATGCCAACTT ACGAAGCCTC TGAAGAGCTT GCTCGCAGCA TTGCGGCGGT AGCCAATGGC GCAGGTACTA AAACGACGGC CTTACTCACC GACATGAACC AAGTGTTAGC CTCATGTGCG GGTAATGCGG TTGAAGTGAA AGAAGCCATC GATTTTTTAA CCGGTGCTTA CCGTAATCCT CGCCTCTACG CAGTGACTAT GGGGCTATGT GCCGAGATGT TACTCCTGGG TGGTCTGGCG AGCGATGAAG CCGATGCCCG TGCCAAGTTG AACCGCGTGC TAGACAACGG CCGTGCTGCC GAGATCTTTG GCAAGATGGT TTCAGGCCTC GGTGGCCCCG TCGATTTTGT CGAAAACTAC AGTAAGTACT TACCGCAATC ACAAATTATT CGCCCTGTCT TTGCGGATAC CCAAGGTTAT GCTTACAGCA TGGATACCCG CGAACTCGGT TTAGCCGTGG TTACCTTAGG TGGTGGTCGT CGCAAACCTG GTGATGCACT CGACTACAGT GTTGGTTTGA CGCAAGTCTG TGCCCTTGGC GATAAAATTG ATGCTTCTAC GCCGATTGCT GTGATCCACG CGCAATCTGA AGAAGCCTTT GCGCAGGCAG AAGAAGCGGT GAAAAAAGCG ATTCATATCG ATGAAGTCGC TCCAGAAAAA ACACCTGAGA TCTATGCTTA TATTCGAGCT TCGGATCTTT AA
|
Protein sequence | MFLAQEIIRK KRNGLALSTE EIQFFVKGIT TNAVSEGQIA ALGMAVYFND MNMDERIALT TAMRDSGTVL NWQSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT GGTLDKFDAI PGYQTEPSSE LFRKVVKDVG VAIIGQTGDL VPADKRFYSI RDNTATVESI SLITASILSK KLACSLDALA MDVKVGSGAF MPTYEASEEL ARSIAAVANG AGTKTTALLT DMNQVLASCA GNAVEVKEAI DFLTGAYRNP RLYAVTMGLC AEMLLLGGLA SDEADARAKL NRVLDNGRAA EIFGKMVSGL GGPVDFVENY SKYLPQSQII RPVFADTQGY AYSMDTRELG LAVVTLGGGR RKPGDALDYS VGLTQVCALG DKIDASTPIA VIHAQSEEAF AQAEEAVKKA IHIDEVAPEK TPEIYAYIRA SDL
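The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that step plus a GC-fraction calculation, using only the Python standard library. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two display blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First two display blocks of the gene sequence in this record.
    head = clean_sequence("ATGTTTCTAG CTCAGGAAAT")
    print(head, len(head))
    print(gc_fraction(head))
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field, which is reported rounded to a whole percentage.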