Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1038 |
Symbol | deoA |
ID | 4251111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 1211341 |
End bp | 1212672 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638117611 |
Product | thymidine phosphorylase |
Protein accession | YP_733175 |
Protein GI | 113969382 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.101588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000000837764 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTCTAG CTCAGGAAAT TATACGTAAG AAACGCAATG GGTTAGCCTT AAGCGCCGAA GAGATCCAGT TCTTCGTTAA GGGTATAACC ACTAATGCAG TGTCGGAAGG TCAGATCGCC GCATTAGGCA TGGCTGTGTA TTTTAATGAC ATGAATATGG ATGAAAGAAT CGCTTTGACC ACGGCAATGC GCGATTCTGG CACTGTACTC AATTGGCAAT CACTTGGTCT TAATGGCCCA GTCATCGATA AACACAGTAC TGGTGGTGTC GGTGATGTGA TTAGTCTCAT GCTCGGCCCC ATGGCTGCGG CTTGCGGTGG TTATGTGCCG ATGATTTCGG GTCGCGGACT CGGACACACA GGCGGTACGC TCGATAAGTT CGACGCTATT CCCGGTTATC AAACCGAACC TTCGAGTGAA TTGTTCCGCA AAGTAGTTAA AGACGTTGGT GTGGCGATTA TCGGCCAAAC TGGCGATCTG GTTCCCGCCG ATAAACGTTT TTATTCCATC CGTGACAACA CTGCGACCGT TGAATCCATC TCCCTCATTA CCGCCTCTAT TCTCTCTAAG AAATTAGCTT GTAGTCTCGA TGCATTGGCG ATGGACGTCA AAGTCGGTAG CGGCGCATTT ATGCCAACCT ACGAAGCCTC TGAAGAGCTT GCACGCAGTA TTGCGGCGGT AGCTAATGGC GCAGGCACTA AAACGACGGC CTTACTCACC GACATGAACC AAGTATTGGC CTCATGTGCG GGTAATGCGG TTGAAGTGAA AGAAGCCATC GACTTCCTAA CTGGTGCTTA TCGTAATCCT CGTCTGTACG CTGTGACTAT GGGGCTTTGT GCCGAGATGT TACTCCTAGG CGGCCTCGCC ACCGATGAAG CGGATGCCCG TGCCAAGTTA AATCGAGTAT TAGATAACGG CCGCGCTGCC GAGATCTTTG GCAAGATGGT TTCAGGCCTC GGTGGCCCAG TCGATTTTGT TGAAAATTAC AGTAAGTACT TACCGCAATC GCAAATTATT CGCCCTGTCT TTGCGGATAC CCAAGGTTAT GCCCACAGCA TGGACACCCG TGAACTCGGT TTAGCCGTGG TTACCTTAGG TGGTGGTCGT CGCAAGCCTG GTGATGCACT CGACTACAGT GTTGGTCTGA CGCAAGTCTG TGCCCTTGGC GATAAGATTG ATGCTTCAAC GCCGATTGCC GTGATCCACG CGCAATCTGA AGATGCCTTT GCTCAGGCGG AAGAAGCCGT GAAAAAAGCG ATTCGTATTG ATGAAGTCGC TCCAGAAAAA ACACCTGAGA TCTATGCTTA TATCCGAGCA GCGGATCTTT AA
|
Protein sequence | MFLAQEIIRK KRNGLALSAE EIQFFVKGIT TNAVSEGQIA ALGMAVYFND MNMDERIALT TAMRDSGTVL NWQSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT GGTLDKFDAI PGYQTEPSSE LFRKVVKDVG VAIIGQTGDL VPADKRFYSI RDNTATVESI SLITASILSK KLACSLDALA MDVKVGSGAF MPTYEASEEL ARSIAAVANG AGTKTTALLT DMNQVLASCA GNAVEVKEAI DFLTGAYRNP RLYAVTMGLC AEMLLLGGLA TDEADARAKL NRVLDNGRAA EIFGKMVSGL GGPVDFVENY SKYLPQSQII RPVFADTQGY AHSMDTRELG LAVVTLGGGR RKPGDALDYS VGLTQVCALG DKIDASTPIA VIHAQSEDAF AQAEEAVKKA IRIDEVAPEK TPEIYAYIRA ADL
|
| |