Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr7_1103 |
Symbol | deoA |
ID | 4258674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | + |
Start bp | 1273688 |
End bp | 1275019 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638121724 |
Product | thymidine phosphorylase |
Protein accession | YP_737159 |
Protein GI | 114046609 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.239025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000305013 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTCTAG CTCAAGAAAT TATACGTAAG AAACGCAATG GTTTAGCCTT AAGCACCGAA GAGATCCAGT TCTTCGTCAA GGGCATAACC ACTAATGCAG TGTCGGAAGG TCAGATCGCC GCATTAGGCA TGGCTGTGTA CTTTAATGAC ATGAATATGG ATGAAAGAAT CGCTTTGACT ACTGCTATGC GCGATTCTGG CACTGTACTC AATTGGCAAT CACTTGGTCT TAATGGCCCA GTCATCGATA AACACAGTAC TGGTGGTGTC GGTGATGTGA TTAGTCTCAT GCTCGGCCCT ATGGCTGCAG CTTGTGGTGG TTATGTGCCG ATGATTTCGG GTCGTGGACT CGGACACACA GGCGGTACGC TCGATAAGTT CGACGCTATT CCCGGTTATC AAACCGAACC TTCGAGTGAA TTGTTCCGAA AAGTAGTTAA AGACGTTGGT GTGGCGATTA TCGGCCAAAC TGGCGATCTG GTTCCCGCCG ATAAACGTTT TTACTCCATC CGTGACAACA CTGCGACCGT CGAATCCATC TCCCTCATCA CCGCCTCAAT TCTCTCTAAG AAATTAGCTT GTAGTCTCGA TGCATTGGCG ATGGACGTCA AAGTCGGTAG CGGCGCATTT ATGCCAACCT ACGAAGCCTC TGAAGAGCTT GCACGCAGTA TTGCGGCGGT AGCCAATGGC GCAGGCACTA AAACGACGGC CTTACTCACC GACATGAACC AAGTATTGGC CTCATGTGCC GGTAATGCGG TTGAAGTGAA AGAAGCCATC GATTTCTTGA CTGGTGCTTA CCGTAATCCC CGTTTATACG CTGTGACTAT GGGGCTTTGT GCCGAGATGT TACTCCTAGG CGGCCTCGCC ACCGATGAAG CGGATGCCCG TGCCAAGTTA AATCGAGTAT TAGATAACGG CCGCGCTGCC GAGATCTTTG GCAAGATGGT TTCAGGCCTC GGTGGTCCAG TCGATTTTGT TGAAAATTAC AGTAAGTACT TACCGCAATC GCAAATTATT CGCCCTGTCT TTGCTGATAC TCAAGGTTAT GCTCACAGCA TGGACACCCG TGAACTCGGT TTAGCCGTGG TCACCTTAGG AGGCGGTCGT CGTAAACCCG GTGATGCACT CGACTACAGT GTTGGTCTGA CGCAAGTTTG TGCACTTGGT GATAAGATTG ATGCTTCTAC GCCGATTGCC GTGATCCATG CTCAATCCGA AGATGCCTTT GCTCAAGCGG AAGAAGCCGT GAAAAAAGCG ATTCGTATCG ACGAAGTCGC TCCAGAAAAA ACACCTGAGA TCTATGCTTA TATCCGAGCA GCGGATCTTT AA
|
Protein sequence | MFLAQEIIRK KRNGLALSTE EIQFFVKGIT TNAVSEGQIA ALGMAVYFND MNMDERIALT TAMRDSGTVL NWQSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT GGTLDKFDAI PGYQTEPSSE LFRKVVKDVG VAIIGQTGDL VPADKRFYSI RDNTATVESI SLITASILSK KLACSLDALA MDVKVGSGAF MPTYEASEEL ARSIAAVANG AGTKTTALLT DMNQVLASCA GNAVEVKEAI DFLTGAYRNP RLYAVTMGLC AEMLLLGGLA TDEADARAKL NRVLDNGRAA EIFGKMVSGL GGPVDFVENY SKYLPQSQII RPVFADTQGY AHSMDTRELG LAVVTLGGGR RKPGDALDYS VGLTQVCALG DKIDASTPIA VIHAQSEDAF AQAEEAVKKA IRIDEVAPEK TPEIYAYIRA ADL
|
| |