Gene Information |
Locus tag | Sfri_1003 |
Symbol | deoA |
ID | 4278377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella frigidimarina NCIMB 400 |
Kingdom | Bacteria |
Replicon accession | NC_008345 |
Strand | + |
Start bp | 1173712 |
End bp | 1175043 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638133771 |
Product | thymidine phosphorylase |
Protein accession | YP_749694 |
Protein GI | 114562181 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
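The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(1173712, 1175043)
print(length)                     # 1332
print(protein_length_aa(length))  # 443
```

Both results agree with the Gene Length (1332 bp) and Protein Length (443 aa) fields listed in the record.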
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.028852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTAG CTCAAGAAAT TATTCGTAAA AAACGTAATG GCGATGTATT AAGCACTGCA GAAATTCAAT TCTTTGTTGA TGGCATTACT CATAACACGG TTTCAGAAGG CCAAATTGCT GCGTTCGGCA TGGCAGTGTA TTTCAAAGAT ATGAATATGG ATGAACGGAT TGCATTAACG ATTGCGATGC GTGATTCCGG AACCGTATTA AATTGGGATT CTCTGGGCCT CAATGGCCCG ATTATCGATA AGCATAGCAC TGGTGGTGTT GGCGATGTAA TCAGCCTAAT GCTTGGCCCT ATGGCTGCAG CTTGTGGCGG TTATGTGCCA ATGATTTCAG GTCGTGGCTT AGGTCACACC GGTGGTACGT TAGATAAATT TGATGCCATT CCTGGTTATC AAACTGAGCC TTCAAGTGAA TTATTCCGCA AAGTGGTTAA AGACGCTGGT GTTGCCATTA TTGGTCAAAC CGGTGACTTA GTCCCTGCTG ATAAGCGTTT CTATTCGATT CGTGATAATA CCGCCACCGT TGAATCTATT TCGTTAATCA CCGCATCTAT TTTATCTAAA AAGTTAGCCG CGGGGCTTGA TGCTCTAGCA ATGGATGTCA AAGTCGGCAG CGGTGCATTT ATGCCAACTT ATGAAGCATC TGAAGAATTA GCTCGCAGTA TTACTGCTGT TGCTAATGGC GCAGGAACTA AAACAACGGC ATTGTTAACT GACATGAACC AAGTATTGGC TTCGTGTGCA GGTAATGCGG TTGAGGTTCG TGAGGCGATA AACTTTTTGA CTGGTCAATA TCGTAATCCA CGTTTATATG CTGTCACCAT GGGATTATGT GCAGAAATGC TAATTTTGGG TGGTATTGCA CAGAATGAAG CTGAAGCTCG TCATAAACTA AATACTGTAT TAGATAATGG TAAAGCAGCA GAGGCTTTTG CTAAAATGGT ATCTGGCTTA GGTGGCCCAA CAGACTTTGT TGAGGCATAT GATAAGTATT TACCTCATGC GAAAATTGTA CGACCTGTAT ATGCCAACAC ATCTGGCTTT GCTTACAAAA TGGATACACG AGAACTTGGT TTAGCCGTAG TGACCTTAGG TGGTGGACGT CGCAAACCAG GTGATGCACT GGATTACAGT GTTGGATTAA CTCAAGTATG CGCATTGGGT CAGGAAGTTA ACAAAGACGT TCCGTTAGCG ATGATCCATG CTCAATCTGA AGATGCTTTC GCCGAAGCTG CCGCTGCTAT TCAGCAAGCC ATTATTATTG GTGATAGCGC GCCTGAGAAA ACGCCTGAAA TATATCGTTA TATTCGCGCT TCAGATTTAT AA
|
Protein sequence | MFLAQEIIRK KRNGDVLSTA EIQFFVDGIT HNTVSEGQIA AFGMAVYFKD MNMDERIALT IAMRDSGTVL NWDSLGLNGP IIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT GGTLDKFDAI PGYQTEPSSE LFRKVVKDAG VAIIGQTGDL VPADKRFYSI RDNTATVESI SLITASILSK KLAAGLDALA MDVKVGSGAF MPTYEASEEL ARSITAVANG AGTKTTALLT DMNQVLASCA GNAVEVREAI NFLTGQYRNP RLYAVTMGLC AEMLILGGIA QNEAEARHKL NTVLDNGKAA EAFAKMVSGL GGPTDFVEAY DKYLPHAKIV RPVYANTSGF AYKMDTRELG LAVVTLGGGR RKPGDALDYS VGLTQVCALG QEVNKDVPLA MIHAQSEDAF AEAAAAIQQA IIIGDSAPEK TPEIYRYIRA SDL
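The gene and protein sequences above are shown in space-separated blocks of ten characters. As a rough sketch of how they relate, the code below strips the block spacing, computes a GC fraction, and translates a CDS codon by codon. It assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGTTTTTAG CTCAAGAAAT TATTCGTAAA"
print(translate(prefix))  # MFLAQEIIRK
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content field and the complete protein sequence.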