Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spea_3047 |
Symbol | deoA |
ID | 5663437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | - |
Start bp | 3737259 |
End bp | 3738590 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641237686 |
Product | thymidine phosphorylase |
Protein accession | YP_001502899 |
Protein GI | 157962865 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0184522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTAG CTCAAGAAAT TATTCGTAAA AAACGTAATG CTGAGACTCT TTCGACTGAA GAAATTCAAT TCTTCGTTAA AGGGATCACC AACAATACAG TTTCAGAAGG TCAGATCGCA GCCTTAGGCA TGGCGGTCTA TTTCAATGAC ATGAACATGG ATGAACGTAT TGCGTTAACT ACGGCGATGC GTGACTCCGG CACAGTTCTC AACTGGCAAT CACTGGATCT TAATGGTCCA ATTATCGATA AGCACTCAAC TGGCGGAGTG GGGGATGTCA TTAGTTTGAT GTTAGGCCCT ATGGCGGCTG CGTGTGGTGG TTATGTACCC ATGATCTCGG GTCGTGGCCT TGGCCATACC GGTGGTACTC TCGATAAGTT TGATGCCATT CCAGGTTATA ATACCGAGCC GGACAGCGCA CTGTTTCGCA AGGTAGTAAA AGAGGCTGGC GTGGCCATTA TTGGTCAAAC TGGCGATCTG GTTCCTGCCG ATAAACGTTT CTATTCTATT CGCGATAATA CCGCAACGGT TGAGTCCATC TCTTTAATCA CAGCTTCCAT TCTTTCTAAA AAGCTCGCCG CAGGTTTAGA TGCATTAGCC ATGGACGTTA AAGTCGGTAC TGGCGCATTT ATGCCAACCT ATGAAGCGTC AGAAGAGTTA GCGCGCAGTA TTACTGCCGT TGCTAATGGT GCAGGCACTA AGACAACAGC ATTACTCACC GACATGAACC AGGTATTGGC ATCATGTGCC GGTAACGCAT TAGAAGTTAA AGAAGCAGTC GACTTTATGA CTGGTGCATA TCGCAACCCA CGCCTTTACG AAGTGACTAT GGGCTTGTGT GCAGAAATGC TGGTATTAGG TGGCCTTGCA AGCAACGAGA GCGAAGCTCG CGTTAAGTTA AATACCGTAC TTGATAACGG TAAGGCGGCT GAGATATTCG GCCGTATGGT TTCAGGTTTA GGTGGTCCAG CTGATTTCGT TGAGAACTAC AGCAAGTACC TGCCTGATTC GCAAATTATT CGTCCTGTTT ACGCGGATAG ATCCGGTTTT GCTTCGGCAA TGGATACCCG TGAACTTGGA CTTGCAGTAG TGACCTTGGG CGGTGGTCGC CGCAAGCCTG GGGATGCTCT CGATTACAGT GTTGGACTTT CTAAAGTTTG CGCACTAGGT GACGAGATTA ACCCTGAGCA ACCTATTGCA TTTATTCACG CCCAATCTGA AAGCGCCTTT GCTGAAGCGG AAGCTGCAGT GAAAAAAGCG ATTCATATTG GCGACAGCAA ACCAGAGAAA ACTCCGGAAA TATATCGTTA TATCCGCGAG TCAGATCTGT AA
|
Protein sequence | MFLAQEIIRK KRNAETLSTE EIQFFVKGIT NNTVSEGQIA ALGMAVYFND MNMDERIALT TAMRDSGTVL NWQSLDLNGP IIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT GGTLDKFDAI PGYNTEPDSA LFRKVVKEAG VAIIGQTGDL VPADKRFYSI RDNTATVESI SLITASILSK KLAAGLDALA MDVKVGTGAF MPTYEASEEL ARSITAVANG AGTKTTALLT DMNQVLASCA GNALEVKEAV DFMTGAYRNP RLYEVTMGLC AEMLVLGGLA SNESEARVKL NTVLDNGKAA EIFGRMVSGL GGPADFVENY SKYLPDSQII RPVYADRSGF ASAMDTRELG LAVVTLGGGR RKPGDALDYS VGLSKVCALG DEINPEQPIA FIHAQSESAF AEAEAAVKKA IHIGDSKPEK TPEIYRYIRE SDL
|
| |