Gene Information |
Locus tag | Shew_2814 |
Symbol | deoA |
ID | 4922484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella loihica PV-4 |
Kingdom | Bacteria |
Replicon accession | NC_009092 |
Strand | - |
Start bp | 3334338 |
End bp | 3335669 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640164409 |
Product | thymidine phosphorylase |
Protein accession | YP_001094939 |
Protein GI | 127513742 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase; [TIGR02644] pyrimidine-nucleoside phosphorylase |
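
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record's values are consistent with both assumptions, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3334338
end_bp = 3335669

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1332, matches "Gene Length"
print(protein_length)  # 443, matches "Protein Length"
```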
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000364581 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000355939 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTTTCTAG CACAAGAGAT TATTCGTAAA AAACGTAATG GCGAAGTGTT GTCTACCCAA GAGATCCAAT TTTTCGTTCA GGGGATCACC AACAACACAG TTTCTGAAGG CCAAATCGCT GCACTGGGTA TGGCGGTCTA TTTCAAAGAC ATGAACATGG ATGAACGTAT TGCGCTGACG ACAGCCATGC GTGACTCTGG CACCGTCCTT AACTGGAAAA ACCTCGGCCT CGACGGCCCT ATCATCGACA AGCACAGCAC TGGTGGTGTG GGCGATGTGA TCAGCCTGAT GCTTGGCCCC ATGGCTGCCG CCTGTGGCGG ATATGTGCCC ATGATCTCCG GCCGTGGCCT GGGTCATACT GGCGGAACCC TGGATAAGTT CGATGCGATT CCAGGTTACC AGACTGAGCC AGACAGCGAC CTGTTCAGAA AAGTCGTTAA AGAAGCCGGT GTCGCCATCA TAGGTCAAAC TGGAGACCTG GTCCCCGCCG ATAAGCGCTT CTATTCTATT CGCGATAATA CCGCAACCGT TGAATCCATC TCTCTCATCA CAGCTTCTAT TCTTTCTAAG AAACTTGCCG CCGGTCTCGA TGCATTAGCC ATGGACGTTA AGGTAGGCAG TGGCGCCTTC ATGCCAACTT ATGAAGCCTC TGAAGAGCTG GCTCGCAGCA TTACCGCGGT AGCCAATGGT GCCGGTACTA AGACGACGGC TCTGCTGACC GACATGAACC AAGTATTGGC ATCATGTGCC GGTAACGCGG TAGAAGTGAA AGAAGCGGTG GATTTCCTGA CCGGTGCATA CCGTAATCCA CGCTTGTATG AAGTGACTAT GGGTCTGTGC GCCGAGATGT TGCAGCTAGG TGGCCTGGCC GCCAGCGAAG CCGATGCCCG TGAGAAACTC AACCGCGTGC TGGATAACGG TAAGGCTGCC GATATCTTCG GCCGCATGAT TGCCGGCCTG GGTGGCCCAG CCGACTTTAT CGAAAACTAT GCTAAGTATC TGCCACAGTC GCAGATCATT CGTCCTGTTT ACGCCGACCG TAGCGGTTTC GCGGCCTCTA TGGATACCCG TGAGCTAGGC CTTGCGGTAG TAACCCTAGG CGGTGGACGT CGTAAGCCAG GTGATGCCCT AGATTACAGC GTGGGCTTGA CCCAAGTCTG TGCGCTGGGC GATGAGATCA CCTCAGATAA GCCGATTGCC ATGGTACATG CGCAATCTGA AAGCGCCTTC GAAGAAGCGG CGGCAGCCGT GAAGAAAGCG ATTCACATTG GCGATGAAGC GCCAGAGAAA ACACCAGAGA TCTATCGTTA CATTCGTCAG TCAGATCTCT AG
|
Protein sequence | MFLAQEIIRK KRNGEVLSTQ EIQFFVQGIT NNTVSEGQIA ALGMAVYFKD MNMDERIALT TAMRDSGTVL NWKNLGLDGP IIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT GGTLDKFDAI PGYQTEPDSD LFRKVVKEAG VAIIGQTGDL VPADKRFYSI RDNTATVESI SLITASILSK KLAAGLDALA MDVKVGSGAF MPTYEASEEL ARSITAVANG AGTKTTALLT DMNQVLASCA GNAVEVKEAV DFLTGAYRNP RLYEVTMGLC AEMLQLGGLA ASEADAREKL NRVLDNGKAA DIFGRMIAGL GGPADFIENY AKYLPQSQII RPVYADRSGF AASMDTRELG LAVVTLGGGR RKPGDALDYS VGLTQVCALG DEITSDKPIA MVHAQSESAF EEAAAAVKKA IHIGDEAPEK TPEIYRYIRQ SDL
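
As a rough consistency check between the two sequence fields, the sketch below translates a nucleotide string and computes its GC percentage. It is an illustration under stated assumptions, not the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the gene sequence is assumed to be shown 5' to 3' on the coding strand (it starts with ATG and ends with TAG even though the gene lies on the minus strand), and the codon assignments are those of the standard code, which translation table 11 shares apart from its set of alternative start codons.

```python
# Codon table in NCBI order (bases T, C, A, G). Amino-acid assignments
# are the same for translation table 1 and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGTTTCTAG CACAAGAGAT TATTCGTAAA"
print(translate(first_blocks))  # MFLAQEIIRK, the first block of the protein
```

Passing the full "Gene sequence" value to `translate` and comparing the result with the "Protein sequence" value (after removing spaces) would extend the same check to the whole record; `gc_percent` on the full sequence can be compared with the "GC content" field in the same way.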
|
| |