Gene Information |
Locus tag | SeD_A4983 |
Symbol | deoA |
ID | 6872012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4811862 |
End bp | 4813184 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642787853 |
Product | thymidine phosphorylase |
Protein accession | YP_002218443 |
Protein GI | 198242938 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
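The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the numbers only agree under that reading). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4811862
end_bp = 4813184

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under that assumption, the computed values match the 1323 bp gene length and the 440 aa protein length given in the table.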
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.615857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 98 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTCTCG CACAAGAAAT TATTCGTAAA AAGCGTGATG GTCATGCGTT GAGTGACGAA GAAATTCGTT TCTTTATTAA TGGTATTCGT GACAATACTA TCTCTGAAGG GCAGATTGCC GCCCTGGCGA TGACCATCTT CTTCCACGAT ATGACCATGC CGGAGCGTGT TTCGCTGACC ATGGCGATGC GGGATTCCGG TACTGTCCTT GACTGGAAAA GCCTGAATCT CAATGGCCCG ATTGTCGATA AGCATTCGAC CGGCGGCGTA GGGGACGTGA CGTCTCTGAT GTTGGGGCCA ATGGTAGCGG CCTGCGGCGG TTATGTGCCG ATGATCTCCG GTCGCGGTCT CGGACATACC GGCGGTACGC TCGACAAACT GGAAGCGATC CCGGGCTTCG ACATCTTCCC GGACGACAAC CGTTTCCGCG AAATTATTCA AGACGTGGGT GTGGCGATTA TTGGGCAAAC CAGCTCGCTT GCACCGGCGG ACAAACGTTT TTACGCCACC CGCGATATTA CCGCGACGGT GGACTCTATT CCGCTGATCA CTGGTTCCAT CCTCGCCAAG AAACTGGCCG AAGGGCTGGA TGCGCTGGTA ATGGACGTCA AAGTCGGCAG TGGCGCGTTT ATGCCAACCT ATGAACTTTC TGAAGCCCTT GCTGAAGCGA TTGTTGGCGT GGCAAACGGC GCGGGAGTTC GCACTACGGC TTTGTTAACC GATATGAACC AGGTGCTGGC TTCGAGCGCC GGTAACGCGG TGGAAGTGCG TGAAGCCGTG CAGTTCCTGA CCGGAGAATA CCGCAATCCG CGCTTGTTTG ACGTTACCAT GGCGCTATGC GTGGAGATGC TGATCTCCGG CCAGCTGGCG AAAGACGACG CCGAAGCGCG TGCGAAATTA CAGGCGGTGC TGGATAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT GGCCGCGCAG AAAGGGCCAA GCGATTTCGT TGAGAACTAC GATAAATACT TGCCGACCGC CATGTTGAGC AAAGCGGTAT ATGCTGATAC CGAAGGGTTT ATCAGCGCAA TGGATACGCG TGCGCTGGGG ATGGCGGTCG TCTCGATGGG CGGCGGCCGT CGTCAGGCGT CAGATACCAT TGATTACAGC GTTGGCTTTA CCGACATGGC CCGTCTGGGC GACAGCATCG ACGGGCAGCG CCCGCTGGCG GTGATTCATG CCAAAGACGA AGCCAGTTGG CAGGAAGCGG CGAAGGCCGT CAAAGCGGCA ATTATCCTTG ACGATAAAGC GCCAGCAAGC ACACCTTCGG TCTATCGTCG AATTACTGAA TAG
|
Protein sequence | MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT MAMRDSGTVL DWKSLNLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYVP MISGRGLGHT GGTLDKLEAI PGFDIFPDDN RFREIIQDVG VAIIGQTSSL APADKRFYAT RDITATVDSI PLITGSILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGQLA KDDAEARAKL QAVLDNGKAA EVFGRMVAAQ KGPSDFVENY DKYLPTAMLS KAVYADTEGF ISAMDTRALG MAVVSMGGGR RQASDTIDYS VGFTDMARLG DSIDGQRPLA VIHAKDEASW QEAAKAVKAA IILDDKAPAS TPSVYRRITE
|
| |
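The protein sequence can be checked against the gene sequence by translating it with translation table 11, the code named in the Gene Information section. The sketch below is a hedged, standard-library-only illustration, not the database's own pipeline. The helper `translate` and the constant names are hypothetical. It assumes the common annotation convention that an alternative start codon such as GTG is read as methionine when it is the first codon.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Codons are enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring the spaces used to group bases in the record."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # Assumption: an initiating start codon is translated as methionine.
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "GTGTTTCTCG CACAAGAAAT TATTCGTAAA"
print(translate(prefix))  # MFLAQEIIRK
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function allows the whole protein to be compared in the same way.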