Gene Information |
Locus tag | ECH74115_5897 |
Symbol | deoA |
ID | 6971427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5549695 |
End bp | 5551017 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643389512 |
Product | thymidine phosphorylase |
Protein accession | YP_002273903 |
Protein GI | 209398702 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase; [TIGR02644] pyrimidine-nucleoside phosphorylase |
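
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 5549695
end_bp = 5551017

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1323 440
```

Under those assumptions the computed values agree with the Gene Length (1323 bp) and Protein Length (440 aa) fields.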
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTTCTCG CACAAGAAAT TATTCGTAAA AAACGTGATG GTCATGCGCT GAGCGATGAA GAAATTCGTT TCTTTATCAA CGGTATTCGC GACAACACTA TCTCCGAAGG GCAGATTGCC GCCCTCGCGA TGACCATTTT CTTCCACGAT ATGACAATGC CTGAGCGTGT CTCGCTGACC ATGGCGATGC GAGATTCAGG AACCGTTCTC GACTGGAAAA GCCTGCATCT GAATGGCCCG ATTGTTGATA AACACTCCAC CGGTGGCGTC GGCGATGTGA CTTCGCTGAT GTTGGGGCCG ATGGTCGCAG CCTGCGGCGG CTATATTCCG ATGATCTCTG GTCGCGGCCT CGGTCATACT GGCGGTACGC TCGACAAACT GGAATCCATC CCTGGCTTCG ACATTTTCCC GGATGACAAC CGTTTCCGCG AAATTATTAA AGACGTCGGC GTGGCGATTA TCGGTCAGAC CAGCTCACTG GCTCCGGCGG ATAAACGTTT CTACGCGACC CGTGATATTA CCGCAACCGT GGACTCCATC CCGCTGATCA CTGCCTCGAT CCTGGCGAAG AAACTGGCGG AAGGTCTGGA CGCGCTGGTG ATGGACGTGA AAGTGGGTAG CGGCGCGTTT ATGCCGACCT ACGAACTCTC TGAAGCCCTT GCCGAAGCGA TTGTTGGCGT GGCTAACGGC GCAGGCGTGC GCACCACCGC ACTGCTCACC GACATGAATC AGGTGCTGGC CTCCAGTGCC GGTAACGCGG TTGAAGTACG TGAAGCGGTG CAGTTCCTGA CGGGTGAATA TCGTAACCCG CGTCTGTTTG ATGTCACGAT GGCGCTGTGT GTGGAGATGC TTATCTCCGG CAAACTGGCG AAAGATGACG CCGAAGCGCG CGCGAAATTG CAGGCGGTGC TGGACAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT AGCGGCACAA AAAGGCCCGA CCGACTTCGT TGAGAACTAC GCGAAGTATC TGCCGACAGC GATGCTGACG AAAGCAGTCT ATGCTGATAC CGAAGGTTTT GTCAGTGAAA TGGATACCCG CGCGCTGGGG ATGGCAGTGG TTGCAATGGG CGGCGGACGC CGTCAGGCAT CTGACACCAT CGATTACAGC GTCGGCTTTA CTGATATGGC ACGTCTGGGC GACCAGGTAG ACGGCCAGCG TCCGCTGGCG GTTATCCACG CGAAAGACGA AAACAGCTGG CAGGAAGCGG CGAAAGCGGT GAAAGCGGCA ATTAAACTTG CCGATAAAGC ACCGGAAAGC ACACCAACTG TCTATCGCCG TATCAGCGAA TAA
|
Protein sequence | MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT MAMRDSGTVL DWKSLHLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYIP MISGRGLGHT GGTLDKLESI PGFDIFPDDN RFREIIKDVG VAIIGQTSSL APADKRFYAT RDITATVDSI PLITASILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGKLA KDDAEARAKL QAVLDNGKAA EVFGRMVAAQ KGPTDFVENY AKYLPTAMLT KAVYADTEGF VSEMDTRALG MAVVAMGGGR RQASDTIDYS VGFTDMARLG DQVDGQRPLA VIHAKDENSW QEAAKAVKAA IKLADKAPES TPTVYRRISE
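
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, which uses the standard codon assignments but accepts several alternative initiation codons that are read as methionine at the start of a CDS. The sketch below illustrates this on the first 30 bases of the gene sequence. The function name `translate_cds` and the constant names are hypothetical, chosen for illustration only; this is a simplified reading of table 11, not the pipeline that produced the record.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of table 11; at the first position they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiation codon as M and stopping at '*'."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the record's spacing kept.
print(translate_cds("TTGTTTCTCG CACAAGAAAT TATTCGTAAA"))  # MFLAQEIIRK
```

The result matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the whole protein, provided the sequence contains only unambiguous A, C, G and T bases.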
|
| |