Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4617 |
Symbol | deoA |
ID | 5593925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4621240 |
End bp | 4622562 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640923711 |
Product | thymidine phosphorylase |
Protein accession | YP_001461148 |
Protein GI | 157163830 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 66 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTCTCG CACAAGAAAT TATTCGTAAA AAACGTGATG GTCATGCGCT AAGCGATGAA GAAATTCGTT TCTTTATCAA CGGTATTCGC GACAACACTA TCTCCGAAGG GCAGATTGCC GCCCTCGCGA TGACCATTTT CTTCCACGAT ATGACAATGC CTGAGCGTGT CTCGCTGACC ATGGCGATGC GAGATTCAGG AACCGTTCTC GACTGGAAAA GCCTGCATCT GAATGGCCCG ATTGTTGATA AACACTCCAC CGGCGGCGTC GGCGATGTGA CTTCGCTGAT GTTGGGGCCG ATGGTCGCAG CCTGCGGCGG CTATATTCCG ATGATCTCCG GTCGCGGCCT CGGTCATACT GGCGGTACGC TCGACAAACT GGAATCCATC CCTGGCTTCG ACATTTTCCC GGATGACAAC CGTTTCCGCG AAATTATTAA AGACGTCGGC GTAGCGATTA TCGGCCAGAC CAGCTCACTG GCTCCGGCGG ATAAACGTTT CTACGCGACC CGTGATATTA CCGCAACCGT GGACTCCATC CCGCTGATCA CTGCCTCGAT CCTGGCGAAG AAACTGGCGG AAGGTCTGGA TGCGCTGGTG ATGGACGTGA AAGTGGGTAG CGGCGCGTTT ATGCCGACCT ACGAACTCTC TGAAGCCCTT GCCGAAGCGA TTGTTGGCGT GGCTAACGGC GCTGGCGTGC GCACCACCGC GCTGCTCACC GATATGAATC AGGTACTGGC CTCCAGTGCA GGTAACGCGG TTGAAGTTCG TGAAGCGGTG CAGTTCCTGA CGGGTGAATA TCGTAACCCG CGTCTGTTTG ATGTCACGAT GGCGCTGTGC GTGGAGATGC TTATCTCCGG CAAACTGGCG AAAGATGACG CCGAAGCGCG CGCGAAATTG CAAGCAGTGC TGGACAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT AGCGGCACAA AAAGGCCCAA CCGACTTCGT TGAGAACTAC GCGAAGTATC TGCCGACAGC GATGCTGACG AAAGCAGTCT ATGCTGATAC CGAAGGTTTT GTCAGTGAAA TGGATACCCG CGCGCTGGGG ATGGCAGTGG TTGCAATGGG CGGCGGTCGT CGTCAGGCAT CTGACACCAT TGATTACAGC GTCGGCTTTA CTGATATGGC GCGTTTGGGC GACCAGGTAG ACGGTCAGCG TCCGCTGGCT GTTATCCACG CGAAAGACGA AAACAGCTGG CAGGACGCGG CGAAAGCGGT GAAAGCGGCA ATTAAACTTG CCGATAAAGC ACCGGAAAGC ACACCAACTG TCTATCGCCG TATCAGCGAA TAA
|
Protein sequence | MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT MAMRDSGTVL DWKSLHLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYIP MISGRGLGHT GGTLDKLESI PGFDIFPDDN RFREIIKDVG VAIIGQTSSL APADKRFYAT RDITATVDSI PLITASILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGKLA KDDAEARAKL QAVLDNGKAA EVFGRMVAAQ KGPTDFVENY AKYLPTAMLT KAVYADTEGF VSEMDTRALG MAVVAMGGGR RQASDTIDYS VGFTDMARLG DQVDGQRPLA VIHAKDENSW QDAAKAVKAA IKLADKAPES TPTVYRRISE
|
| |