Gene Information |
Locus tag | EcolC_3674 |
Symbol | deoA |
ID | 6066547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4023324 |
End bp | 4024646 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603089 |
Product | thymidine phosphorylase |
Protein accession | YP_001726612 |
Protein GI | 170021658 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase; [TIGR02644] pyrimidine-nucleoside phosphorylase |
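
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not state this convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4023324
end_bp = 4024646

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
# The strand ("-") does not change the length of the interval.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1323 440
```

Under these assumptions the result agrees with the Gene Length (1323 bp) and Protein Length (440 aa) fields.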
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.104507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTTCTCG CACAAGAAAT TATTCGTAAA AAACGTGATG GTCATGCGCT AAGCGATGAA GAAATTCGTT TCTTTATCAA CGGTATTCGC GACAACACTA TCTCCGAAGG GCAGATTGCC GCCCTCGCGA TGACCATTTT CTTCCACGAT ATGACAATGC CTGAGCGTGT CTCGCTGACC ATGGCGATGC GAGATTCAGG AACCGTTCTC GACTGGAAAA GCCTGCATCT GAATGGCCCG ATTGTTGATA AACACTCCAC CGGCGGCGTC GGCGATGTGA CTTCGCTGAT GTTGGGGCCG ATGGTCGCAG CCTGCGGCGG CTATATTCCG ATGATCTCCG GTCGCGGCCT CGGTCATACT GGCGGTACGC TCGACAAACT GGAATCCATC CCTGGCTTCG ACATTTTCCC GGATGACAAC CGTTTCCGCG AAATTATTAA AGACGTCGGC GTGGCGATTA TCGGTCAGAC CAGCTCACTG GCTCCGGCGG ATAAACGTTT CTACGCGACC CGTGATATTA CCGCAACCGT GGACTCCATC CCGCTGATCA CCGCCTCTAT TCTGGCGAAG AAACTGGCGG AAGGTCTGGA CGCGCTGGTG ATGGACGTGA AAGTGGGTAG CGGCGCGTTT ATGCCGACCT ACGAACTCTC TGAAGCCCTT GCCGAAGCGA TTGTTGGCGT GGCTAACGGC GCTGGCGTGC GCACCACCGC GCTGCTCACC GACATGAATC AGGTACTGGC CTCCAGTGCA GGTAACGCGG TTGAAGTTCG TGAAGCGGTG CAGTTCCTGA CGGGTGAATA TCGTAACCCG CGTCTGTTTG ATGTCACGAT GGCGCTGTGC GTGGAGATGC TGATCTCCGG CAAACTGGCG AAAGATGACG CCGAAGCGCG CGCGAAATTG CAGGCGGTGC TGGACAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT AGCGGCACAA AAAGGCCCGA CCGACTTCGT TGAGAACTAC GCGAAGTATC TGCCGACAGC GATGCTGACG AAAGCAGTCT ATGCTGATAC CGAAGGTTTT GTCAGTGAAA TGGATACCCG CGCGCTGGGG ATGGCAGTGG TTGCAATGGG CGGCGGTCGT CGTCAGGCAT CTGACACCAT TGATTACAGC GTCGGCTTTA CTGATATGGC GCGTTTGGGC GACCAGGTAG ACGGTCAGCG TCCGCTGGCG GTTATCCACG CGAAAGACGA AAACAGCTGG CAGGAAGCGG CGAAAGCGGT GAAAGCGGCA ATTAAACTTG CCGATAAAGC ACCGGAAAGC ACACCAACTG TCTATCGCCG TATCAGCGAA TAA
|
Protein sequence | MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT MAMRDSGTVL DWKSLHLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYIP MISGRGLGHT GGTLDKLESI PGFDIFPDDN RFREIIKDVG VAIIGQTSSL APADKRFYAT RDITATVDSI PLITASILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGKLA KDDAEARAKL QAVLDNGKAA EVFGRMVAAQ KGPTDFVENY AKYLPTAMLT KAVYADTEGF VSEMDTRALG MAVVAMGGGR RQASDTIDYS VGFTDMARLG DQVDGQRPLA VIHAKDENSW QEAAKAVKAA IKLADKAPES TPTVYRRISE
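
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), where TTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that rule in plain Python, not the tool the database used to produce the record. The function name `translate_cds` and the constants are hypothetical, and only the first 30 bases of the gene sequence are translated to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments of
# translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons allowed by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M.

    Hypothetical helper: spaces are ignored, trailing bases that do not
    fill a codon are dropped, and translation stops at the first stop codon.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
gene_prefix = "TTGTTTCTCG CACAAGAAAT TATTCGTAAA"
print(translate_cds(gene_prefix))  # MFLAQEIIRK
```

The output matches the first ten residues of the protein sequence in the record.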