Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4931 |
Symbol | deoA |
ID | 6145192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 5046159 |
End bp | 5047481 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619734 |
Product | thymidine phosphorylase |
Protein accession | YP_001746838 |
Protein GI | 170682854 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTTCTCG CACAAGAAAT TATTCGTAAA AAACGTGATG GTCATGCGCT GAGTGATGAA GAAATTCGTT TCTTTATCAA CGGCATTCGC GACAACACTA TCTCCGAAGG GCAGATTGCC GCCCTCGCGA TGACCATTTT CTTCCACGAT ATGACAATGC CTGAGCGTGT CTCGCTGACC ATGGCGATGC GAGATTCAGG AACCGTTCTC GACTGGAAAA GCCTGCATCT GAATGGCCCG ATTGTTGATA AACACTCGAC CGGCGGTGTC GGCGATGTGA CTTCGCTGAT GTTGGGGCCG ATGGTCGCTG CCTGTGGCGG CTATATTCCG ATGATCTCCG GTCGCGGCCT CGGTCATACT GGCGGTACGC TCGACAAACT GGAATCCATC CCTGGCTTCG ACATTTTCCC GGATGACAAC CGTTTCCGCG AAATTATTAA AGACGTCGGC GTGGCGATTA TCGGTCAGAC CAGCTCACTG GCTCCGGCGG ATAAACGTTT CTACGCGACC CGTGATATTA CCGCAACCGT GGACTCCATC CCGCTGATCA CTGCCTCGAT CCTGGCGAAG AAACTGGCGG AAGGTCTGGA TGCGCTGGTG ATGGACGTGA AAGTGGGTAG CGGCGCGTTT ATGCCGACCT ACGAACTTTC TGAAGCCCTT GCCGAAGCGA TTGTTGGCGT AGCTAACGGC GCTGGCGTGC GTACCACCGC GCTGCTCACC GATATGAATC AGGTACTGGC CTCCAGTGCA GGTAACGCGG TTGAAGTTCG TGAAGCGGTG CAGTTCCTGA CGGGTGAGTA TCGTAACCCG CGTCTGTTTG ATGTCACAAT GGCGCTGTGC GTAGAGATGC TTATCTCCGG CAAACTGGCG AAAGATGACG CCGAAGCGCG CGCGAAATTG CAGGCGGTGC TGGACAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT AGCGGCACAA AAAGGCCCGA CTGACTTCGT TGAGAACTAC GCGAAGTATC TGCCGACAGC GATGCTGACG AAAGCAGTCT ATGCTGATAC CGAAGGGTTT GTCAGTGAAA TGGATACCCG CGCGCTGGGG ATGGCAGTGG TTGCAATGGG CGGCGGACGC CGTCAGGCAT CTGACACCAT CGATTACAGC GTCGGCTTTA CTGATATGGC GCGTCTGGGC GACCAGGTAG ACGGTCAGCG TCCGCTGGCA GTTATCCACG CGAAAGACGA AAACAGCTGG CAGGAAGCGG CGAAAGCGGT GAAAGCGGCA ATTAAACTTG CCGATAAAGC ACCGGAAAGC ACACCAACTG TCTATCGTCG TATCAGTGAA TAA
|
Protein sequence | MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT MAMRDSGTVL DWKSLHLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYIP MISGRGLGHT GGTLDKLESI PGFDIFPDDN RFREIIKDVG VAIIGQTSSL APADKRFYAT RDITATVDSI PLITASILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGKLA KDDAEARAKL QAVLDNGKAA EVFGRMVAAQ KGPTDFVENY AKYLPTAMLT KAVYADTEGF VSEMDTRALG MAVVAMGGGR RQASDTIDYS VGFTDMARLG DQVDGQRPLA VIHAKDENSW QEAAKAVKAA IKLADKAPES TPTVYRRISE
|
| |