Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3616 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 3893478 |
End bp | 3894800 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | thymidine phosphorylase |
Protein accession | ACX41229 |
Protein GI | 260450807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTCTCG CACAAGAAAT TATTCGTAAA AAACGTGATG GTCATGCGCT GAGCGATGAA GAAATTCGTT TCTTTATCAA CGGTATTCGC GACAACACTA TCTCCGAAGG GCAGATTGCC GCCCTCGCGA TGACCATTTT CTTCCACGAT ATGACAATGC CTGAGCGTGT CTCGCTGACC ATGGCGATGC GAGATTCAGG AACCGTTCTC GACTGGAAAA GCCTGCATCT GAATGGCCCG ATTGTTGATA AACACTCCAC CGGTGGCGTC GGCGATGTGA CTTCGCTGAT GTTGGGGCCG ATGGTCGCAG CCTGCGGCGG CTATATTCCG ATGATCTCTG GTCGCGGCCT CGGTCATACT GGCGGTACGC TCGACAAACT GGAATCCATC CCTGGCTTCG ACATTTTCCC GGATGACAAC CGTTTCCGCG AAATTATTAA AGACGTCGGC GTGGCGATTA TCGGTCAGAC CAGTTCACTG GCTCCGGCTG ATAAACGTTT CTACGCGACC CGTGATATTA CCGCAACCGT GGACTCCATC CCGCTGATCA CCGCCTCTAT TCTGGCGAAG AAACTTGCGG AAGGTCTGGA CGCGCTGGTG ATGGACGTGA AAGTGGGTAG CGGCGCGTTT ATGCCGACCT ACGAACTCTC TGAAGCCCTT GCCGAAGCGA TTGTTGGCGT GGCTAACGGC GCTGGCGTGC GCACCACCGC GCTGCTCACC GACATGAATC AGGTACTGGC CTCCAGTGCA GGTAACGCGG TTGAAGTTCG TGAAGCGGTG CAGTTCCTGA CGGGTGAATA TCGTAACCCG CGTCTGTTTG ATGTCACGAT GGCGCTGTGC GTGGAGATGC TGATCTCCGG CAAACTGGCG AAAGATGACG CCGAAGCGCG CGCGAAATTG CAGGCGGTGC TGGACAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT AGCGGCACAA AAAGGCCCGA CCGACTTCGT TGAGAACTAC GCGAAGTATC TGCCGACAGC GATGCTGACG AAAGCAGTCT ATGCTGATAC CGAAGGTTTT GTCAGTGAAA TGGATACCCG CGCGCTGGGG ATGGCAGTGG TTGCAATGGG CGGCGGACGC CGTCAGGCAT CTGACACCAT CGATTACAGC GTCGGCTTTA CTGATATGGC GCGTCTGGGC GACCAGGTAG ACGGTCAGCG TCCGCTGGCG GTTATCCACG CGAAAGACGA AAACAACTGG CAGGAAGCGG CGAAAGCGGT GAAAGCGGCA ATTAAACTTG CCGATAAAGC ACCGGAAAGC ACACCAACTG TCTATCGCCG TATCAGCGAA TAA
|
Protein sequence | MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT MAMRDSGTVL DWKSLHLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYIP MISGRGLGHT GGTLDKLESI PGFDIFPDDN RFREIIKDVG VAIIGQTSSL APADKRFYAT RDITATVDSI PLITASILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGKLA KDDAEARAKL QAVLDNGKAA EVFGRMVAAQ KGPTDFVENY AKYLPTAMLT KAVYADTEGF VSEMDTRALG MAVVAMGGGR RQASDTIDYS VGFTDMARLG DQVDGQRPLA VIHAKDENNW QEAAKAVKAA IKLADKAPES TPTVYRRISE
|
| |