Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_0542 |
Symbol | deoA |
ID | 5112174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 608146 |
End bp | 609468 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640490713 |
Product | thymidine phosphorylase |
Protein accession | YP_001175280 |
Protein GI | 146310206 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.492482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.113052 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTTCTCG CACAAGAAAT TATTCGTAAA AAACGTGATG GTCATGCATT AAGTGACGAA GAGATCCGTT TCTTTATCAA CGGCATTCGT GACAACACCA TCTCTGAAGG GCAAATTGCC GCCCTGGCGA TGACCATCTT CTTCCACGAT ATGTCGATAC CGGAACGCGT ATCGCTGACC ATGGCGATGC GGGATTCAGG TAGCGTTCTG GACTGGAAAA GCCTCAATCT TAACGGCCCG ATCGTGGACA AACACTCCAC GGGCGGCGTA GGCGATGTCA CCTCTCTGAT GCTCGGCCCA ATGGTCGCAG CATGCGGCGG TTACATCCCG ATGATCTCCG GGCGCGGCCT GGGTCACACG GGCGGTACGC TCGACAAACT GGAAGCCATT CCGGGCTTCG ATATCTTCCC GAACGACACG CGTTTTCGCG AAATTATTAA AGATGTTGGT GTGGCGATTA TCGGCCAGAC CAGCTCTCTG GCTCCGGCGG ACAAGCGTTT CTACGCAACA CGCGATATTA CCGCGACGGT TGATTCCATC CCGCTGATCA CCGCTTCAAT CCTGGCGAAA AAACTGGCCG AAGGGCTGGA TGCGCTGGTG ATGGACGTGA AAGTGGGCAG CGGTGCATTT ATGCCGACGT TTGAACTTTC TGCGGCACTG GCTGAAGCGA TCGTTGGCGT CTCCAACGGC GCGGGCGTGC GTACCACGGC ACTGCTGACT GACATGAATC AGGTGCTGGC GTCCAGCGCC GGTAACGCGG TTGAAGTCCG TGAAGCGGTG CAGTTCCTGA CGGGCGAATA CCGTAATCCG CGTCTGTTCG ACGTCACCAT GGCGCTGTGC GTTGAGATGC TCATCTCCGG CAAGCTGGCC AAAGACGACG CCGAAGCGCG TGCGAAATTG CAGGCCGTGC TGGATAACGG TAAAGCGGCA GAGATCTTTG GCCGCATGGT CGCTGCGCAA AAAGGCCCGA ACGATTTCGT TGAGAACTAT GCGAAATACC TGCCGACCGC CATGCTCAGC AAAGCGGTGT ATGCCGATAC AGAAGGTTTT GTCTCCGCAA TGGACACCCG TGCGCTCGGC ATGGCAGTGG TATCGATGGG CGGCGGTCGT CGCCAGGCGT CGGACACCAT TGATTACAGC GTTGGCTTTA CCGATATGGC ACGTCTGGGC GACAGCGTTG ACGGACAGCG TCCGTTAGCC GTTATCCATG CGAAGGACGA AAACAGCTGG CAGGAAGCGG CGAAAGCGGT GAAAGCGGCC ATTCAGCTTG ACGACAAAGC ACCAGAAACC ACACCAACGG TCTATCGTCG TATCACCGAT TAG
|
Protein sequence | MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MSIPERVSLT MAMRDSGSVL DWKSLNLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYIP MISGRGLGHT GGTLDKLEAI PGFDIFPNDT RFREIIKDVG VAIIGQTSSL APADKRFYAT RDITATVDSI PLITASILAK KLAEGLDALV MDVKVGSGAF MPTFELSAAL AEAIVGVSNG AGVRTTALLT DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGKLA KDDAEARAKL QAVLDNGKAA EIFGRMVAAQ KGPNDFVENY AKYLPTAMLS KAVYADTEGF VSAMDTRALG MAVVSMGGGR RQASDTIDYS VGFTDMARLG DSVDGQRPLA VIHAKDENSW QEAAKAVKAA IQLDDKAPET TPTVYRRITD
|
| |