Gene Information |
Locus tag | EcSMS35_1809 |
Symbol | |
ID | 6142743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1829833 |
End bp | 1830885 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641616685 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001743863 |
Protein GI | 170681445 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
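The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1829833
end_bp = 1830885
reported_gene_length_bp = 1053
reported_protein_length_aa = 350

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons = gene_length_bp // 3

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_length_aa = codons - 1

print(gene_length_bp, codons, expected_protein_length_aa)
```

Under these assumptions the computed gene length and protein length agree with the reported 1053 bp and 350 aa.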
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAAGT TAGTAGCCAC AGCACCGCGC GTTGCTGCGC TGGTTGAGTA TGAAGATCGG GCGATTTTAG CCAATGAAGT GAAGATCCGC GTGCGTTTCG GCGCACCGAA ACACGGAACG GAAGTGGTCG ACTTCCGCGC CGCCAGCCCG TTTATCGATG AAGACTTTAA CGGCGAGTGG CAGATGTTCA CCCCGCGTCC CGCAGATGCG CCGCGCGGCA TTGAGTTTGG CAAATTCCAG CTTGGCAACA TGGTGGTTGG CGACATTATC GAGTGCGGCA GCGACGTTAC CGACTACGCG GTGGGCGACA GCGTATGCGG CTACGGCCCG CTCTCCGAGA CGGTCATCAT TAACGCAGTG AATAACTACA AGCTGCGCAA AATGCCGCAA GGCAGCTCCT GGAAAAACGC CGTCTGTTAC GACCCGGCGC AGTTTGCCAT GAGCGGCGTG CGCGATGCCA ACGTACGCGT AGGGGATTTT GTCGTGGTGG TAGGGCTTGG CGCGATCGGT CAAATTGCCA TCCAACTGGC GAAACGCGCT GGCGCATCGG TGGTAATTGG CGTCGATCCT ATCGCCCATC GCTGTGAGAT TGCCCGTCGC CACGGTGCTG ATTTCTGCCT TAACCCCATT GGCACTGACG TAGGCAAAGA GATCAAAACG CTGACCGGCA AGCAGGGTGC CGATGTGATT ATCGAAACCA GCGGTTACGC CGACGCGCTG CAATCGGCGC TGCGCGGTCT GTCCTACGGC GGCACCATCT CGTATGTCGC GTTTGCCAAA CCATTTGCTG AAGGTTTTAA CCTCGGACGC GAAGCGCATT TCAATAACGC CAAAATTGTT TTCTCCCGCG CGTGCAGCGA ACCGAACCCG GATTATCCGC GCTGGAGCCG TAAGCGTATT GAAGAAACCT GCTGGGAACT GCTGATGAAC GGTTATCTCA ATTGCGAAGA TTTAATCGAC CCGGTAGTGA CCTTTGCCAA CAGCCCGGAA AGCTACATGC AGTATGTCGA TCAGCATCCG GAACAGAGCA TCAAAATGGG CGTTACGTTT TAA
Protein sequence | MKKLVATAPR VAALVEYEDR AILANEVKIR VRFGAPKHGT EVVDFRAASP FIDEDFNGEW QMFTPRPADA PRGIEFGKFQ LGNMVVGDII ECGSDVTDYA VGDSVCGYGP LSETVIINAV NNYKLRKMPQ GSSWKNAVCY DPAQFAMSGV RDANVRVGDF VVVVGLGAIG QIAIQLAKRA GASVVIGVDP IAHRCEIARR HGADFCLNPI GTDVGKEIKT LTGKQGADVI IETSGYADAL QSALRGLSYG GTISYVAFAK PFAEGFNLGR EAHFNNAKIV FSRACSEPNP DYPRWSRKRI EETCWELLMN GYLNCEDLID PVVTFANSPE SYMQYVDQHP EQSIKMGVTF
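Both sequences are displayed in space-separated blocks of ten characters, so the spaces need to be stripped before any computation. The following is a minimal sketch of that cleanup, plus a GC fraction and a translation of the first three blocks of the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence, as displayed in the record.
prefix = clean("ATGAAAAAGT TAGTAGCCAC AGCACCGCGC")
print(translate(prefix))    # MKKLVATAPR
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the protein sequence. The GC fraction of this 30-base prefix is only illustrative and is not expected to equal the 56% reported for the whole gene.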