Gene Information |
Locus tag | EcSMS35_1417 |
Symbol | gutB |
ID | 6142812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1401480 |
End bp | 1402523 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641616295 |
Product | sorbitol dehydrogenase |
Protein accession | YP_001743475 |
Protein GI | 170684066 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
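The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 1401480
END_BP = 1402523


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1044 347"
```

Both values agree with the Gene Length (1044 bp) and Protein Length (347 aa) fields listed above.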
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAATT CAAAAGCAAT ATTGCAGGTG CCGGGCACAA TGAAAATTAT TTCAGCAGAA ATGCCTGTGC CTAAAGAAGA TGAAGTTTTG ATTAAAGTAG AATATGTCGG TATTTGTGGT TCAGATGTAC ATGGTTTTGA ATCAGGCCCG TTTATTCCGC CTAAAGACCC AAATCAAGAA ATTGGCCTGG GTCATGAATG CGCCGGGACG GTTGTGGCTG TGGGAAGCCG CGTGCGCAAA TTTAAACCGG GGGATCGGGT AAATATCGAA CCTGGCGTTC CTTGCGGTCA CTGTCGTTAC TGTCTGGAAG GCAAATATAA CATTTGCCCG GACGTTGATT TTATGGCGAC ACAACCCAAC TACCGCGGCG CATTAACGCA CTATCTGTGT CATCCGGAGA GCTTTACTTA CAAACTGCCA GACAATATGG ACACGATGGA AGGGGCGCTG GTGGAGCCTG CCGCAGTCGG GATGCATGCC GCGATGCTGG CAGATGTTAA ACCGGGTAAG AAGATAATTA TTCTGGGAGC AGGTTGTATT GGTTTGATGA CGTTGCAAGC GTGCAAATGC CTGGGAGCTA CGGATATTGC CGTCGTTGAT GTGCTGGAAA AACGTCTGAC AATGGCGGAA CAGCTCGGCG CGACAGTGGT TATTAACGGC GCAAAAGAAG ACACTATTGC ACGCTGTCAG CAATTTACCG AAGAAATGGG CGCAGATATT GTTTTCGAAA CAGCAGGTTC TGCGGTCACC GTTAAACAGG CACCTTATCT GGTAATGCGC GGCGGTAAAA TTATGATTGT TGGTACTGTA CCGGGCGATT CGGCAATCAA TTTCCTCAAA ATCAATCGCG AAGTCACTAT CCAGACGGTA TTCCGCTATG CCAATCGTTA TCCGGTCACG ATTGAAGCTA TTTCTTCAGG GCGATTCGAT GTGAAATCGA TGGTGACGCA TATTTACGAT TATCGGGATG TACAACAGGC ATTTGAAGAG TCAGTTAACA ACAAACGCGA CATTATTAAA GGCGTTATTA AAATTAGCGA TTAA
|
Protein sequence | MKNSKAILQV PGTMKIISAE MPVPKEDEVL IKVEYVGICG SDVHGFESGP FIPPKDPNQE IGLGHECAGT VVAVGSRVRK FKPGDRVNIE PGVPCGHCRY CLEGKYNICP DVDFMATQPN YRGALTHYLC HPESFTYKLP DNMDTMEGAL VEPAAVGMHA AMLADVKPGK KIIILGAGCI GLMTLQACKC LGATDIAVVD VLEKRLTMAE QLGATVVING AKEDTIARCQ QFTEEMGADI VFETAGSAVT VKQAPYLVMR GGKIMIVGTV PGDSAINFLK INREVTIQTV FRYANRYPVT IEAISSGRFD VKSMVTHIYD YRDVQQAFEE SVNNKRDIIK GVIKISD
|
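The protein sequence can be re-derived from the gene sequence. The sketch below is one way to do this without external libraries, under these assumptions: the sequence is pasted with its 10-base spacing intact, the reading frame starts at the first base, and translation stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the standard codon table is used here. This gene starts with ATG, so alternative start codons are not handled. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAAAATT CAAAAGCAAT ATTGCAGGTG"))  # prints "MKNSKAILQV"
```

Passing the full Gene sequence field to `translate` allows a comparison with the Protein sequence field, and `gc_percent` can be applied to the same input to recompute the GC content field.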