Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2059 |
Symbol | gutB |
ID | 6871395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 1991800 |
End bp | 1992843 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642785173 |
Product | sorbitol dehydrogenase |
Protein accession | YP_002215839 |
Protein GI | 198244287 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.642486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.0031409 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAATT CAAAAGCGAT ACTAAAAACG CCGGGCACCA TGACAATTAT AGCGGCTGAT ATTCCAGTAC CAAAAGAAAA CGAAGTATTG ATCAAAGTGG AATATGTCGG TATTTGCGGT TCAGATGTTC ACGGTTTTGA ATCCGGGCCA TTCATTCCGC CGAAGGATCC AAATCAGGAA ATTGGTCTCG GTCATGAGTG TGCTGGTACG GTCGTTGCGG TCGGCAATCG GGTAAGCAAA TTTAAGCCAG GCGATCGGGT TAATATCGAG CCGGGCGTGC CGTGCGGCCA CTGCCGCTAT TGTCTGGAAG GAAAATACAA TATTTGTCCG GATGTTGATT TTATGGCGAC GCAGCCGAAT TATCGCGGGG CCTTAACGCA CTATCTGTGC CATCCGGAAA GTTTTACGTA CAAGCTTCCG GACAATATGG ACACTATGGA AGGTGCGCTG GTGGAACCTG CTGCTGTTGG AATGCACGCG GCAATGCTGG CGGATGTTAA ACCGGGTAAG AAAATCGTCA TTCTCGGCGC GGGCTGCATT GGTTTAATGA CCCTGCAAGC GTGTAAGTGT CTGGGGGCGA CCAATATCGC GGTAGTGGAT GTGCTGGAAA AACGGCTGGC AATGGCTGAA CGACTGGGCG CGACAACCGT TATCAATGGG GCGAAAGAAG ATACTGTCGC GCTCTGCCAG CAGTTCACCG ACGATATGGG CGCCGATATT GTGTTTGAAA CCGCCGGTTC CGCCGTCACA ACTCAGCAAG CGCCGTATCT GGTCATGCGC GGCGGGAAGA TCATGATTGT TGGCACTGTC GCAGGAGATT CAGCGATTAA TTTCCTCAAA ATTAACCGTG AAGTCTCCAT CCAGACGGTA TTCCGCTATG CCAACCGCTA TCCGGTGACT ATTGATGCCA TCTCCTCCGG GCGTTTCGAT GTGAAATCAA TGGTGACGCA TATTTACGAT TACAAAGACG TACAACGTGC ATTTGAAGAG TCGGTGAATA ACAAACGCGA CATTATTAAA GGCGTTATTA AAGTTTGCGA TTAA
|
Protein sequence | MKNSKAILKT PGTMTIIAAD IPVPKENEVL IKVEYVGICG SDVHGFESGP FIPPKDPNQE IGLGHECAGT VVAVGNRVSK FKPGDRVNIE PGVPCGHCRY CLEGKYNICP DVDFMATQPN YRGALTHYLC HPESFTYKLP DNMDTMEGAL VEPAAVGMHA AMLADVKPGK KIVILGAGCI GLMTLQACKC LGATNIAVVD VLEKRLAMAE RLGATTVING AKEDTVALCQ QFTDDMGADI VFETAGSAVT TQQAPYLVMR GGKIMIVGTV AGDSAINFLK INREVSIQTV FRYANRYPVT IDAISSGRFD VKSMVTHIYD YKDVQRAFEE SVNNKRDIIK GVIKVCD
|
| |