Gene SeD_A2059 details


Gene Information

Locus tag: SeD_A2059
Symbol: gutB
ID: 6871395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1991800
End bp: 1992843
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 50%
IMG OID: 642785173
Product: sorbitol dehydrogenase
Protein accession: YP_002215839
Protein GI: 198244287
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
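
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 1991800
end_bp = 1992843

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final codon being a stop.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1044
print(protein_length)  # 347
```

Both results agree with the Gene Length (1044 bp) and Protein Length (347 aa) fields.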


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.642486
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.0031409
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAATT CAAAAGCGAT ACTAAAAACG CCGGGCACCA TGACAATTAT AGCGGCTGAT 
ATTCCAGTAC CAAAAGAAAA CGAAGTATTG ATCAAAGTGG AATATGTCGG TATTTGCGGT
TCAGATGTTC ACGGTTTTGA ATCCGGGCCA TTCATTCCGC CGAAGGATCC AAATCAGGAA
ATTGGTCTCG GTCATGAGTG TGCTGGTACG GTCGTTGCGG TCGGCAATCG GGTAAGCAAA
TTTAAGCCAG GCGATCGGGT TAATATCGAG CCGGGCGTGC CGTGCGGCCA CTGCCGCTAT
TGTCTGGAAG GAAAATACAA TATTTGTCCG GATGTTGATT TTATGGCGAC GCAGCCGAAT
TATCGCGGGG CCTTAACGCA CTATCTGTGC CATCCGGAAA GTTTTACGTA CAAGCTTCCG
GACAATATGG ACACTATGGA AGGTGCGCTG GTGGAACCTG CTGCTGTTGG AATGCACGCG
GCAATGCTGG CGGATGTTAA ACCGGGTAAG AAAATCGTCA TTCTCGGCGC GGGCTGCATT
GGTTTAATGA CCCTGCAAGC GTGTAAGTGT CTGGGGGCGA CCAATATCGC GGTAGTGGAT
GTGCTGGAAA AACGGCTGGC AATGGCTGAA CGACTGGGCG CGACAACCGT TATCAATGGG
GCGAAAGAAG ATACTGTCGC GCTCTGCCAG CAGTTCACCG ACGATATGGG CGCCGATATT
GTGTTTGAAA CCGCCGGTTC CGCCGTCACA ACTCAGCAAG CGCCGTATCT GGTCATGCGC
GGCGGGAAGA TCATGATTGT TGGCACTGTC GCAGGAGATT CAGCGATTAA TTTCCTCAAA
ATTAACCGTG AAGTCTCCAT CCAGACGGTA TTCCGCTATG CCAACCGCTA TCCGGTGACT
ATTGATGCCA TCTCCTCCGG GCGTTTCGAT GTGAAATCAA TGGTGACGCA TATTTACGAT
TACAAAGACG TACAACGTGC ATTTGAAGAG TCGGTGAATA ACAAACGCGA CATTATTAAA
GGCGTTATTA AAGTTTGCGA TTAA
 
Protein sequence
MKNSKAILKT PGTMTIIAAD IPVPKENEVL IKVEYVGICG SDVHGFESGP FIPPKDPNQE 
IGLGHECAGT VVAVGNRVSK FKPGDRVNIE PGVPCGHCRY CLEGKYNICP DVDFMATQPN
YRGALTHYLC HPESFTYKLP DNMDTMEGAL VEPAAVGMHA AMLADVKPGK KIVILGAGCI
GLMTLQACKC LGATNIAVVD VLEKRLAMAE RLGATTVING AKEDTVALCQ QFTDDMGADI
VFETAGSAVT TQQAPYLVMR GGKIMIVGTV AGDSAINFLK INREVSIQTV FRYANRYPVT
IDAISSGRFD VKSMVTHIYD YKDVQRAFEE SVNNKRDIIK GVIKVCD
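
The protein sequence can be spot-checked against the gene sequence by translating codons. The following is a minimal sketch, not the tool that produced this record: it builds the codon table in pure Python, relying on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Standard NCBI codon ordering: bases cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (bases 1-60).
first_line = "ATGAAAAATT CAAAAGCGAT ACTAAAAACG CCGGGCACCA TGACAATTAT AGCGGCTGAT"
# Last line of the gene sequence (bases 1021-1044, still in frame).
last_line = "GGCGTTATTA AAGTTTGCGA TTAA"

print(translate(first_line))  # MKNSKAILKTPGTMTIIAAD
print(translate(last_line))   # GVIKVCD
```

The two outputs match the first 20 and last 7 residues of the protein sequence listed above.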