Gene SeD_A2388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2388 
Symbol 
ID6875521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2258050 
End bp2259162 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content61% 
IMG OID642785479 
Productpropanediol utilization: propanol dehydrogenase 
Protein accessionYP_002216137 
Protein GI198245287 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACCT TCTCACTACA AACGCGGTTG TACAGCGGTC AGGGCAGCCT GGCGGTGCTC 
AAGCGCTTTA CCAATAAGCA CATCTGGATA ATCTGCGATG GCTTTCTGGC TCGCTCGCCG
CTGCTGGATA CCCTGCGTAA CGCGCTGCCC GCAGATAACC GCATCAGCGT CTTTAGCGAG
ATAACGCCGG ACCCCACCAT CCACACAGTG GTTCAGGGCA TTGCGCAAAT GCAGGCTCTG
CAACCGCAGG TGGTGATCGG TTTTGGCGGC GGCTCGGCAA TGGACGCGGC GAAAGCGATT
GTCTGGTTTA GCCAGCAAAG CGGCATCAAT ATCGAAACCT GCGTGGCGAT CCCGACCACC
AGCGGCACCG GTTCGGAAGT CACCAGCGCC TGCGTAATTA GCGACCCGGA TAAAGGCATT
AAGTATCCGC TGTTCAACAA TGCGCTGTAT CCGGATATGG CGATCCTTGA CCCGGAGCTG
GTGGTCAGCG TTCCGCCGCA GATTACCGCC AACACCGGTA TGGACGTGCT GACCCACGCC
CTGGAGGCCT GGGTGTCACC GCACGCCAGC GACTTTACCG ACGCGCTGGC GGAAAAAGCC
GCCAAACTGG TGCTCCAGTA TCTGCCCACG GCGGTGGAAA AAGGCGACTG CGTGGCGACG
CGCGGGAAAA TGCACAATGC CTCAACGCTC GCCGGGATGG CCTTCAGCCA GGCGGGGCTG
GGGCTTAACC ACGCGATAGC CCACCAGCTC GGCGGACAGT TCCATCTGCC GCACGGGCTG
GCCAATGCGC TGCTGCTCAC GACGGTGATC CGCTTTAACG CGGGTGACCC GCGCGCCGCC
AAACGCTACG CGCGGCTGGC CAAAGCCTGC GGTTTTTGCC CGGCAGAAGC CAATGACGTT
GCGGCGATCA ATGCGCTGAT TCAGCAAATC GAACTGCTTA AGCAACGCTG TGCCCTTCCC
TCACTGGCCG TTGCGCTTAA AGAAGGAAGA TCCGACTTTT CCGCACGTAT TCCGGCGATG
GTGCAGGCCG CGCTGGCGGA TATCACGCTG CGCACCAACC CGCGCCCGGC CAGCGCCGAG
GAAATTCGCG AGCTGCTGGA GGAACTGCTA TGA
 
Protein sequence
MNTFSLQTRL YSGQGSLAVL KRFTNKHIWI ICDGFLARSP LLDTLRNALP ADNRISVFSE 
ITPDPTIHTV VQGIAQMQAL QPQVVIGFGG GSAMDAAKAI VWFSQQSGIN IETCVAIPTT
SGTGSEVTSA CVISDPDKGI KYPLFNNALY PDMAILDPEL VVSVPPQITA NTGMDVLTHA
LEAWVSPHAS DFTDALAEKA AKLVLQYLPT AVEKGDCVAT RGKMHNASTL AGMAFSQAGL
GLNHAIAHQL GGQFHLPHGL ANALLLTTVI RFNAGDPRAA KRYARLAKAC GFCPAEANDV
AAINALIQQI ELLKQRCALP SLAVALKEGR SDFSARIPAM VQAALADITL RTNPRPASAE
EIRELLEELL