Gene SNSL254_A2229 details


Gene Information

Locus tag: SNSL254_A2229
Symbol:
ID: 6486397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2136308
End bp: 2137420
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 61%
IMG OID: 642737577
Product: propanediol utilization: propanol dehydrogenase
Protein accession: YP_002041319
Protein GI: 194446766
COG category: [C] Energy production and conversion
COG ID: [COG1454] Alcohol dehydrogenase, class IV
TIGRFAM ID:
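
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2136308
end_bp = 2137420
gene_length_bp = 1113
protein_length_aa = 370


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)  # prints: True True
```

Under those assumptions, 2137420 - 2136308 + 1 gives 1113 bp, and 1113 / 3 gives 371 codons, of which 370 encode amino acids.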


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.176556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACCT TCTCACTACA AACGCGGTTG TACAGCGGTC AGGGCAGCCT GGCGGTGCTC 
AAGCGCTTTA CCAATAAGCA CATCTGGATA ATCTGCGATG GCTTTCTGGC GCGCTCTCCG
CTGCTGGATA CCCTGCGTAA CGCGCTGCCC GCAGATAACC GCATCAGCGT CTTTAGCGAG
ATAACGCCGG ACCCCACCAT CCACACAGTG GTTCAGGGCA TTGCGCAAAT GCAGGCTCTG
CAACCGCAGG TGGTGATTGG TTTTGGCGGC GGCTCGGCAA TGGACGCGGC GAAAGCGATT
GTCTGGTTTA GCCAGCAGAG CGGGATCAAC ATCGAAACCT GCGTGGCGAT CCCGACCACC
AGCGGCACCG GCTCGGAAGT GACCAGCGCC TGCGTAATTA GCGACCCGGA TAAAGGCATC
AAGTATCCGC TGTTCAACAA TGCGCTGTAT CCGGATATGG CGATCCTCGA CCCGGAGCTG
GTGGTCAGCG TACCGCCGCA GATTACCGCC AACACCGGCA TGGACGTGCT GACCCACGCC
CTGGAGGCCT GGGTGTCACC GCGCGCCAGC GACTTTACCG ACGCGCTAGC GGAAAAGGCC
GCCAAACTGG TGTTCCAGTA TCTGCCCACG GCGGTGGAAA AAGGCGACTG CGTGGCGACG
CGCGGGAAAA TGCACAACGC TTCAACGCTC GCCGGGATGG CCTTCAGCCA GGCGGGGCTG
GGGCTTAACC ACGCGATAGC CCACCAGCTT GGCGGACAGT TTCATCTGCC GCACGGGCTG
GCCAATGCGC TGCTGCTCAC GACGGTGATC CGCTTTAACG CGGGTGACCC GCGCGCCGCC
AAACGCTATG CGCGGCTGGC CAAAGCCTGC GGTTTTTGCC CGGCAGAAGC CAATGACGTT
GCGGCAATCA ATGCGCTGAT TCAGCAAATC GAACTGCTTA AGCAACGCTG CGCCCTTCCC
TCACTGGCCG TTGCGCTTAA AGAAGGAAGA TCCGACTTTT CCGCACGTAT TCCGGCGATG
GTGCAGGCCG CGCTGGCGGA TATCACGCTG CGCACCAACC CGCGCCCGGC CAGCGCCGAG
GAAATTCGCG AGCTGCTGGA GGAACTGCTA TGA
 
Protein sequence
MNTFSLQTRL YSGQGSLAVL KRFTNKHIWI ICDGFLARSP LLDTLRNALP ADNRISVFSE 
ITPDPTIHTV VQGIAQMQAL QPQVVIGFGG GSAMDAAKAI VWFSQQSGIN IETCVAIPTT
SGTGSEVTSA CVISDPDKGI KYPLFNNALY PDMAILDPEL VVSVPPQITA NTGMDVLTHA
LEAWVSPRAS DFTDALAEKA AKLVFQYLPT AVEKGDCVAT RGKMHNASTL AGMAFSQAGL
GLNHAIAHQL GGQFHLPHGL ANALLLTTVI RFNAGDPRAA KRYARLAKAC GFCPAEANDV
AAINALIQQI ELLKQRCALP SLAVALKEGR SDFSARIPAM VQAALADITL RTNPRPASAE
EIRELLEELL
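
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used to generate this record. It assumes the sequence is pasted with the space-separated 10-base blocks shown above, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built in the conventional TCAG order. Table 11 shares these
# codon-to-amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence listed in this record.
first_row = "ATGAATACCT TCTCACTACA AACGCGGTTG TACAGCGGTC AGGGCAGCCT GGCGGTGCTC"
last_row = "GAAATTCGCG AGCTGCTGGA GGAACTGCTA TGA"
print(translate(first_row))  # prints: MNTFSLQTRLYSGQGSLAVL
print(translate(last_row))   # prints: EIRELLEELL
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values that can be compared against the Protein sequence and GC content fields.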