Gene SNSL254_A2217 details


Gene Information

Locus tag: SNSL254_A2217
Symbol: pduC
ID: 6484621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2126633
End bp: 2128297
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 57%
IMG OID: 642737565
Product: propanediol utilization dehydratase large subunit
Protein accession: YP_002041307
Protein GI: 194443056
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4909] Propanediol dehydratase, large subunit
TIGRFAM ID:
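
The start, end, and length fields above are consistent with each other. A minimal sketch of the arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it reproduces the stated 1665 bp) and that the final codon is a stop codon that is not counted in the protein length:

```python
# Coordinates copied from the record above.
start_bp = 2126633
end_bp = 2128297

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per amino acid, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1665 554
```

The variable names are illustrative only; they are not field names from the database.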


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATCGA AAAGATTTGA AGCACTGGCG AAACGCCCTG TGAATCAGGA CGGCTTTGTT 
AAGGAGTGGA TCGAAGAAGG CTTTATCGCG ATGGAAAGCC CGAACGACCC AAAACCGTCG
ATAAAAATCG TTAACGGCGC GGTAACCGAG CTGGACGGAA AACCGGTTAG CGAATTCGAC
CTGATCGACC ACTTTATCGC CCGCTACGGC ATCAACCTGA ACCGCGCCGA AGAAGTGATG
GCGATGGATT CGGTCAAGCT GGCTAACATG CTGTGCGATC CGAACGTCAA GCGCAGCGAA
ATCGTTCCGC TAACCACCGC GATGACCCCA GCGAAAATTG TCGAAGTGGT TTCGCATATG
AACGTGGTTG AGATGATGAT GGCGATGCAG AAAATGCGCG CCCGCCGTAC TCCATCTCAA
CAGGCGCACG TCACCAACGT TAAAGACAAC CCGGTGCAAA TTGCCGCCGA TGCCGCCGAA
GGCGCATGGC GCGGGTTTGA CGAACAAGAG ACGACGGTTG CGGTAGCGCG CTATGCGCCG
TTCAACGCCA TCGCGCTGCT GGTTGGTTCT CAGGTAGGTC GTCCGGGGGT ACTGACTCAA
TGCTCGCTGG AAGAAGCCAC CGAGCTGAAG CTCGGCATGC TGGGCCACAC CTGCTACGCC
GAAACCATCT CCGTTTACGG CACCGAGCCG GTCTTCACCG ACGGTGACGA TACCCCATGG
TCGAAGGGCT TCTTAGCCTC TTCCTACGCC TCTCGCGGCC TGAAAATGCG CTTCACCTCC
GGCTCCGGCT CCGAAGTGCA GATGGGCTAC GCCGAAGGCA AATCCATGCT GTATCTGGAA
GCGCGCTGCA TCTATATCAC CAAAGCCGCG GGCGTTCAGG GGCTGCAAAA CGGCTCCGTA
AGCTGCATCG GCGTACCGTC TGCCGTGCCG TCAGGCATTC GTGCCGTGCT GGCGGAAAAC
CTGATCTGCT CTTCGCTGGA TCTGGAATGC GCCTCCAGTA ACGACCAGAC CTTCACCCAC
TCCGATATGC GTCGTACCGC GCGCCTGCTG ATGCAGTTCC TGCCGGGTAC CGACTTTATC
TCCTCTGGTT ATTCCGCGGT GCCGAACTAC GACAACATGT TCGCCGGTTC CAACGAAGAT
GCGGAAGACT TTGACGACTA CAACGTTATC CAGCGTGACC TGAAAGTGGA CGGTGGTCTG
CGCCCGGTTC GCGAAGAGGA CGTTATCGCC ATCCGTAACA AAGCCGCCCG CGCGCTGCAG
GCCGTGTTTG CCGGAATGGG ACTGCCGCCG ATTACCGATG AAGAAGTTGA AGCTGCGACC
TATGCCCACG GTTCGAAAGA TATGCCGGAG CGTAACATCG TCGAAGACAT CAAGTTCGCC
CAGGAAATCA TCAATAAAAA CCGCAACGGT CTGGAAGTCG TTAAAGCGCT GGCGCAGGGC
GGGTTTACCG ACGTGGCCCA GGACATGCTC AACATCCAGA AAGCCAAGCT AACCGGCGAC
TATTTGCACA CCTCCGCCAT TATCGTCGGC GACGGACAAG TGCTCTCTGC GGTTAATGAC
GTCAATGACT ATGCCGGTCC GGCAACAGGT TATCGCCTGC AGGGAGAACG CTGGGAAGAG
ATTAAAAACA TCCCTGGCGC TCTTGATCCC AACGAGATTG ATTAA
 
Protein sequence
MRSKRFEALA KRPVNQDGFV KEWIEEGFIA MESPNDPKPS IKIVNGAVTE LDGKPVSEFD 
LIDHFIARYG INLNRAEEVM AMDSVKLANM LCDPNVKRSE IVPLTTAMTP AKIVEVVSHM
NVVEMMMAMQ KMRARRTPSQ QAHVTNVKDN PVQIAADAAE GAWRGFDEQE TTVAVARYAP
FNAIALLVGS QVGRPGVLTQ CSLEEATELK LGMLGHTCYA ETISVYGTEP VFTDGDDTPW
SKGFLASSYA SRGLKMRFTS GSGSEVQMGY AEGKSMLYLE ARCIYITKAA GVQGLQNGSV
SCIGVPSAVP SGIRAVLAEN LICSSLDLEC ASSNDQTFTH SDMRRTARLL MQFLPGTDFI
SSGYSAVPNY DNMFAGSNED AEDFDDYNVI QRDLKVDGGL RPVREEDVIA IRNKAARALQ
AVFAGMGLPP ITDEEVEAAT YAHGSKDMPE RNIVEDIKFA QEIINKNRNG LEVVKALAQG
GFTDVAQDML NIQKAKLTGD YLHTSAIIVG DGQVLSAVND VNDYAGPATG YRLQGERWEE
IKNIPGALDP NEID
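
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the database's own tooling. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons are allowed as starts, and this gene starts with ATG). The helper name `translate` and the two excerpt variables are hypothetical; the excerpts are the first 30 and last 45 nucleotides of the gene sequence above.

```python
# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the 10-base block layout.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts from the gene sequence above (both are in frame).
first_30 = "ATGAGATCGA AAAGATTTGA AGCACTGGCG"
last_45 = "ATTAAAAACA TCCCTGGCGC TCTTGATCCC AACGAGATTG ATTAA"

print(translate(first_30))  # MRSKRFEALA
print(translate(last_45))   # IKNIPGALDPNEID
```

The two outputs match the first ten and last fourteen residues of the protein sequence. Any character other than A, C, G, or T raises a KeyError in this sketch; ambiguity codes are not handled.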