Gene EcE24377A_2281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2281 
SymbolpduC 
ID5587739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2248194 
End bp2249858 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content51% 
IMG OID640925947 
Productpropanediol utilization dehydratase, large subunit 
Protein accessionYP_001463342 
Protein GI157157290 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4909] Propanediol dehydratase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATCGA AAAGATTTGA AGCACTGGCA AAACGCCCGG TCAACCAGGA CGGTTTCGTT 
AAAGAGTGGA TTGAAGAAGG CTTTATTGCG ATGGAAAGCC CGAACGATCC TAAACCGTCA
ATCAAAATTG TTAACGGCGT CGTTACCGAA CTGGATGGAA AACCCGCCAG CCAGTTTGAC
CTGATCGACC ACTTTATCGC CCGCTACGGC ATCAATCTCG CGCGCGCCGA AGAAGTCATC
GCGATGGATT CGGTAAAACT TGCCAATATG TTGTGCGATC CAAATGTCAA ACGTAGCGAT
ATCGTTCCCC TTACCACAGC AATGACGCCT GCAAAAATCG TCGAAGTGGT TTCGCAGATG
AACGTGGTAG AGATGATGAT GGCGATGCAG AAAATGCGCG CCCGTCGAAC ACCATCACAA
CAGGCTCATG TCACCAACGT CAAAGATAAC CCAGTACAAA TTGCCGCCGA CGCAGCTGAA
GGCGCATGGC GCGGGTTTGA TGAACAGGAA ACCACCGTTG CCGTTGCTCG CTATGCACCG
TTCAACGCCA TCGCCCTGCT GGTCGGCTCA CAGGTGGGAC GTCCGGGCGT CTTGACCCAG
TGTTCACTGG AAGAAGCCAC CGAACTGAAA CTCGGTATGT TAGGCCACAC CTGTTACGCC
GAAACAATTT CTGTCTATGG TACAGAACCT GTTTTCACCG ATGGTGATGA TACCCCGTGG
TCAAAAGGGT TTCTCGCGTC ATCCTATGCT TCTCGTGGCT TAAAAATGCG CTTTACTTCC
GGCTCCGGCT CTGAAGTACA GATGGGTTAC GCGGAAGGCA AATCAATGCT TTACCTCGAG
GCTCGTTGCA TTTACATCAC CAAAGCCGCA GGCGTTCAGG GTCTGCAGAA TGGTTCGGTA
AGCTGCATCG GTGTTCCATC AGCTGTGCCA TCAGGCATCC GCGCAGTATT AGCGGAAAAC
TTAATTTGCT CATCGCTGGA TCTGGAATGC GCCTCCAGTA ATGACCAGAC CTTTACCCAC
TCGGATATGC GCCGTACCGC ACGTTTCCTG ATGCAATTTC TGCCAGGCAC CGACTTTATC
TCTTCCGGTT TTTCCGCTGT GCCGAACTAC GACAACATGT TTGCTGGTTC AAACGAAGAT
GCTGAAGACT TCGATGATTA CAACGTCATT CAGCGTGACC TGAAGGTTGA TGGTGGGCTG
CGACCAGTAC GCGAAGAAGA TGTTATCGCT ATCCGCAATA AAGCCGCTCG CGCTTTACAG
GCTGTCTTTG CCGGTATGGG ACTACCGCCT ATTACCGATG AAGAAGTCGA AGCGGCAACC
TATGCCCACG GTTCAAAAGA TATGCCTGAG CGCAATATTG TGGAAGACAT CAAGTTCGCC
CAGGAGATTA TCAATAAAAA CCGGAACAGC CTGGAAGTGG TGAAAGCCCT GGCGCAAGGC
GGTTTTACCG ATGTCGCCCA GGACATGCTC AACATGCAAA AAGCAAAACT GACCGGCGAT
TATCTCCATA CCTCAGCCAT CATCGTCGAT GACGGACAAG TGCTCTCTGC GGTCAATGAC
GTCAATGATT ACGCCGGACC GGCTACAGGT TACCGCCTGC AAGGGGAACG CTGGGAAGAG
ATCAAAAATA TTCCCGGTGC ACTTGATCCC AACGAAATTG ACTAA
 
Protein sequence
MRSKRFEALA KRPVNQDGFV KEWIEEGFIA MESPNDPKPS IKIVNGVVTE LDGKPASQFD 
LIDHFIARYG INLARAEEVI AMDSVKLANM LCDPNVKRSD IVPLTTAMTP AKIVEVVSQM
NVVEMMMAMQ KMRARRTPSQ QAHVTNVKDN PVQIAADAAE GAWRGFDEQE TTVAVARYAP
FNAIALLVGS QVGRPGVLTQ CSLEEATELK LGMLGHTCYA ETISVYGTEP VFTDGDDTPW
SKGFLASSYA SRGLKMRFTS GSGSEVQMGY AEGKSMLYLE ARCIYITKAA GVQGLQNGSV
SCIGVPSAVP SGIRAVLAEN LICSSLDLEC ASSNDQTFTH SDMRRTARFL MQFLPGTDFI
SSGFSAVPNY DNMFAGSNED AEDFDDYNVI QRDLKVDGGL RPVREEDVIA IRNKAARALQ
AVFAGMGLPP ITDEEVEAAT YAHGSKDMPE RNIVEDIKFA QEIINKNRNS LEVVKALAQG
GFTDVAQDML NMQKAKLTGD YLHTSAIIVD DGQVLSAVND VNDYAGPATG YRLQGERWEE
IKNIPGALDP NEID