Gene Information |
Locus tag | EcE24377A_2281 |
Symbol | pduC |
ID | 5587739 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2248194 |
End bp | 2249858 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640925947 |
Product | propanediol utilization dehydratase, large subunit |
Protein accession | YP_001463342 |
Protein GI | 157157290 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4909] Propanediol dehydratase, large subunit |
TIGRFAM ID | |
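
The length fields in this table follow from the coordinates. With 1-based inclusive coordinates, the gene length is End bp minus Start bp plus one, and the protein length is the codon count minus the terminal stop codon. The sketch below checks that arithmetic; the function name `cds_lengths` is hypothetical (it is not part of any database API), and it assumes an unspliced CDS with a single stop codon.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the Start bp / End bp fields of this record.
print(cds_lengths(2248194, 2249858))  # (1665, 554)
```

The result agrees with the Gene Length (1665 bp) and Protein Length (554 aa) fields.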
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATCGA AAAGATTTGA AGCACTGGCA AAACGCCCGG TCAACCAGGA CGGTTTCGTT AAAGAGTGGA TTGAAGAAGG CTTTATTGCG ATGGAAAGCC CGAACGATCC TAAACCGTCA ATCAAAATTG TTAACGGCGT CGTTACCGAA CTGGATGGAA AACCCGCCAG CCAGTTTGAC CTGATCGACC ACTTTATCGC CCGCTACGGC ATCAATCTCG CGCGCGCCGA AGAAGTCATC GCGATGGATT CGGTAAAACT TGCCAATATG TTGTGCGATC CAAATGTCAA ACGTAGCGAT ATCGTTCCCC TTACCACAGC AATGACGCCT GCAAAAATCG TCGAAGTGGT TTCGCAGATG AACGTGGTAG AGATGATGAT GGCGATGCAG AAAATGCGCG CCCGTCGAAC ACCATCACAA CAGGCTCATG TCACCAACGT CAAAGATAAC CCAGTACAAA TTGCCGCCGA CGCAGCTGAA GGCGCATGGC GCGGGTTTGA TGAACAGGAA ACCACCGTTG CCGTTGCTCG CTATGCACCG TTCAACGCCA TCGCCCTGCT GGTCGGCTCA CAGGTGGGAC GTCCGGGCGT CTTGACCCAG TGTTCACTGG AAGAAGCCAC CGAACTGAAA CTCGGTATGT TAGGCCACAC CTGTTACGCC GAAACAATTT CTGTCTATGG TACAGAACCT GTTTTCACCG ATGGTGATGA TACCCCGTGG TCAAAAGGGT TTCTCGCGTC ATCCTATGCT TCTCGTGGCT TAAAAATGCG CTTTACTTCC GGCTCCGGCT CTGAAGTACA GATGGGTTAC GCGGAAGGCA AATCAATGCT TTACCTCGAG GCTCGTTGCA TTTACATCAC CAAAGCCGCA GGCGTTCAGG GTCTGCAGAA TGGTTCGGTA AGCTGCATCG GTGTTCCATC AGCTGTGCCA TCAGGCATCC GCGCAGTATT AGCGGAAAAC TTAATTTGCT CATCGCTGGA TCTGGAATGC GCCTCCAGTA ATGACCAGAC CTTTACCCAC TCGGATATGC GCCGTACCGC ACGTTTCCTG ATGCAATTTC TGCCAGGCAC CGACTTTATC TCTTCCGGTT TTTCCGCTGT GCCGAACTAC GACAACATGT TTGCTGGTTC AAACGAAGAT GCTGAAGACT TCGATGATTA CAACGTCATT CAGCGTGACC TGAAGGTTGA TGGTGGGCTG CGACCAGTAC GCGAAGAAGA TGTTATCGCT ATCCGCAATA AAGCCGCTCG CGCTTTACAG GCTGTCTTTG CCGGTATGGG ACTACCGCCT ATTACCGATG AAGAAGTCGA AGCGGCAACC TATGCCCACG GTTCAAAAGA TATGCCTGAG CGCAATATTG TGGAAGACAT CAAGTTCGCC CAGGAGATTA TCAATAAAAA CCGGAACAGC CTGGAAGTGG TGAAAGCCCT GGCGCAAGGC GGTTTTACCG ATGTCGCCCA GGACATGCTC AACATGCAAA AAGCAAAACT GACCGGCGAT TATCTCCATA CCTCAGCCAT CATCGTCGAT GACGGACAAG TGCTCTCTGC GGTCAATGAC GTCAATGATT ACGCCGGACC GGCTACAGGT TACCGCCTGC AAGGGGAACG CTGGGAAGAG ATCAAAAATA TTCCCGGTGC ACTTGATCCC AACGAAATTG ACTAA
|
Protein sequence | MRSKRFEALA KRPVNQDGFV KEWIEEGFIA MESPNDPKPS IKIVNGVVTE LDGKPASQFD LIDHFIARYG INLARAEEVI AMDSVKLANM LCDPNVKRSD IVPLTTAMTP AKIVEVVSQM NVVEMMMAMQ KMRARRTPSQ QAHVTNVKDN PVQIAADAAE GAWRGFDEQE TTVAVARYAP FNAIALLVGS QVGRPGVLTQ CSLEEATELK LGMLGHTCYA ETISVYGTEP VFTDGDDTPW SKGFLASSYA SRGLKMRFTS GSGSEVQMGY AEGKSMLYLE ARCIYITKAA GVQGLQNGSV SCIGVPSAVP SGIRAVLAEN LICSSLDLEC ASSNDQTFTH SDMRRTARFL MQFLPGTDFI SSGFSAVPNY DNMFAGSNED AEDFDDYNVI QRDLKVDGGL RPVREEDVIA IRNKAARALQ AVFAGMGLPP ITDEEVEAAT YAHGSKDMPE RNIVEDIKFA QEIINKNRNS LEVVKALAQG GFTDVAQDML NMQKAKLTGD YLHTSAIIVD DGQVLSAVND VNDYAGPATG YRLQGERWEE IKNIPGALDP NEID
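
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before use. The sketch below strips the spacing and runs a few basic sanity checks on a coding sequence. All function names are hypothetical helpers. The start-codon set is an assumption covering only the common bacterial starts, not the full list for translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}    # stop codons in translation table 11
START_CODONS = {"ATG", "GTG", "TTG"}   # assumption: common bacterial starts only


def normalize_sequence(raw: str) -> str:
    """Remove the block spacing and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the start/stop codons fit."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence shown above.
print(normalize_sequence("ATGAGATCGA AAAGATTTGA"))  # ATGAGATCGAAAAGATTTGA
```

Applied to the full Gene sequence field, `gc_fraction` can be compared with the GC content field, and `looks_like_complete_cds` with the ATG start and TAA stop visible at the ends of the sequence.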