Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2291 |
Symbol | pduO |
ID | 5585941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2255406 |
End bp | 2256413 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640925956 |
Product | propanediol utilization protein, PduO |
Protein accession | YP_001463351 |
Protein GI | 157158139 |
COG category | [R] General function prediction only [S] Function unknown |
COG ID | [COG2096] Uncharacterized conserved protein [COG3193] Uncharacterized protein, possibly involved in utilization of glycolate and propanediol |
TIGRFAM ID | [TIGR00636] ATP:cob(I)alamin adenosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTT ATACCCGAAC AGGTGATTCA GGAAGTACGT CTTTATTTAC CGGACAGCGC GTCAGCAAAA CTCATCTACG TGTCGAAACC TATGGCACGC TGGATGAACT GAACGCGACA CTCAGTCTGT GTTATTGCGC CACCGCCATC GAAAGCCATC GCATCTTACT GGAAGCCATT CAACAACAGA TATTTTGGTT TAGCGCCGAA CTCGCCAGTG AAAGTGAACA GCCGTCAGCG CAACAGCGTT ACATCGGTAC GGAAGAAATT GCTGCGCTGG AAAACGCGAT CGATAGCGCA ATGAACGCCG TCCCCCCAGT TCACAGCTTC ATTCTGCCTG GTCGATGTGA AGCCGCGAGC CGCATGCATT TCGCCCGCAC AGTGGCTCGA CGGGCTGAAA GACGTCTGGT TGAACTGACA ACGGAAACCA CCGTCCGGAA TGTTTTGCTG CACTATATCA ACCGACTTTC CGATTGTCTG TATGCCCTCG CCCGCGTGGA AGACAACGTT GCTCATCAAA ACCTGATGAT TCAGGAAATC ACAAAACGTT ATCACGAAGC CAACCACATA CCAGCATTGA AGGAACGCAC AATGCCGCTC ACTTTTCAGG ATCTTCACCA GCTCATTCGT AGTGCGGCAA TGCGCGCGGA TGAACTGCAT ATTCCCGTTG TCATCAGCAT TGTTGATGCC AACGGAACAG AAAGTGTTAC CTGGCGGATG CCTGATGCCC TGTTGGTCAG CAGTGAACTG GCACCGAAAA AAGCCTGGAC CGCCGTGGCG ATGAAAACGG CAACCCACAA ACTTGCTGAT ACCGTGCAAC CCGGCGCGCC GCTTTACGGG CTCGAAAGCC ATATGCAAGG CAAAGTCGTC ACTTTTGGTG GCGGCTTCCC CCTCTGGCGT GACGGAAAAT TGCTTGGCGG GCTTGGCATC AGCGGCGGTA GCGTTGAACA AGACATGGAT ATTGCTCAAA GCGCAATGGC GGCAATTAAC GTGGGAGTAA ACCAATGA
|
Protein sequence | MAIYTRTGDS GSTSLFTGQR VSKTHLRVET YGTLDELNAT LSLCYCATAI ESHRILLEAI QQQIFWFSAE LASESEQPSA QQRYIGTEEI AALENAIDSA MNAVPPVHSF ILPGRCEAAS RMHFARTVAR RAERRLVELT TETTVRNVLL HYINRLSDCL YALARVEDNV AHQNLMIQEI TKRYHEANHI PALKERTMPL TFQDLHQLIR SAAMRADELH IPVVISIVDA NGTESVTWRM PDALLVSSEL APKKAWTAVA MKTATHKLAD TVQPGAPLYG LESHMQGKVV TFGGGFPLWR DGKLLGGLGI SGGSVEQDMD IAQSAMAAIN VGVNQ
|
| |