Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1943 |
Symbol | edd |
ID | 5594033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1951770 |
End bp | 1953581 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921088 |
Product | phosphogluconate dehydratase |
Protein accession | YP_001458637 |
Protein GI | 157161319 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR01196] 6-phosphogluconate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCAC AAATGTTACG CGTAACAAAT CGAATCATTG AACGTTCGCG CGAGACTCGC TCTGCTTATC TCGCCCGGAT AGAACAAGCG AAAACGTCGA CCGTTCATCG TTCGCAGTTG GCATGCGGTA ACCTGGCACA CGGTTTCGCT GCCTGCCAGC CAGAAGACAA AGCCTCTTTG AAAAGCATGT TGCGTAACAA TATCGCCATC ATCACCTCCT ATAACGACAT GCTCTCCGCG CACCAGCCTT ATGAACACTA TCCAGAAATC ATTCGTAAAG CCCTGCATGA AGCGAATGCG GTTGGTCAGG TTGCGGGCGG TGTTCCGGCG ATGTGTGATG GTGTCACCCA GGGGCAGGAT GGAATGGAAT TGTCGCTGCT AAGCCGCGAA GTGATAGCGA TGTCTGCGGC GGTCGGGCTG TCCCATAACA TGTTTGATGG TGCTCTGTTC CTCGGGGTGT GCGACAAGAT TGTCCCGGGT CTGACGATGG CAGCCCTGTC GTTTGGTCAT TTGCCTGCGG TGTTTGTACC GTCTGGACCG ATGGCAAGCG GTTTGCCAAA TAAAGAAAAA GTGCGTATTC GCCAGCTTTA TGCCGAAGGT AAAGTGGACC GCATGGCCCT ACTGGAGTCA GAAGCCGCGT CTTATCATGC GCCGGGAACA TGTACTTTCT ACGGTACTGC CAACACCAAC CAGATGGTGG TGGAGTTTAT GGGGATGCAG TTGCCAGGCT CTTCTTTTGT TCATCCGGAC TCTCCGCTGC GCGATGCTTT GACCGCTGCC GCTGCGCGTC AGGTTACACG CATGACCGGT AATGGTAATG AATGGATGCC GATCGGTAAG ATGATCGATG AGAAAGTGGT GGTGAACGGT ATTGTTGCAC TGCTGGCGAC CGGTGGTTCC ACTAACCACA CCATGCACCT GGTGGCGATG GCGCGCGCGG CCGGTATTCA GATTAACTGG GATGACTTCT CTGACCTTTC TGATGTTGTA CCGCTGATGG CACGTCTCTA CCCGAACGGT CCGGCCGATA TTAACCACTT CCAGGCGGCA GGTGGCGTAC CGGTTCTGGT GCGTGAACTG CTCAAAGCAG GTCTGCTGCA TGAAGATGTC AATACGGTGG CAGGCTTTGG TCTGTCTCGT TATACGCTTG AACCATGGCT GAATAATGGT GAACTGGACT GGCGGGAAGG GGCGGAAAAA TCACTCGACG GCAATGTGAT CGCTTCCTTC GAACAACCTT TCTCTCATCA TGGTGGGACA AAAGTGTTAA GCGGTAACCT GGGCCGTGCG GTTATGAAAA CCTCTGCCGT GCCGGTTGAG AACCAGGTGA TTGAAGCGCC AGCGGTTGTT TTTGAAAGCC AGCATGACGT TATGCCGGCC TTTGAAGCGG GTTTGCTGGA CCGCGATTGT GTCGTTGTTG TCCGTCATCA GGGGCCAAAA GCGAACGGAA TGCCAGAATT ACATAAACTC ATGCCGCCAC TTGGTGTATT ATTGGACCGG TGTTTCAAAA TTGCGTTAGT TACCGATGGA CGACTCTCCG GCGCTTCAGG TAAAGTGCCG TCAGCTATCC ACGTAACACC AGAAGCCTAC GATGGCGGGC TGCTGGCAAA AGTGCGCGAC GGGGACATCA TTCGTGTGAA TGGACAGACA GGCGAACTGA CGCTGCTGGT AGACGAAGCG GAACTGGCTG CTCGCGAACC GCACATTCCT GACCTGAGCG CGTCACGCGT GGGAACAGGA CGTGAATTAT TCAGCGCCTT GCGTGAAAAA CTGTCCGGTG CCGAACAGGG CGCAACCTGT ATCACTTTTT AA
|
Protein sequence | MNPQMLRVTN RIIERSRETR SAYLARIEQA KTSTVHRSQL ACGNLAHGFA ACQPEDKASL KSMLRNNIAI ITSYNDMLSA HQPYEHYPEI IRKALHEANA VGQVAGGVPA MCDGVTQGQD GMELSLLSRE VIAMSAAVGL SHNMFDGALF LGVCDKIVPG LTMAALSFGH LPAVFVPSGP MASGLPNKEK VRIRQLYAEG KVDRMALLES EAASYHAPGT CTFYGTANTN QMVVEFMGMQ LPGSSFVHPD SPLRDALTAA AARQVTRMTG NGNEWMPIGK MIDEKVVVNG IVALLATGGS TNHTMHLVAM ARAAGIQINW DDFSDLSDVV PLMARLYPNG PADINHFQAA GGVPVLVREL LKAGLLHEDV NTVAGFGLSR YTLEPWLNNG ELDWREGAEK SLDGNVIASF EQPFSHHGGT KVLSGNLGRA VMKTSAVPVE NQVIEAPAVV FESQHDVMPA FEAGLLDRDC VVVVRHQGPK ANGMPELHKL MPPLGVLLDR CFKIALVTDG RLSGASGKVP SAIHVTPEAY DGGLLAKVRD GDIIRVNGQT GELTLLVDEA ELAAREPHIP DLSASRVGTG RELFSALREK LSGAEQGATC ITF
|
| |