Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1336 |
Symbol | edd |
ID | 6142790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1325501 |
End bp | 1327312 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616214 |
Product | phosphogluconate dehydratase |
Protein accession | YP_001743394 |
Protein GI | 170684043 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR01196] 6-phosphogluconate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCAC AATTGTTACG CGTAACAAAT CGAATCATTG AACGTTCGCG CGAGACTCGC TCTGCTTATC TCGCCCGGAT AGAACAAGCG AAAACTTCGA CCGTTCATCG TTCGCAGTTG GCATGCGGTA ACCTGGCACA CGGTTTCGCT GCCTGCCAGC CAGAAGACAA AGCCTCTTTG AAAAGCATGT TGCGTAACAA TATCGCCATC ATCACCTCCT ATAACGACAT GCTCTCCGCG CACCAGCCTT ATGAACACTA TCCAGAAATC ATTCGTAAAG CCCTGCATGA AGCAAATGCG GTTGGTCAGG TTGCGGGCGG TGTTCCGGCG ATGTGTGATG GTGTCACCCA GGGGCAGGAT GGAATGGAAT TGTCGCTGTT AAGCCGCGAA GTGATTGCGA TGTCTGCGGC GGTGGGGCTG TCCCATAACA TGTTTGATGG TGCGCTGTTC CTCGGTGTGT GCGACAAGAT TGTTCCGGGT CTGACTATGG CAGCCCTGTC GTTTGGGCAT TTGCCCGCGG TGTTTGTGCC GTCTGGACCG ATGGCAAGCG GTTTGCCAAA TAAAGAAAAA GTGCGTATTC GTCAGCTTTA TGCCGAAGGT AAAGTGGACC GCATGGCACT ACTGGAGTCA GAAGCCGCAT CTTATCATGC GCCGGGAACA TGTACTTTCT ACGGTACTGC CAACACCAAC CAGATGGTGG TGGAGTTTAT GGGGATGCAG TTGCCAGGCT CTTCATTTGT TCATCCGGAT TCTCCGCTGC GCGATGCTTT GACCGCAGCC GCTGCGCGCC AGGTTACACG CATGACCGGT AATGGTAATG AATGGATGCC GATCGGTAAG ATGATAGATG AGAAAGTGGT GGTGAACGGT ATCGTTGCAC TGCTGGCGAC CGGTGGTTCC ACTAACCACA CCATGCACCT GGTGGCGATG GCACGCGCGG CCGGTATTCA GATTAACTGG GATGACTTCT CTGACCTTTC TGATGTTGTA CCGCTGATGG CACGTCTCTA CCCGAACGGT CCGGCTGATA TTAACCACTT CCAGGCGGCA GGTGGCGTAC CGGTTCTGGT GCGTGAACTG CTCAAAGCAG GCCTGCTGCA TGAAGATGTC AATACGGTGG CAGGCTTTGG TCTGTCTCGT TATACCCTTG AACCATGGCT GAATAATGGT GAACTGGACT GGCGGGAAGG GGCGGAAAAA TCACTCGACA GCAATGTGAT CGCTTCCTTC GAACAACCTT TCTCTCATCA TGGTGGGACA AAAGTGTTAA GCGGTAACCT GGGCCGTGCG GTTATGAAAA CCTCTGCCGT GCCGGTAGAG AACCAGGTGA TTGAAGCGCC AGCGGTTGTT TTTGAAAGCC AGCATGACGT TATGCCGGCC TTTGAAGCGG GTTTGCTGGA CCGCGATTGT GTCGTTGTTG TCCGTCATCA GGGGCCAAAA GCGAACGGAA TGCCAGAATT ACATAAACTC ATGCCGCCAC TTGGTGTATT ATTGGACCGG TGTTTCAAAA TTGCGTTAGT TACCGATGGA CGACTCTCCG GCGCTTCAGG TAAAGTGCCG TCAGCTATCC ACGTAACACC AGAGGCCTAC GATGGCGGGC TGCTGGCAAA AGTGCGCGAC GGGGACATCA TTCGTGTGAA TGGACAGACA GGCGAACTGA CGCTGCTGGT AGACGAAGCG GAACTGGCTG CTCGCGAACC GCACATTCCT GACCTGAGCG CGTCACGCGT GGGAACAGGA CGTGAATTAT TCAGCGCCTT GCGTGAAAAA CTGTCCGGTG CCGAACAGGG CGCAACCTGT ATCACTTTTT AA
|
Protein sequence | MNPQLLRVTN RIIERSRETR SAYLARIEQA KTSTVHRSQL ACGNLAHGFA ACQPEDKASL KSMLRNNIAI ITSYNDMLSA HQPYEHYPEI IRKALHEANA VGQVAGGVPA MCDGVTQGQD GMELSLLSRE VIAMSAAVGL SHNMFDGALF LGVCDKIVPG LTMAALSFGH LPAVFVPSGP MASGLPNKEK VRIRQLYAEG KVDRMALLES EAASYHAPGT CTFYGTANTN QMVVEFMGMQ LPGSSFVHPD SPLRDALTAA AARQVTRMTG NGNEWMPIGK MIDEKVVVNG IVALLATGGS TNHTMHLVAM ARAAGIQINW DDFSDLSDVV PLMARLYPNG PADINHFQAA GGVPVLVREL LKAGLLHEDV NTVAGFGLSR YTLEPWLNNG ELDWREGAEK SLDSNVIASF EQPFSHHGGT KVLSGNLGRA VMKTSAVPVE NQVIEAPAVV FESQHDVMPA FEAGLLDRDC VVVVRHQGPK ANGMPELHKL MPPLGVLLDR CFKIALVTDG RLSGASGKVP SAIHVTPEAY DGGLLAKVRD GDIIRVNGQT GELTLLVDEA ELAAREPHIP DLSASRVGTG RELFSALREK LSGAEQGATC ITF
|
| |