Gene EcE24377A_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2081 
Symboledd 
ID5588546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2060390 
End bp2062201 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content54% 
IMG OID640925751 
Productphosphogluconate dehydratase 
Protein accessionYP_001463154 
Protein GI157157338 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAC AATTGTTACG CGTAACAAAT CGAATCATTG AACGTTCGCG CGAGACTCGC 
TCTGCTTATC TCGCCCGGAT AGAACAAGCG AAAACTTCGA CCGTTCATCG TTCGCAGTTG
GCATGCGGTA ACCTGGCACA CGGTTTCGCT GCCTGCCAGC CAGAAGACAA AGCCTCTTTG
AAAAGCATGT TGCGTAACAA TATCGCCATC ATCACCTCCT ATAACGACAT GCTCTCCGCG
CACCAGCCTT ATGAACACTA TCCAGAAATC ATTCGTAAAG CCCTGCATGA AGCGAATGCG
GTTGGTCAGG TTGCGGGCGG TGTTCCGGCG ATGTGTGATG GTGTCACCCA GGGGCAGGAT
GGAATGGAAT TGTCGCTGCT AAGCCGCGAA GTGATAGCGA TGTCTGCGGC GGTCGGGCTG
TCCCATAACA TGTTTGATGG TGCTCTGTTC CTCGGTGTGT GCGACAAGAT TGTCCCGGGT
CTGACGATGG CAGCCCTGTC GTTTGGTCAT TTGCCTGCGG TGTTTGTGCC GTCTGGACCG
ATGGCAAGCG GTTTGCCAAA TAAAGAAAAA GTGCGTATTC GCCAGCTTTA TGCCGAAGGT
AAAGTGGACC GCATGGCCTT ACTGGAGTCA GAAGCCGCGT CTTACCATGC GCCGGGAACA
TGTACTTTCT ACGGTACTGC CAACACCAAC CAGATGGTGG TGGAATTTAT GGGGATGCAG
TTGCCAGGCT CTTCTTTTGT TCATCCGGAT TCTCCGCTGC GCGATGCTTT GACCGCTGCC
GCTGCGCGTC AGGTTACACG CATGACCGGT AATGGTAATG AATGGATGCC GATCGGTAAG
ATGATCGATG AGAAAGTGGT GGTGAACGGT ATCGTTGCAC TGCTGGCGAC CGGTGGTTCC
ACTAACCACA CCATGCACCT GGTGGCGATG GCGCGCGCGG CCGGTATTCA GATTAACTGG
GATGACTTCT CTGACCTTTC TGATGTTGTA CCGCTGATGG CACGTCTCTA CCCGAACGGT
CCGGCCGATA TTAACCACTT CCAGGCGGCA GGTGGCGTAC CGGTTCTGGT GCGTGAACTG
CTCAAAGCAG GCCTGCTGCA TGAAGATGTC AATACGGTGG CAGGCTTTGG TCTGTCTCGT
TATACCCTTG AACCATGGCT GAATAATGGT GAACTGGACT GGCGGGAAGG GGCGGAAAAA
TCACTCGACA GCAATGTGAT CGCTTCCTTC GAACAACCTT TCTCTCATCA TGGTGGGACA
AAAGTGTTAA GCGGTAACCT GGGCCGTGCG GTTATGAAAA CCTCTGCCGT GCCGGTTGAG
AACCAGGTGA TTGAAGCGCC AGCGGTTGTT TTTGAAAGCC AGCATGACGT TATGCCGGCC
TTTGAAGCGG GTTTGCTGGA CCGCGATTGT GTCGTTGTTG TCCGTCATCA GGGGCCAAAA
GCGAACGGAA TGCCAGAATT ACATAAACTC ATGCCGCCAC TTGGTGTATT ATTGGACCGG
TGTTTCAAAA TTGCGTTAGT TACCGATGGA CGACTCTCCG GCGCTTCAGG TAAAGTGCCG
TCAGCTATCC ACGTAACACC AGAAGCCTAC GATGGCGGGC TGCTGGCAAA AGTGCGCGAC
GGGGACATCA TTCGTGTGAA TGGACAGACA GGCGAACTGA CGCTGCTGGT AGACGAAGCG
GAACTGGCTG CTCGCGAACC GCACATTCCT GACCTGAGCG CGTCACGCGT GGGGACAGGA
CGTGAATTAT TCAGCGCCTT GCGTGAAAAA CTGTCCGGTG CCGAACAGGG CGCAACCTGT
ATCACTTTTT AA
 
Protein sequence
MNPQLLRVTN RIIERSRETR SAYLARIEQA KTSTVHRSQL ACGNLAHGFA ACQPEDKASL 
KSMLRNNIAI ITSYNDMLSA HQPYEHYPEI IRKALHEANA VGQVAGGVPA MCDGVTQGQD
GMELSLLSRE VIAMSAAVGL SHNMFDGALF LGVCDKIVPG LTMAALSFGH LPAVFVPSGP
MASGLPNKEK VRIRQLYAEG KVDRMALLES EAASYHAPGT CTFYGTANTN QMVVEFMGMQ
LPGSSFVHPD SPLRDALTAA AARQVTRMTG NGNEWMPIGK MIDEKVVVNG IVALLATGGS
TNHTMHLVAM ARAAGIQINW DDFSDLSDVV PLMARLYPNG PADINHFQAA GGVPVLVREL
LKAGLLHEDV NTVAGFGLSR YTLEPWLNNG ELDWREGAEK SLDSNVIASF EQPFSHHGGT
KVLSGNLGRA VMKTSAVPVE NQVIEAPAVV FESQHDVMPA FEAGLLDRDC VVVVRHQGPK
ANGMPELHKL MPPLGVLLDR CFKIALVTDG RLSGASGKVP SAIHVTPEAY DGGLLAKVRD
GDIIRVNGQT GELTLLVDEA ELAAREPHIP DLSASRVGTG RELFSALREK LSGAEQGATC
ITF