Gene EcHS_A1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1943 
Symboledd 
ID5594033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1951770 
End bp1953581 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content54% 
IMG OID640921088 
Productphosphogluconate dehydratase 
Protein accessionYP_001458637 
Protein GI157161319 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAC AAATGTTACG CGTAACAAAT CGAATCATTG AACGTTCGCG CGAGACTCGC 
TCTGCTTATC TCGCCCGGAT AGAACAAGCG AAAACGTCGA CCGTTCATCG TTCGCAGTTG
GCATGCGGTA ACCTGGCACA CGGTTTCGCT GCCTGCCAGC CAGAAGACAA AGCCTCTTTG
AAAAGCATGT TGCGTAACAA TATCGCCATC ATCACCTCCT ATAACGACAT GCTCTCCGCG
CACCAGCCTT ATGAACACTA TCCAGAAATC ATTCGTAAAG CCCTGCATGA AGCGAATGCG
GTTGGTCAGG TTGCGGGCGG TGTTCCGGCG ATGTGTGATG GTGTCACCCA GGGGCAGGAT
GGAATGGAAT TGTCGCTGCT AAGCCGCGAA GTGATAGCGA TGTCTGCGGC GGTCGGGCTG
TCCCATAACA TGTTTGATGG TGCTCTGTTC CTCGGGGTGT GCGACAAGAT TGTCCCGGGT
CTGACGATGG CAGCCCTGTC GTTTGGTCAT TTGCCTGCGG TGTTTGTACC GTCTGGACCG
ATGGCAAGCG GTTTGCCAAA TAAAGAAAAA GTGCGTATTC GCCAGCTTTA TGCCGAAGGT
AAAGTGGACC GCATGGCCCT ACTGGAGTCA GAAGCCGCGT CTTATCATGC GCCGGGAACA
TGTACTTTCT ACGGTACTGC CAACACCAAC CAGATGGTGG TGGAGTTTAT GGGGATGCAG
TTGCCAGGCT CTTCTTTTGT TCATCCGGAC TCTCCGCTGC GCGATGCTTT GACCGCTGCC
GCTGCGCGTC AGGTTACACG CATGACCGGT AATGGTAATG AATGGATGCC GATCGGTAAG
ATGATCGATG AGAAAGTGGT GGTGAACGGT ATTGTTGCAC TGCTGGCGAC CGGTGGTTCC
ACTAACCACA CCATGCACCT GGTGGCGATG GCGCGCGCGG CCGGTATTCA GATTAACTGG
GATGACTTCT CTGACCTTTC TGATGTTGTA CCGCTGATGG CACGTCTCTA CCCGAACGGT
CCGGCCGATA TTAACCACTT CCAGGCGGCA GGTGGCGTAC CGGTTCTGGT GCGTGAACTG
CTCAAAGCAG GTCTGCTGCA TGAAGATGTC AATACGGTGG CAGGCTTTGG TCTGTCTCGT
TATACGCTTG AACCATGGCT GAATAATGGT GAACTGGACT GGCGGGAAGG GGCGGAAAAA
TCACTCGACG GCAATGTGAT CGCTTCCTTC GAACAACCTT TCTCTCATCA TGGTGGGACA
AAAGTGTTAA GCGGTAACCT GGGCCGTGCG GTTATGAAAA CCTCTGCCGT GCCGGTTGAG
AACCAGGTGA TTGAAGCGCC AGCGGTTGTT TTTGAAAGCC AGCATGACGT TATGCCGGCC
TTTGAAGCGG GTTTGCTGGA CCGCGATTGT GTCGTTGTTG TCCGTCATCA GGGGCCAAAA
GCGAACGGAA TGCCAGAATT ACATAAACTC ATGCCGCCAC TTGGTGTATT ATTGGACCGG
TGTTTCAAAA TTGCGTTAGT TACCGATGGA CGACTCTCCG GCGCTTCAGG TAAAGTGCCG
TCAGCTATCC ACGTAACACC AGAAGCCTAC GATGGCGGGC TGCTGGCAAA AGTGCGCGAC
GGGGACATCA TTCGTGTGAA TGGACAGACA GGCGAACTGA CGCTGCTGGT AGACGAAGCG
GAACTGGCTG CTCGCGAACC GCACATTCCT GACCTGAGCG CGTCACGCGT GGGAACAGGA
CGTGAATTAT TCAGCGCCTT GCGTGAAAAA CTGTCCGGTG CCGAACAGGG CGCAACCTGT
ATCACTTTTT AA
 
Protein sequence
MNPQMLRVTN RIIERSRETR SAYLARIEQA KTSTVHRSQL ACGNLAHGFA ACQPEDKASL 
KSMLRNNIAI ITSYNDMLSA HQPYEHYPEI IRKALHEANA VGQVAGGVPA MCDGVTQGQD
GMELSLLSRE VIAMSAAVGL SHNMFDGALF LGVCDKIVPG LTMAALSFGH LPAVFVPSGP
MASGLPNKEK VRIRQLYAEG KVDRMALLES EAASYHAPGT CTFYGTANTN QMVVEFMGMQ
LPGSSFVHPD SPLRDALTAA AARQVTRMTG NGNEWMPIGK MIDEKVVVNG IVALLATGGS
TNHTMHLVAM ARAAGIQINW DDFSDLSDVV PLMARLYPNG PADINHFQAA GGVPVLVREL
LKAGLLHEDV NTVAGFGLSR YTLEPWLNNG ELDWREGAEK SLDGNVIASF EQPFSHHGGT
KVLSGNLGRA VMKTSAVPVE NQVIEAPAVV FESQHDVMPA FEAGLLDRDC VVVVRHQGPK
ANGMPELHKL MPPLGVLLDR CFKIALVTDG RLSGASGKVP SAIHVTPEAY DGGLLAKVRD
GDIIRVNGQT GELTLLVDEA ELAAREPHIP DLSASRVGTG RELFSALREK LSGAEQGATC
ITF