Gene EcolC_1781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1781 
Symbol 
ID6066403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1981310 
End bp1983121 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content54% 
IMG OID641601196 
Productphosphogluconate dehydratase 
Protein accessionYP_001724758 
Protein GI170019804 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.183249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCAC AAATGTTACG CGTAACAAAT CGAATCATTG AACGTTCGCG CGAGACTCGC 
TCTGCTTATC TCGCCCGGAT AGAACAAGCG AAAACGTCGA CCGTTCATCG TTCGCAGTTG
GCATGCGGTA ACCTGGCACA CGGTTTCGCT GCCTGCCAGC CAGAAGACAA AGCCTCTTTG
AAAAGCATGT TGCGTAACAA TATCGCCATC ATCACCTCCT ATAACGACAT GCTCTCCGCG
CACCAGCCTT ATGAACACTA TCCAGAAATC ATTCGTAAAG CCCTGCATGA AGCGAATGCG
GTTGGTCAGG TTGCGGGCGG TGTTCCGGCG ATGTGTGATG GTGTCACCCA GGGGCAGGAT
GGAATGGAAT TGTCGCTGCT AAGCCGCGAA GTGATAGCGA TGTCTGCGGC GGTCGGGCTG
TCCCATAACA TGTTTGATGG TGCTCTGTTC CTCGGGGTGT GCGACAAGAT TGTCCCGGGT
CTGACGATGG CAGCCCTGTC GTTTGGTCAT TTGCCTGCGG TGTTTGTACC GTCTGGACCG
ATGGCAAGCG GTTTGCCAAA TAAAGAAAAA GTGCGTATTC GCCAGCTTTA TGCCGAAGGT
AAAGTGGACC GCATGGCCCT ACTGGAGTCA GAAGCCGCGT CTTATCATGC GCCGGGAACA
TGTACTTTCT ACGGTACTGC CAACACCAAC CAGATGGTGG TGGAGTTTAT GGGGATGCAG
TTGCCAGGCT CTTCTTTTGT TCATCCGGAC TCTCCGCTGC GCGATGCTTT GACCGCTGCC
GCTGCGCGTC AGGTTACACG CATGACCGGT AATGGTAATG AATGGATGCC GATCGGTAAG
ATGATCGATG AGAAAGTGGT GGTGAACGGT ATTGTTGCAC TGCTGGCGAC CGGTGGTTCC
ACTAACCACA CCATGCACCT GGTGGCGATG GCGCGCGCGG CCGGTATTCA GATTAACTGG
GATGACTTCT CTGACCTTTC TGATGTTGTA CCGCTGATGG CACGTCTCTA CCCGAACGGT
CCGTCCGATA TTAACCACTT CCAGGCGGCA GGTGGCGTAC CGGTTCTGGT GCGTGAACTG
CTCAAAGCAG GTCTGCTGCA TGAAGATGTC AATACGGTGG CAGGCTTTGG TCTGTCTCGT
TATACGCTTG AACCATGGCT GAATAATGGT GAACTGGACT GGCGGGAAGG GGCGGAAAAA
TCACTCGACG GCAATGTGAT CGCTTCCTTC GAACAACCTT TCTCTCATCA TGGTGGGACA
AAAGTGTTAA GCGGTAACCT GGGCCGTGCG GTTATGAAAA CCTCTGCCGT GCCGGTTGAG
AACCAGGTGA TTGAAGCGCC AGCGGTTGTT TTTGAAAGCC AGCATGACGT TATGCCGGCC
TTTGAAGCGG GTTTGCTGGA CCGCGATTGT GTCGTTGTTG TCCGTCATCA GGGGCCAAAA
GCGAACGGAA TGCCAGAATT ACATAAACTC ATGCCGCCAC TTGGTGTATT ATTGGACCGG
TGTTTCAAAA TTGCGTTAGT TACCGATGGA CGACTCTCCG GCGCTTCAGG TAAAGTGCCG
TCAGCTATCC ACGTAACACC AGAAGCCTAC GATGGCGGGC TGCTGGCAAA AGTGCGCGAC
GGGGACATCA TTCGTGTGAA TGGACAGACA GGCGAACTGA CGCTGCTGGT AGACGAAGCG
GAACTGGCTG CTCGCGAACC GCACATTCCT GACCTGAGCG CGTCACGCGT GGGAACAGGA
CGTGAATTAT TCAGCGCCTT GCGTGAAAAA CTGTCCGGTG CCGAACAGGG CGCAACCTGT
ATCACTTTTT AA
 
Protein sequence
MNPQMLRVTN RIIERSRETR SAYLARIEQA KTSTVHRSQL ACGNLAHGFA ACQPEDKASL 
KSMLRNNIAI ITSYNDMLSA HQPYEHYPEI IRKALHEANA VGQVAGGVPA MCDGVTQGQD
GMELSLLSRE VIAMSAAVGL SHNMFDGALF LGVCDKIVPG LTMAALSFGH LPAVFVPSGP
MASGLPNKEK VRIRQLYAEG KVDRMALLES EAASYHAPGT CTFYGTANTN QMVVEFMGMQ
LPGSSFVHPD SPLRDALTAA AARQVTRMTG NGNEWMPIGK MIDEKVVVNG IVALLATGGS
TNHTMHLVAM ARAAGIQINW DDFSDLSDVV PLMARLYPNG PSDINHFQAA GGVPVLVREL
LKAGLLHEDV NTVAGFGLSR YTLEPWLNNG ELDWREGAEK SLDGNVIASF EQPFSHHGGT
KVLSGNLGRA VMKTSAVPVE NQVIEAPAVV FESQHDVMPA FEAGLLDRDC VVVVRHQGPK
ANGMPELHKL MPPLGVLLDR CFKIALVTDG RLSGASGKVP SAIHVTPEAY DGGLLAKVRD
GDIIRVNGQT GELTLLVDEA ELAAREPHIP DLSASRVGTG RELFSALREK LSGAEQGATC
ITF