Gene EcSMS35_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1336 
Symboledd 
ID6142790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1325501 
End bp1327312 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content53% 
IMG OID641616214 
Productphosphogluconate dehydratase 
Protein accessionYP_001743394 
Protein GI170684043 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCAC AATTGTTACG CGTAACAAAT CGAATCATTG AACGTTCGCG CGAGACTCGC 
TCTGCTTATC TCGCCCGGAT AGAACAAGCG AAAACTTCGA CCGTTCATCG TTCGCAGTTG
GCATGCGGTA ACCTGGCACA CGGTTTCGCT GCCTGCCAGC CAGAAGACAA AGCCTCTTTG
AAAAGCATGT TGCGTAACAA TATCGCCATC ATCACCTCCT ATAACGACAT GCTCTCCGCG
CACCAGCCTT ATGAACACTA TCCAGAAATC ATTCGTAAAG CCCTGCATGA AGCAAATGCG
GTTGGTCAGG TTGCGGGCGG TGTTCCGGCG ATGTGTGATG GTGTCACCCA GGGGCAGGAT
GGAATGGAAT TGTCGCTGTT AAGCCGCGAA GTGATTGCGA TGTCTGCGGC GGTGGGGCTG
TCCCATAACA TGTTTGATGG TGCGCTGTTC CTCGGTGTGT GCGACAAGAT TGTTCCGGGT
CTGACTATGG CAGCCCTGTC GTTTGGGCAT TTGCCCGCGG TGTTTGTGCC GTCTGGACCG
ATGGCAAGCG GTTTGCCAAA TAAAGAAAAA GTGCGTATTC GTCAGCTTTA TGCCGAAGGT
AAAGTGGACC GCATGGCACT ACTGGAGTCA GAAGCCGCAT CTTATCATGC GCCGGGAACA
TGTACTTTCT ACGGTACTGC CAACACCAAC CAGATGGTGG TGGAGTTTAT GGGGATGCAG
TTGCCAGGCT CTTCATTTGT TCATCCGGAT TCTCCGCTGC GCGATGCTTT GACCGCAGCC
GCTGCGCGCC AGGTTACACG CATGACCGGT AATGGTAATG AATGGATGCC GATCGGTAAG
ATGATAGATG AGAAAGTGGT GGTGAACGGT ATCGTTGCAC TGCTGGCGAC CGGTGGTTCC
ACTAACCACA CCATGCACCT GGTGGCGATG GCACGCGCGG CCGGTATTCA GATTAACTGG
GATGACTTCT CTGACCTTTC TGATGTTGTA CCGCTGATGG CACGTCTCTA CCCGAACGGT
CCGGCTGATA TTAACCACTT CCAGGCGGCA GGTGGCGTAC CGGTTCTGGT GCGTGAACTG
CTCAAAGCAG GCCTGCTGCA TGAAGATGTC AATACGGTGG CAGGCTTTGG TCTGTCTCGT
TATACCCTTG AACCATGGCT GAATAATGGT GAACTGGACT GGCGGGAAGG GGCGGAAAAA
TCACTCGACA GCAATGTGAT CGCTTCCTTC GAACAACCTT TCTCTCATCA TGGTGGGACA
AAAGTGTTAA GCGGTAACCT GGGCCGTGCG GTTATGAAAA CCTCTGCCGT GCCGGTAGAG
AACCAGGTGA TTGAAGCGCC AGCGGTTGTT TTTGAAAGCC AGCATGACGT TATGCCGGCC
TTTGAAGCGG GTTTGCTGGA CCGCGATTGT GTCGTTGTTG TCCGTCATCA GGGGCCAAAA
GCGAACGGAA TGCCAGAATT ACATAAACTC ATGCCGCCAC TTGGTGTATT ATTGGACCGG
TGTTTCAAAA TTGCGTTAGT TACCGATGGA CGACTCTCCG GCGCTTCAGG TAAAGTGCCG
TCAGCTATCC ACGTAACACC AGAGGCCTAC GATGGCGGGC TGCTGGCAAA AGTGCGCGAC
GGGGACATCA TTCGTGTGAA TGGACAGACA GGCGAACTGA CGCTGCTGGT AGACGAAGCG
GAACTGGCTG CTCGCGAACC GCACATTCCT GACCTGAGCG CGTCACGCGT GGGAACAGGA
CGTGAATTAT TCAGCGCCTT GCGTGAAAAA CTGTCCGGTG CCGAACAGGG CGCAACCTGT
ATCACTTTTT AA
 
Protein sequence
MNPQLLRVTN RIIERSRETR SAYLARIEQA KTSTVHRSQL ACGNLAHGFA ACQPEDKASL 
KSMLRNNIAI ITSYNDMLSA HQPYEHYPEI IRKALHEANA VGQVAGGVPA MCDGVTQGQD
GMELSLLSRE VIAMSAAVGL SHNMFDGALF LGVCDKIVPG LTMAALSFGH LPAVFVPSGP
MASGLPNKEK VRIRQLYAEG KVDRMALLES EAASYHAPGT CTFYGTANTN QMVVEFMGMQ
LPGSSFVHPD SPLRDALTAA AARQVTRMTG NGNEWMPIGK MIDEKVVVNG IVALLATGGS
TNHTMHLVAM ARAAGIQINW DDFSDLSDVV PLMARLYPNG PADINHFQAA GGVPVLVREL
LKAGLLHEDV NTVAGFGLSR YTLEPWLNNG ELDWREGAEK SLDSNVIASF EQPFSHHGGT
KVLSGNLGRA VMKTSAVPVE NQVIEAPAVV FESQHDVMPA FEAGLLDRDC VVVVRHQGPK
ANGMPELHKL MPPLGVLLDR CFKIALVTDG RLSGASGKVP SAIHVTPEAY DGGLLAKVRD
GDIIRVNGQT GELTLLVDEA ELAAREPHIP DLSASRVGTG RELFSALREK LSGAEQGATC
ITF