Gene EcolC_4231 details

Gene Information

Locus tag: EcolC_4231
Symbol:
ID: 6067855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4674656
End bp: 4676506
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 56%
IMG OID: 641603662
Product: dihydroxy-acid dehydratase
Protein accession: YP_001727154
Protein GI: 170022200
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
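
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; both are assumptions about how the record reports coordinates, not statements taken from it. The variable names are illustrative only.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 4674656
end_bp = 4676506

# Inclusive span in base pairs.
gene_length = end_bp - start_bp + 1

# Number of codons, assuming the span includes the stop codon.
codons = gene_length // 3

# The stop codon does not contribute an amino acid.
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1851 616
```

Under these assumptions the result matches the listed values of 1851 bp and 616 aa.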


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.604372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTAAGT ATCGTTCCGC TACCACCACC CATGGCCGTA ATATGGCGGG GGCCCGCGCA 
CTGTGGCGCG CCACCGGGAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG
AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC
GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT
GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TGCCATCTCG CGAACTGATC
GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ATGCCATGGT CTGTATCTCC
AACTGCGACA AAATCACCCC TGGGATGTTG ATGGCTTCCC TGCGATTAAA TATTCCGGTG
ATCTTTGTTT CCGGCGGCCC GATGGAAGCC GGGAAAACCA AGCTGTCCGA TCGGATAATC
AAGCTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTCTC TGACTCCCAG
AGCGATCAGG TTGAACGTTC CGCCTGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC
GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGTTTGT CGCAGCCAGG CAACGGCTCG
CTGCTGGCAA CCCACGCGGA CCGTAAGCAG CTGTTCCTTA ATGCTGGTAA ACGCATTGTT
GAATTGACCA AACGTTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAATATCGCC
AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC
ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGATAAGC TCTCCCGCAA GGTTCCGCAG CTGTGTAAAG TTGCGCCGAG CACCCAGAAA
TACCATATGG AAGATGTTCA TCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT
CGTGCGGGGT TACTGAACCG TGATGTGAAA AACGTACTTG GCCTGACGTT GCCGCAAACG
CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCA
GGCCCGGCGG GCATTCGGAC TACACAGGCA TTCTCGCAGG ATTGCCGTTG GGATTCTCTC
GATGACGATC GCGCAAACGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGACGGC
GGCCTGGCGG TGCTCTACGG TAATTTCGCA GAAAACGGCT GCATCGTTAA AACCGCGGGC
GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGACGAC
GCGGTAGAAG CGATTCTCGG CGGTAAAGTT GTCGCCGGAG ATGTGGTAGT AATTCGCTAT
GAAGGCCCGA AAGGCGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA
TCAATGGGGC TCGGTAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGCGGCACC
TCTGGCCTTT CTATCGGGCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG
ATTGAAGACG GCGATCTTAT CGCTATCGAC ATTCCGAACC GTGGTATTCA GTTACAGGTA
AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CCCGGGGTGA CAAAGCCTGG
ACGCCGAAAA ACCGTGAACG TCAGGTTTCC TTTGCGCTGC GTGCCTACGC CAGCCTGGCG
ACCAGCGCCG ACAAAGGTGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDRII KLDLVDAMIQ GADPKVSDSQ
SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV
ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT
LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDSL DDDRANGCIR SLEHAYSKDG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL
IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQVS FALRAYASLA
TSADKGAVRD KSKLGG
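
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-percentage calculation; the function names `clean_blocks` and `gc_percent` are hypothetical helpers introduced here for illustration and are not part of the record or of any particular library.

```python
def clean_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence as displayed above.
first_line = "ATGCCTAAGT ATCGTTCCGC TACCACCACC CATGGCCGTA ATATGGCGGG GGCCCGCGCA"
dna = clean_blocks(first_line)
print(len(dna), dna[:3])  # prints: 60 ATG
```

Passing the complete gene sequence through `clean_blocks` and then `gc_percent` would give a value to compare against the GC content field of the record.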