Gene EcHS_A3988 details


Gene Information

Locus tag: EcHS_A3988
Symbol: ilvD
ID: 5591059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3981891
End bp: 3983741
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 56%
IMG OID: 640923093
Product: dihydroxy-acid dehydratase
Protein accession: YP_001460564
Protein GI: 157163246
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
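
The coordinate and length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3981891
END_BP = 3983741
GENE_LENGTH_BP = 1851
PROTEIN_LENGTH_AA = 616


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes an unspliced, complete CDS whose length includes the stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1851
    print(expected_protein_length(GENE_LENGTH_BP))  # 616
```

Under these assumptions, 3983741 - 3981891 + 1 = 1851 bp, and 1851 / 3 = 617 codons, one of which is the stop codon, leaving 616 amino acids.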


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAAGT ATCGTTCCGC TACCACCACC CATGGCCGTA ATATGGCGGG GGCCCGCGCA 
CTGTGGCGCG CCACCGGGAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG
AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC
GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT
GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TGCCATCTCG CGAACTGATC
GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ATGCCATGGT CTGTATCTCC
AACTGCGACA AAATCACCCC TGGGATGTTG ATGGCTTCCC TGCGATTAAA TATTCCGGTG
ATCTTTGTTT CCGGCGGCCC GATGGAAGCC GGGAAAACCA AGCTGTCCGA TCGGATAATC
AAGCTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTCTC TGACTCCCAG
AGCGATCAGG TTGAACGTTC CGCCTGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC
GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGTTTGT CGCAGCCAGG CAACGGCTCG
CTGCTGGCAA CCCACGCGGA CCGTAAGCAG CTGTTCCTTA ATGCTGGTAA ACGCATTGTT
GAATTGACCA AACGTTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAATATCGCC
AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC
ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGATAAGC TCTCCCGCAA GGTTCCGCAG CTGTGTAAAG TTGCGCCGAG CACCCAGAAA
TACCATATGG AAGATGTTCA TCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT
CGTGCGGGGT TACTGAACCG TGATGTGAAA AACGTACTTG GCCTGACGTT GCCGCAAACG
CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCA
GGCCCGGCGG GCATTCGGAC TACACAGGCA TTCTCGCAGG ATTGCCGTTG GGATTCTCTC
GATGACGATC GCGCAAACGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGACGGC
GGCCTGGCGG TGCTCTACGG TAATTTCGCA GAAAACGGCT GCATCGTTAA AACCGCGGGC
GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGACGAC
GCGGTAGAAG CGATTCTCGG CGGTAAAGTT GTCGCCGGAG ATGTGGTAGT AATTCGCTAT
GAAGGCCCGA AAGGCGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA
TCAATGGGGC TCGGTAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGCGGCACC
TCTGGCCTTT CTATCGGGCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG
ATTGAAGACG GCGATCTTAT CGCTATCGAC ATTCCGAACC GTGGTATTCA GTTACAGGTA
AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CCCGGGGTGA CAAAGCCTGG
ACGCCGAAAA ACCGTGAACG TCAGGTTTCC TTTGCGCTGC GTGCCTACGC CAGCCTGGCG
ACCAGCGCCG ACAAAGGTGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDRII KLDLVDAMIQ GADPKVSDSQ
SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV
ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT
LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDSL DDDRANGCIR SLEHAYSKDG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL
IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQVS FALRAYASLA
TSADKGAVRD KSKLGG
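
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence and to the GC content field, the Python below strips the block spacing, computes a GC fraction, and translates codons up to the first stop. The function names are hypothetical, and the codon assignments used are those of the standard genetic code, which translation table 11 shares for amino acids (the two tables differ in permitted start codons, which this sketch ignores).

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks and the last in-frame blocks of the gene sequence above.
    print(translate("ATGCCTAAGT ATCGTTCCGC TACCACCACC"))  # MPKYRSATTT
    print(translate("AAATCGAAAC TGGGGGGTTA A"))           # KSKLGG
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison against the listed protein sequence and the reported GC content.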