Gene ECH74115_5205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5205 
SymbolilvD 
ID6967710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4850020 
End bp4851870 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content56% 
IMG OID643388870 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002273290 
Protein GI209397588 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAGT ACCGTTCCGC CACCACCACT CATGGTCGTA ATATGGCGGG TGCTCGTGCG 
CTGTGGCGCG CCACCGGAAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG
AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC
GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT
GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TGCCATCTCG CGAACTGATC
GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ACGCCATGGT CTGCATCTCT
AACTGTGACA AAATCACCCC GGGGATGCTG ATGGCTTCCC TGCGTCTGAA TATTCCGGTG
ATCTTTGTTT CCGGCGGCCC GATGGAGGCC GGGAAAACCA AACTGTCCGA TCAGATCATC
AAGCTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTATC TGACTCCCAG
AGCGATCAGG TTGAACGTTC CGCATGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC
GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGTCTGT CGCAGCCGGG CAACGGCTCG
CTGCTGGCAA CCCACTCTGA CCGTAAGCAG CTGTTCCTCA ATGCTGGTAA ACGCATTGTT
GAATTGACCA AACGTTACTA CGAGCAAAAC GACGAAAGTG CACTGCCCCG CAATATTGCT
AGCAAAGCGG CGTTTGAAAA TGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC
ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGATAAGC TTTCCCGCAA GGTTCCGCAG TTATGCAAAG TCGCGCCGAG CACCCAAAAA
TACCATATGG AAGATGTTCA CCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAGCTGGAT
CGTGCGGGGT TACTGAACCG TGATGTGAAA AACGTACTTG GCCTGACGTT GCCGCAAACG
CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCC
GGCCCTGCGG GTATCCGTAC CACCCAGGCA TTCTCGCAAG ATTGCCGTTG GGATACGCTG
GACGACGATC GCGCCAATGG CTGTATCCGC TCGCTGGAAC ACGCCTATAG CAAAGACGGC
GGCCTGGCGG TGCTCTACGG TAATTTCGCG GAAAACGGCT GCATCGTTAA AACTGCGGGC
GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGACGAC
GCGGTAGAAG CGATTCTCGG CGGTAAAGTT GTCGCCGGAG ATGTGGTAGT AATTCGCTAT
GAAGGCCCAA AAGGTGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA
TCAATGGGGC TCGGTAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGCGGCACC
TCTGGCCTTT CTATCGGCCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG
ATTGAAGACG GCGACCTGAT CGCCATCGAC ATTCCGAACC GTGGCATTCA GTTACAGGTA
AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CTCGGGGTGA CAAAGCCTGG
ACGCCGAAAA ATCGTGAACG TCAGATCTCC TTTGCCCTGC GTGCTTATGC CAGCCTGGCA
ACCAGCGCCG ACAAAGGCGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GADPKVSDSQ
SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHSDRKQ LFLNAGKRIV
ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT
LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDTL DDDRANGCIR SLEHAYSKDG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL
IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQIS FALRAYASLA
TSADKGAVRD KSKLGG