Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5205 |
Symbol | ilvD |
ID | 6967710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4850020 |
End bp | 4851870 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643388870 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002273290 |
Protein GI | 209397588 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAAGT ACCGTTCCGC CACCACCACT CATGGTCGTA ATATGGCGGG TGCTCGTGCG CTGTGGCGCG CCACCGGAAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TGCCATCTCG CGAACTGATC GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ACGCCATGGT CTGCATCTCT AACTGTGACA AAATCACCCC GGGGATGCTG ATGGCTTCCC TGCGTCTGAA TATTCCGGTG ATCTTTGTTT CCGGCGGCCC GATGGAGGCC GGGAAAACCA AACTGTCCGA TCAGATCATC AAGCTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTATC TGACTCCCAG AGCGATCAGG TTGAACGTTC CGCATGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGTCTGT CGCAGCCGGG CAACGGCTCG CTGCTGGCAA CCCACTCTGA CCGTAAGCAG CTGTTCCTCA ATGCTGGTAA ACGCATTGTT GAATTGACCA AACGTTACTA CGAGCAAAAC GACGAAAGTG CACTGCCCCG CAATATTGCT AGCAAAGCGG CGTTTGAAAA TGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT ATCGATAAGC TTTCCCGCAA GGTTCCGCAG TTATGCAAAG TCGCGCCGAG CACCCAAAAA TACCATATGG AAGATGTTCA CCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAGCTGGAT CGTGCGGGGT TACTGAACCG TGATGTGAAA AACGTACTTG GCCTGACGTT GCCGCAAACG CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCC GGCCCTGCGG GTATCCGTAC CACCCAGGCA TTCTCGCAAG ATTGCCGTTG GGATACGCTG GACGACGATC GCGCCAATGG CTGTATCCGC TCGCTGGAAC ACGCCTATAG CAAAGACGGC GGCCTGGCGG TGCTCTACGG TAATTTCGCG GAAAACGGCT GCATCGTTAA AACTGCGGGC GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGACGAC GCGGTAGAAG CGATTCTCGG CGGTAAAGTT GTCGCCGGAG ATGTGGTAGT AATTCGCTAT GAAGGCCCAA AAGGTGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA TCAATGGGGC TCGGTAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGCGGCACC TCTGGCCTTT CTATCGGCCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG ATTGAAGACG GCGACCTGAT CGCCATCGAC ATTCCGAACC GTGGCATTCA GTTACAGGTA AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CTCGGGGTGA CAAAGCCTGG ACGCCGAAAA ATCGTGAACG TCAGATCTCC TTTGCCCTGC GTGCTTATGC CAGCCTGGCA ACCAGCGCCG ACAAAGGCGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
|
Protein sequence | MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GADPKVSDSQ SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHSDRKQ LFLNAGKRIV ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDTL DDDRANGCIR SLEHAYSKDG GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQIS FALRAYASLA TSADKGAVRD KSKLGG
|
| |