Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4137 |
Symbol | ilvD |
ID | 6144498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4233268 |
End bp | 4235118 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618960 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001746092 |
Protein GI | 170682714 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.129603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAAGT ACCGTTCCGC CACCACCACT CATGGTCGTA ATATGGCGGG TGCTCGTGCG CTGTGGCGCG CCACCGGAAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCGC TGCCATCTCG CGAACTGATC GCTGATTCCG TTGAGTATAT GGTCAATGCT CACTGCGCCG ACGCCATGGT CTGCATCTCT AACTGCGACA AAATCACCCC GGGGATGCTG ATGGCTTCCC TGCGCCTGAA TATTCCGGTG ATCTTTGTTT CCGGCGGCCC GATGGAGGCC GGGAAAACCA AACTGTCCGA TCAGATCATC AAGCTCGATC TGGTTGATGC GATGATCCAG GGGGCAGACC CGAAAGTTTC TGACTCCCAA AGCGATCAGG TTGAACGTTC CGCATGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGCCTGT CGCAGCCGGG CAACGGCTCG CTGCTGGCAA CTCACGCCGA CCGTAAGCAG CTGTTCCTCA ATGCCGGTAA ACGCATTGTT GAATTGACCA AACGTTACTA CGAGCAAAAT GACGAAAGTG CACTGCCGCG TAATATCGCC AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC ACTGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT ATCGACAAAC TTTCGCGTAA AGTCCCGCAG TTATGCAAAG TTGCGCCGAG CACCCAGAAA TACCATATGG AAGATGTTCA TCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT CGCGCGGGGT TACTGAACCG TGATGTGAAA AACGTGCTTG GCCTGACGTT GCCACAAACG CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACTCGG TAAAAAATAT GTTCCGCGCT GGCCCTGCGG GCATTCGTAC TACACAGGCA TTCTCGCAGG ATTGCCGTTG GGATTCTCTC GATGACGATC GCGCCAATGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGAAGGT GGTCTGGCGG TGCTGTACGG TAATTTCGCA GAAAACGGCT GCATCGTTAA AACCGCGGGC GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGATGAC GCGGTAGAAG CGATTCTCGG CGGCAAAGTT GTCGCTGGCG ATGTAGTGGT GATCCGCTAT GAAGGCCCGA AAGGCGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA TCAATGGGTC TCGGTAAAGC CTGCGCGCTG ATCACCGACG GACGCTTCTC CGGCGGCACC TCTGGCCTTT CTATCGGCCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG ATTGAAGACG GCGACCTGAT CGCCATCGAC ATTCCGAACC GTGGTATTCA GTTACAGGTA AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CTCGTGGCGA CAAAGCCTGG ACGCCGAAAA ACCGTGAACG CCAGGTTTCC TTTGCGCTGC GTGCCTACGC CAGCCTGGCA ACCAGCGCCG ACAAAGGTGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
|
Protein sequence | MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GADPKVSDSQ SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT LEQYDVMLTQ DDSVKNMFRA GPAGIRTTQA FSQDCRWDSL DDDRANGCIR SLEHAYSKEG GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQVS FALRAYASLA TSADKGAVRD KSKLGG
|
| |