Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_4231 |
Symbol | |
ID | 6067855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4674656 |
End bp | 4676506 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641603662 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001727154 |
Protein GI | 170022200 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.604372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAAGT ATCGTTCCGC TACCACCACC CATGGCCGTA ATATGGCGGG GGCCCGCGCA CTGTGGCGCG CCACCGGGAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TGCCATCTCG CGAACTGATC GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ATGCCATGGT CTGTATCTCC AACTGCGACA AAATCACCCC TGGGATGTTG ATGGCTTCCC TGCGATTAAA TATTCCGGTG ATCTTTGTTT CCGGCGGCCC GATGGAAGCC GGGAAAACCA AGCTGTCCGA TCGGATAATC AAGCTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTCTC TGACTCCCAG AGCGATCAGG TTGAACGTTC CGCCTGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGTTTGT CGCAGCCAGG CAACGGCTCG CTGCTGGCAA CCCACGCGGA CCGTAAGCAG CTGTTCCTTA ATGCTGGTAA ACGCATTGTT GAATTGACCA AACGTTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAATATCGCC AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT ATCGATAAGC TCTCCCGCAA GGTTCCGCAG CTGTGTAAAG TTGCGCCGAG CACCCAGAAA TACCATATGG AAGATGTTCA TCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT CGTGCGGGGT TACTGAACCG TGATGTGAAA AACGTACTTG GCCTGACGTT GCCGCAAACG CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCA GGCCCGGCGG GCATTCGGAC TACACAGGCA TTCTCGCAGG ATTGCCGTTG GGATTCTCTC GATGACGATC GCGCAAACGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGACGGC GGCCTGGCGG TGCTCTACGG TAATTTCGCA GAAAACGGCT GCATCGTTAA AACCGCGGGC GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGACGAC GCGGTAGAAG CGATTCTCGG CGGTAAAGTT GTCGCCGGAG ATGTGGTAGT AATTCGCTAT GAAGGCCCGA AAGGCGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA TCAATGGGGC TCGGTAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGCGGCACC TCTGGCCTTT CTATCGGGCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG ATTGAAGACG GCGATCTTAT CGCTATCGAC ATTCCGAACC GTGGTATTCA GTTACAGGTA AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CCCGGGGTGA CAAAGCCTGG ACGCCGAAAA ACCGTGAACG TCAGGTTTCC TTTGCGCTGC GTGCCTACGC CAGCCTGGCG ACCAGCGCCG ACAAAGGTGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
|
Protein sequence | MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDRII KLDLVDAMIQ GADPKVSDSQ SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDSL DDDRANGCIR SLEHAYSKDG GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQVS FALRAYASLA TSADKGAVRD KSKLGG
|
| |