Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_4205 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 4559142 |
End bp | 4560992 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | dihydroxy-acid dehydratase |
Protein accession | ACX41805 |
Protein GI | 260451383 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00333285 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAGT ACCGTTCCGC CACCACCACT CATGGTCGTA ATATGGCGGG TGCTCGTGCG CTGTGGCGCG CCACCGGAAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TGCCATCTCG CGAACTGATC GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ACGCCATGGT CTGCATCTCT AACTGCGACA AAATCACCCC GGGGATGCTG ATGGCTTCCC TGCGCCTGAA TATTCCGGTG ATCTTTGTTT CCGGCGGCCC GATGGAGGCC GGGAAAACCA AACTTTCCGA TCAGATCATC AAGCTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTATC TGACTCCCAG AGCGATCAGG TTGAACGTTC CGCGTGTCCG ACCTGCGGTT CCTGCTCCGG GATGTTTACC GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGCCTGT CGCAGCCGGG CAACGGCTCG CTGCTGGCAA CCCACGCCGA CCGTAAGCAG CTGTTCCTTA ATGCTGGTAA ACGCATTGTT GAATTGACCA AACGTTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAATATCGCC AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT ATCGATAAGC TTTCCCGCAA GGTTCCACAG CTGTGTAAAG TTGCGCCGAG CACCCAGAAA TACCATATGG AAGATGTTCA CCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT CGCGCGGGGT TACTGAACCG TGATGTGAAA AACGTACTTG GCCTGACGTT GCCGCAAACG CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCA GGTCCTGCAG GCATTCGTAC CACACAGGCA TTCTCGCAAG ATTGCCGTTG GGATACGCTG GACGACGATC GCGCCAATGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGACGGC GGCCTGGCGG TGCTCTACGG TAACTTTGCG GAAAACGGCT GCATCGTGAA AACGGCAGGC GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGACGAT GCGGTAGAAG CGATTCTCGG CGGTAAAGTT GTCGCCGGAG ATGTGGTAGT AATTCGCTAT GAAGGCCCGA AAGGCGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA TCAATGGGTC TCGGCAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGTGGCACC TCTGGTCTTT CCATCGGCCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG ATTGAAGATG GTGACCTGAT CGCTATCGAC ATCCCGAACC GTGGCATTCA GTTACAGGTA AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGACG CTCGAGGTGA CAAAGCCTGG ACGCCGAAAA ATCGTGAACG TCAGGTCTCC TTTGCCCTGC GTGCTTATGC CAGCCTGGCA ACCAGCGCCG ACAAAGGCGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
|
Protein sequence | MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GADPKVSDSQ SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDTL DDDRANGCIR SLEHAYSKDG GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL IEDGDLIAID IPNRGIQLQV SDAELAARRE AQDARGDKAW TPKNRERQVS FALRAYASLA TSADKGAVRD KSKLGG
|
| |