Gene EcDH1_4205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4205 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4559142 
End bp4560992 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content56% 
IMG OID 
Productdihydroxy-acid dehydratase 
Protein accessionACX41805 
Protein GI260451383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00333285 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAGT ACCGTTCCGC CACCACCACT CATGGTCGTA ATATGGCGGG TGCTCGTGCG 
CTGTGGCGCG CCACCGGAAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG
AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC
GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT
GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TGCCATCTCG CGAACTGATC
GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ACGCCATGGT CTGCATCTCT
AACTGCGACA AAATCACCCC GGGGATGCTG ATGGCTTCCC TGCGCCTGAA TATTCCGGTG
ATCTTTGTTT CCGGCGGCCC GATGGAGGCC GGGAAAACCA AACTTTCCGA TCAGATCATC
AAGCTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTATC TGACTCCCAG
AGCGATCAGG TTGAACGTTC CGCGTGTCCG ACCTGCGGTT CCTGCTCCGG GATGTTTACC
GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGCCTGT CGCAGCCGGG CAACGGCTCG
CTGCTGGCAA CCCACGCCGA CCGTAAGCAG CTGTTCCTTA ATGCTGGTAA ACGCATTGTT
GAATTGACCA AACGTTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAATATCGCC
AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC
ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGATAAGC TTTCCCGCAA GGTTCCACAG CTGTGTAAAG TTGCGCCGAG CACCCAGAAA
TACCATATGG AAGATGTTCA CCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT
CGCGCGGGGT TACTGAACCG TGATGTGAAA AACGTACTTG GCCTGACGTT GCCGCAAACG
CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCA
GGTCCTGCAG GCATTCGTAC CACACAGGCA TTCTCGCAAG ATTGCCGTTG GGATACGCTG
GACGACGATC GCGCCAATGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGACGGC
GGCCTGGCGG TGCTCTACGG TAACTTTGCG GAAAACGGCT GCATCGTGAA AACGGCAGGC
GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGACGAT
GCGGTAGAAG CGATTCTCGG CGGTAAAGTT GTCGCCGGAG ATGTGGTAGT AATTCGCTAT
GAAGGCCCGA AAGGCGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA
TCAATGGGTC TCGGCAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGTGGCACC
TCTGGTCTTT CCATCGGCCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG
ATTGAAGATG GTGACCTGAT CGCTATCGAC ATCCCGAACC GTGGCATTCA GTTACAGGTA
AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGACG CTCGAGGTGA CAAAGCCTGG
ACGCCGAAAA ATCGTGAACG TCAGGTCTCC TTTGCCCTGC GTGCTTATGC CAGCCTGGCA
ACCAGCGCCG ACAAAGGCGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GADPKVSDSQ
SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV
ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT
LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDTL DDDRANGCIR SLEHAYSKDG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL
IEDGDLIAID IPNRGIQLQV SDAELAARRE AQDARGDKAW TPKNRERQVS FALRAYASLA
TSADKGAVRD KSKLGG