Gene ECD_03649 details


Gene Information

Locus tag: ECD_03649
Symbol: ilvD
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 3843720
End bp: 3845570
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: dihydroxy-acid dehydratase
Protein accession: ACT45443
Protein GI: 253979773
COG category:
COG ID:
TIGRFAM ID:
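
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the record; coordinates are ASSUMED 1-based and inclusive.
start_bp = 3843720
end_bp = 3845570

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # codon count, including the terminal stop
protein_length = codons - 1           # the stop codon encodes no residue

print(gene_length, protein_length)    # prints: 1851 616
```

Under those assumptions the result agrees with the listed Gene Length (1851 bp) and Protein Length (616 aa).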


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000570599
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAAGT ACCGTTCCGC CACCACCACC CATGGCCGTA ATATGGCGGG GGCCCGCGCA 
CTGTGGCGCG CCACCGGGAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG
AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC
GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT
GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TGCCATCTCG CGAACTGATC
GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ATGCCATGGT CTGTATCTCC
AACTGCGACA AAATCACCCC TGGGATGTTG ATGGCTTCCC TGCGATTAAA TATTCCGGTG
ATCTTTGTTT CCGGCGGCCC GATGGAAGCC GGGAAAACCA AGCTGTCCGA TCGGATAATC
AAGCTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTCTC TGACTCCCAG
AGCGATCAGG TTGAACGTTC CGCCTGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC
GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGTTTGT CGCAGCCAGG CAACGGCTCG
CTGCTGGCAA CCCACGCGGA CCGTAAGCAG CTGTTCCTTA ATGCTGGTAA ACGCATTGTT
GAATTGACCA AACGTTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAATATCGCC
AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC
ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGATAAGC TCTCCCGCAA GGTTCCGCAG CTGTGTAAAG TTGCGCCGAG CACCCAGAAA
TACCATATGG AAGATGTTCA TCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT
CGTGCGGGGT TACTGAACCG TGATGTGAAA AACGTACTTG GCCTGACGTT GCCGCAAACG
CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCA
GGCCCGGCGG GCATTCGGAC TACACAGGCA TTCTCGCAGG ATTGCCGTTG GGATTCTCTC
GATGACGATC GCGCAAACGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGACGGC
GGCCTGGCGG TGCTCTACGG TAATTTCGCA GAAAACGGCT GCATCGTTAA AACCGCGGGC
GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGACGAC
GCGGTAGAAG CGATTCTCGG CGGTAAAGTT GTCGCCGGAG ATGTGGTAGT AATTCGCTAT
GAAGGCCCGA AAGGCGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA
TCAATGGGGC TCGGTAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGCGGCACC
TCTGGCCTTT CTATCGGGCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG
ATTGAAGACG GCGATCTTAT CGCTATCGAC ATTCCGAACC GTGGTATTCA GTTACAGGTA
AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CCCGGGGTGA CAAAGCCTGG
ACGCCGAAAA ACCGTGAACG TCAGGTTTCC TTTGCGCTGC GTGCCTACGC CAGCCTGGCG
ACCAGCGCCG ACAAAGGTGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDRII KLDLVDAMIQ GADPKVSDSQ
SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV
ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT
LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDSL DDDRANGCIR SLEHAYSKDG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL
IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQVS FALRAYASLA
TSADKGAVRD KSKLGG
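
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool that produced this record. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code for elongation (the two tables differ in their permitted start codons), and the helper `translate` is a hypothetical name introduced here. Only the first and last lines of the gene sequence are used as input.

```python
# Compact codon table in TCAG order; amino acid assignments of the standard code,
# ASSUMED here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":               # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 30 bases and the final line of the gene sequence, copied from the record.
first_bases = "ATGCCTAAGT ACCGTTCCGC CACCACCACC"
last_line = "ACCAGCGCCG ACAAAGGTGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A"

print(translate(first_bases))  # prints: MPKYRSATTT
print(translate(last_line))    # prints: TSADKGAVRDKSKLGG
```

Both outputs match the corresponding ends of the protein sequence listed above. The final line of the gene sequence is in frame because every preceding line holds 60 bases, a multiple of three.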