Gene EcSMS35_4137 details

Gene Information

Locus tag: EcSMS35_4137
Symbol: ilvD
ID: 6144498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4233268
End bp: 4235118
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 56%
IMG OID: 641618960
Product: dihydroxy-acid dehydratase
Protein accession: YP_001746092
Protein GI: 170682714
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
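
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information record.
START_BP = 4233268
END_BP = 4235118


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1851 616
```

Under these assumptions the result agrees with the Gene Length (1851 bp) and Protein Length (616 aa) fields of the record.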


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.129603
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTAAGT ACCGTTCCGC CACCACCACT CATGGTCGTA ATATGGCGGG TGCTCGTGCG 
CTGTGGCGCG CCACCGGAAT GACCGACGCC GATTTCGGTA AGCCGATTAT CGCGGTTGTG
AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC
GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT
GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCGC TGCCATCTCG CGAACTGATC
GCTGATTCCG TTGAGTATAT GGTCAATGCT CACTGCGCCG ACGCCATGGT CTGCATCTCT
AACTGCGACA AAATCACCCC GGGGATGCTG ATGGCTTCCC TGCGCCTGAA TATTCCGGTG
ATCTTTGTTT CCGGCGGCCC GATGGAGGCC GGGAAAACCA AACTGTCCGA TCAGATCATC
AAGCTCGATC TGGTTGATGC GATGATCCAG GGGGCAGACC CGAAAGTTTC TGACTCCCAA
AGCGATCAGG TTGAACGTTC CGCATGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC
GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGCCTGT CGCAGCCGGG CAACGGCTCG
CTGCTGGCAA CTCACGCCGA CCGTAAGCAG CTGTTCCTCA ATGCCGGTAA ACGCATTGTT
GAATTGACCA AACGTTACTA CGAGCAAAAT GACGAAAGTG CACTGCCGCG TAATATCGCC
AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC
ACTGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGACAAAC TTTCGCGTAA AGTCCCGCAG TTATGCAAAG TTGCGCCGAG CACCCAGAAA
TACCATATGG AAGATGTTCA TCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT
CGCGCGGGGT TACTGAACCG TGATGTGAAA AACGTGCTTG GCCTGACGTT GCCACAAACG
CTGGAACAAT ACGACGTTAT GCTGACCCAG GATGACTCGG TAAAAAATAT GTTCCGCGCT
GGCCCTGCGG GCATTCGTAC TACACAGGCA TTCTCGCAGG ATTGCCGTTG GGATTCTCTC
GATGACGATC GCGCCAATGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGAAGGT
GGTCTGGCGG TGCTGTACGG TAATTTCGCA GAAAACGGCT GCATCGTTAA AACCGCGGGC
GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCGAAAG TGTACGAAAG CCAGGATGAC
GCGGTAGAAG CGATTCTCGG CGGCAAAGTT GTCGCTGGCG ATGTAGTGGT GATCCGCTAT
GAAGGCCCGA AAGGCGGTCC GGGGATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA
TCAATGGGTC TCGGTAAAGC CTGCGCGCTG ATCACCGACG GACGCTTCTC CGGCGGCACC
TCTGGCCTTT CTATCGGCCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGCCTG
ATTGAAGACG GCGACCTGAT CGCCATCGAC ATTCCGAACC GTGGTATTCA GTTACAGGTA
AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CTCGTGGCGA CAAAGCCTGG
ACGCCGAAAA ACCGTGAACG CCAGGTTTCC TTTGCGCTGC GTGCCTACGC CAGCCTGGCA
ACCAGCGCCG ACAAAGGTGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GADPKVSDSQ
SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV
ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT
LEQYDVMLTQ DDSVKNMFRA GPAGIRTTQA FSQDCRWDSL DDDRANGCIR SLEHAYSKEG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL
IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQVS FALRAYASLA
TSADKGAVRD KSKLGG
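
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in a few lines. This is a minimal illustration, not the method the source database uses: the helper names are hypothetical, the codon-to-amino-acid assignments are those of the standard genetic code (which translation table 11 shares for elongation codons), and alternative start codons are not handled because this gene begins with ATG.

```python
# Codon table built in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence above.
print(translate("ATGCCTAAGT ACCGTTCCGC"))  # MPKYRS
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field of the record.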