Gene SeHA_C4235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4235 
SymbolilvD 
ID6489694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4121876 
End bp4123726 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content58% 
IMG OID642744328 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002047926 
Protein GI194451395 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.866608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAGT ACCGTTCCGC CACCACCACC CATGGTCGTA ATATGGCGGG TGCCCGCGCG 
CTGTGGCGCG CCACCGGAAT GACCGACAGT GATTTTGGCA AACCGATTAT CGCCGTGGTG
AACTCATTCA CTCAGTTTGT GCCGGGTCAC GTTCATCTGC GCGATCTCGG TAAGCTGGTC
GCCGAACAGA TTGAAGCTTC CGGCGGGGTG GCGAAAGAGT TCAACACTAT TGCCGTGGAT
GACGGGATTG CCATGGGGCA CGGGGGTATG CTCTATTCAC TGCCGTCGCG CGAGCTGATC
GCCGACTCCG TGGAGTACAT GGTGAACGCT CACTGCGCTG ACGCGATGGT GTGTATCTCC
AACTGCGACA AAATCACCCC AGGGATGCTC ATGGCCTCGC TGCGCCTGAA TATTCCGGTG
ATCTTTGTCT CCGGCGGCCC GATGGAAGCC GGGAAAACCA AGCTTTCAGA CAAAATTATC
AAGCTGGATC TGGTTGATGC CATGATTCAG GGAGCGGACC CGAAAGTCTC TGACGATCAA
AGTAACCAGG TTGAACGCTC CGCCTGTCCA ACCTGCGGCT CCTGCTCCGG CATGTTTACC
GCTAACTCCA TGAATTGCCT GACCGAAGCA CTGGGCCTGT CGCAGCCGGG CAACGGTTCG
CTGCTGGCGA CCCATGCTGA CCGGAAGCAG TTGTTCCTCA ATGCCGGTAA GCGGATTGTT
GAACTGACTA AACGCTATTA CGAGCAGGAT GACGAAAGTG CACTGCCGCG TAACATCGCC
AGCAAAGCCG CGTTTGAAAA CGCCATGACG CTGGATATTG CGATGGGCGG TTCGACCAAT
ACCGTTCTGC ATCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGACAAGC TGTCCCGCAA GGTGCCGCAG CTGTGTAAAG TGGCGCCAAG TACCCAGAAA
TACCATATGG AAGACGTTCA CCGTGCCGGC GGTGTGCTGG GTATTTTAGG CGAGCTGGAT
CGCGCCGGGC TGCTGAACCG CAACGTGAAA AACGTATTAG GCCTGACGCT GCCGCAAACG
CTGGAACAGT ACGACATCAC GGTTACGCAG GACGAAGCGG TTAAAAAAAT GTTCCGTGCT
GGCCCTGCCG GTATCCGTAC TACCCAGGCG TTCTCTCAGG ATTGTCGCTG GGATTCGCTG
GATGACGACC GCGCAGCGGG TTGCATCCGC TCGCTGGAAT ATGCCTATAG CAAAGACGGC
GGTCTGGCGG TGCTGTATGG CAACTTCGCC GAAAACGGCT GCATTGTGAA AACCGCAGGC
GTGGATGACA GCATCCTTAA ATTTACCGGC CCGGCTAAAG TGTATGAAAG CCAGGACGAC
GCGGTAGAGG CGATTCTCGG CGGCAAAGTA GTGGAAGGCG ATGTAGTTGT GATCCGCTAC
GAAGGGCCGA AAGGCGGGCC GGGAATGCAG GAAATGCTCT ATCCGACCAG TTTCCTGAAG
TCGATGGGGC TGGGCAAAGC CTGCGCGCTC ATCACCGATG GGCGTTTCTC CGGCGGTACT
TCGGGTCTTT CCATCGGCCA CGTCTCGCCG GAAGCGGCCA GCGGCGGCAC TATTGCGCTG
ATTGAAGATG GCGACACTAT TGCGATTGAT ATCCCGAACC GCAGCATTCA GTTGCAGTTG
AGCGAGGCTG AAATCGCCGC ACGCCGCGAG GCGCAGGAAG CTCGTGGCGA CAAAGCCTGG
ACGCCTAAAA ATCGTCAGCG TCAGGTTTCG TTTGCCCTGC GTGCCTACGC CAGCCTGGCG
ACCAGCGCCG ATAAAGGCGC GGTGCGCGAT AAATCGAAAC TGGGAGGTTG A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDS DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEASGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDKII KLDLVDAMIQ GADPKVSDDQ
SNQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV
ELTKRYYEQD DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVLGILGELD RAGLLNRNVK NVLGLTLPQT
LEQYDITVTQ DEAVKKMFRA GPAGIRTTQA FSQDCRWDSL DDDRAAGCIR SLEYAYSKDG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VEGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGTIAL
IEDGDTIAID IPNRSIQLQL SEAEIAARRE AQEARGDKAW TPKNRQRQVS FALRAYASLA
TSADKGAVRD KSKLGG