Gene SeD_A4294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4294 
SymbolilvD 
ID6875362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4140580 
End bp4142430 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content57% 
IMG OID642787223 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002217843 
Protein GI198243549 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAGT ACCGTTCCGC CACCACCACC CATGGTCGTA ATATGGCGGG TGCCCGCGCG 
CTGTGGCGCG CCACCGGAAT GACCGACAGT GATTTTGGCA AACCGATTAT CGCCGTGGTG
AACTCATTCA CTCAGTTTGT GCCGGGTCAC GTTCATCTGC GCGATCTCGG TAAGCTGGTC
GCCGAACAGA TTGAAGCTTC CGGCGGGGTG GCGAAAGAGT TCAACACTAT TGCCGTGGAT
GACGGGATTG CCATGGGGCA CGGGGGTATG CTCTATTCAC TGCCGTCGCG CGAGCTGATC
GCCGACTCCG TTGAGTACAT GGTGAACGCT CACTGCGCTG ACGCGATGGT GTGTATCTCC
AACTGCGACA AAATCACCCC AGGGATGCTC ATGGCCTCGC TGCGCCTGAA TATTCCGGTG
ATCTTTGTCT CTGGCGGACC GATGGAAGCC GGGAAAACCA AGCTTTCAGA CAAAATTATC
AAGCTGGATC TGGTTGATGC CATGATTCAG GGAGCGGACC CGAAAGTCTC TGACGATCAA
AGTAACCAGG TTGAACGCTC CGCCTGTCCA ACCTGCGGCT CCTGCTCCGG CATGTTTACC
GCTAACTCCA TGAATTGCCT GACCGAAGCG CTGGGCCTGT CGCAGCCGGG CAACGGCTCG
CTGCTGGCAA CTCACGCTGA CCGTAAGCAG TTGTTCCTCA ATGCCGGTAA GCGGATTGTT
GAACTGACTA AACGCTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAACATCGCC
AGCAAAGCCG CGTTTGAAAA CGCGATGACG CTGGATATCG CGATGGGCGG TTCGACCAAC
ACCGTTCTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGACAAGC TGTCCCGCAA GGTGCCGCAG CTGTGTAAAG TGGCGCCAAG TACCCAGAAA
TATCATATGG AAGATGTTCA CCGTGCCGGC GGTGTGCTGG GTATTTTAGG CGAGCTGGAT
CGCGCCGGGC TGCTGAACTG CAACGTGAAA AACGTATTAG GCCTGACGCT GCCGCAAACG
CTGGAACAGT ACGACATCAC GGTTACGCAG GACGAAGCGG TTAAAAAAAT GTTCCGTGCT
GGCCCTGCCG GTATCCGTAC TACCCAGGCG TTCTCGCAGG ATTGTCGCTG GGATTCGCTG
GATGACGACC GCGCAGCGGG TTGCATCCGC TCGCTGGAAT ATGCCTATAG CAAAGACGGC
GGTCTGGCGG TGCTGTATGG CAACTTCGCC GAAAACGGCT GCATTGTGAA AACCGCAGGC
GTGGATGACA GCATCCTTAA ATTTACCGGC CCGGCTAAAG TGTATGAAAG CCAGGACGAC
GCGGTAGAGG CGATTCTCGG CGGCAAAGTA GTGGAAGGCG ATGTAGTCGT GATCCGCTAC
GAAGGGCCGA AAGGCGGGCC GGGAATGCAG GAAATGCTCT ATCCGACCAG TTTCCTGAAG
TCGATGGGGC TGGGCAAAGC CTGCGCGCTC ATCACCGATG GGCGTTTCTC CGGCGGTACT
TCGGGTCTTT CCATCGGCCA CGTCTCGCCG GAAGCGGCCA GCGGCGGCAC TATTGCGTTG
ATTGAAGATG GCGACACTAT TGCGATTGAT ATCCCGAACC GCAGCATTCA GTTGCAGTTG
AACGAGGCTG AAATCGCCGC ACGCCGTGAG GCGCAGGAGG CTCGTGGCGA CAAAGCCTGG
ACGCCGAAAA ATCGTCAGCG TCAGGTTTCG TTTGCCCTGC GTGCCTACGC CAGCCTGGCG
ACCAGCGCCG ATAAAGGCGC GGTGCGCGAT AAATCGAAAC TGGGAGGTTG A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDS DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEASGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDKII KLDLVDAMIQ GADPKVSDDQ
SNQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV
ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVLGILGELD RAGLLNCNVK NVLGLTLPQT
LEQYDITVTQ DEAVKKMFRA GPAGIRTTQA FSQDCRWDSL DDDRAAGCIR SLEYAYSKDG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VEGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGTIAL
IEDGDTIAID IPNRSIQLQL NEAEIAARRE AQEARGDKAW TPKNRQRQVS FALRAYASLA
TSADKGAVRD KSKLGG