Gene SNSL254_A4186 details


Gene Information

Locus tag: SNSL254_A4186
Symbol: ilvD
ID: 6483511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4078280
End bp: 4080130
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 57%
IMG OID: 642739440
Product: dihydroxy-acid dehydratase
Protein accession: YP_002043143
Protein GI: 194442868
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
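
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 4078280
END_BP = 4080130


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete, unspliced CDS.

    Assumes the CDS length is a multiple of three and that the
    final codon is a stop codon, which is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, "bp")                  # 1851 bp
    print(protein_length(length_bp), "aa")  # 616 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.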


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 94
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTAAGT ACCGTTCCGC CACCACCACC CATGGTCGTA ATATGGCGGG TGCCCGCGCG 
CTGTGGCGCG CCACCGGAAT GACCGACAGT GATTTTGGCA AACCGATTAT CGCCGTGGTG
AACTCATTCA CTCAGTTTGT GCCGGGTCAC GTTCATCTGC GCGATCTCGG TAAGCTGGTC
GCCGAACAGA TTGAAGCTTC CGGCGGGGTG GCGAAAGAGT TCAACACTAT TGCCGTGGAT
GACGGGATTG CCATGGGGCA CGGGGGTATG CTCTATTCAC TGCCGTCGCG CGAGCTGATC
GCCGACTCCG TTGAGTACAT GGTGAACGCT CACTGCGCTG ACGCGATGGT GTGTATCTCC
AACTGCGACA AAATCACCCC AGGGATGCTC ATGGCCTCGC TGCGCCTGAA TATTCCGGTG
ATCTTTGTCT CCGGCGGCCC GATGGAAGCC GGGAAAACCA AGCTTTCAGA CAAAATTATC
AAGCTGGATC TGGTTGATGC CATGATTCAG GGAGCGGACC CGAAAGTCTC TGACGATCAA
AGTAACCAGG TTGAACGCTC CGCCTGTCCA ACCTGCGGCT CCTGCTCCGG CATGTTTACC
GCTAACTCCA TGAATTGCCT GACCGAAGCG CTGGGCCTGT CGCAGCCGGG CAACGGCTCG
CTGCTGGCAA CTCACGCTGA CCGTAAGCAG TTGTTCCTCA ATGCCGGTAA GCGGATTGTT
GAACTGACTA AACGCTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAACATCGCC
AGCAAAGCCG CGTTTGAAAA CGCGATGACG CTGGATATCG CGATGGGCGG TTCGACCAAC
ACCGTTCTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT
ATCGACAAGC TGTCCCGCAA GGTGCCGCAG CTGTGTAAAG TGGCGCCAAG TACCCAGAAA
TACCATATGG AAGATGTTCA CCGTGCCGGC GGTGTGCTGG GTATTTTAGG CGAGCTGGAT
CGCGCCGGGC TGCTGAACCG CAACGTGAAA AACGTATTAG GCCTGACGCT GCCGCAAACG
CTGGAACAGT ACGACATCAC GGTTACGCAG GACGAAGCGG TTAAAAAAAT GTTCCGTGCT
GGCCCTGCCG GTATCCGTAC TACTCAGGCG TTCTCGCAGG ATTGTCGCTG GGATTCGCTG
GATGACGACC GCGCAGCGGG TTGCATCCGC TCGCTGGAAT ATGCCTATAG CAAAGACGGC
GGTCTGGCGG TGCTGTATGG CAACTTCGCC GAAAACGGCT GCATCGTTAA AACCGCGGGC
GTCGATGACA GCATCCTTAA ATTCACCGGC CCGGCGAAAG TGTATGAAAG TCAGGATGAG
GCGGTAGAGG CGATTCTCGG CGGCAAAGTA GTGGAAGGCG ATGTAGTCGT GATCCGCTAC
GAAGGGCCGA AAGGCGGGCC GGGAATGCAG GAAATGCTCT ATCCGACCAG TTTCCTGAAG
TCGATGGGGC TGGGCAAAGC CTGCGCGCTC ATCACCGATG GGCGTTTTTC CGGCGGGACT
TCCGGTCTCT CTATTGGCCA CGTTTCGCCG GAAGCGGCCA GCGGCGGCAC TATTGCGTTG
ATTGAAGATG GCGACACTAT TGCGATTGAT ATCCCGAACC GCAGCATTCA GTTGCAGTTG
AGTGAGGCTG AAATCGCCGC ACGCCGCGAG GCGCAGGAAG CTCGTGGCGA CAAAGCCTGG
ACGCCGAAAA ATCGTCAGCG TCAGGTTTCG TTTGCCCTGC GTGCCTACGC CAGCCTGGCG
ACCAGCGCCG ATAAAGGCGC GGTGCGCGAT AAATCGAAAC TGGGAGGTTG A
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDS DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEASGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDKII KLDLVDAMIQ GADPKVSDDQ
SNQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV
ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVLGILGELD RAGLLNRNVK NVLGLTLPQT
LEQYDITVTQ DEAVKKMFRA GPAGIRTTQA FSQDCRWDSL DDDRAAGCIR SLEYAYSKDG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDE AVEAILGGKV VEGDVVVIRY
EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGTIAL
IEDGDTIAID IPNRSIQLQL SEAEIAARRE AQEARGDKAW TPKNRQRQVS FALRAYASLA
TSADKGAVRD KSKLGG
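
The relationship between the two sequences can be illustrated by translating the first 30 bases of the gene sequence. This is a minimal sketch, not the pipeline that produced the record: it uses the standard codon assignments, which translation table 11 shares with the standard code for sense codons (the tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Build the standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated 10-base groups
    used in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        residues.append(residue)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence.
    print(translate("ATGCCTAAGT ACCGTTCCGC CACCACCACC"))  # MPKYRSATTT
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene sequence (GGA GGT TGA) give the terminal GG followed by a stop.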