Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4294 |
Symbol | ilvD |
ID | 6875362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4140580 |
End bp | 4142430 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642787223 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002217843 |
Protein GI | 198243549 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAAGT ACCGTTCCGC CACCACCACC CATGGTCGTA ATATGGCGGG TGCCCGCGCG CTGTGGCGCG CCACCGGAAT GACCGACAGT GATTTTGGCA AACCGATTAT CGCCGTGGTG AACTCATTCA CTCAGTTTGT GCCGGGTCAC GTTCATCTGC GCGATCTCGG TAAGCTGGTC GCCGAACAGA TTGAAGCTTC CGGCGGGGTG GCGAAAGAGT TCAACACTAT TGCCGTGGAT GACGGGATTG CCATGGGGCA CGGGGGTATG CTCTATTCAC TGCCGTCGCG CGAGCTGATC GCCGACTCCG TTGAGTACAT GGTGAACGCT CACTGCGCTG ACGCGATGGT GTGTATCTCC AACTGCGACA AAATCACCCC AGGGATGCTC ATGGCCTCGC TGCGCCTGAA TATTCCGGTG ATCTTTGTCT CTGGCGGACC GATGGAAGCC GGGAAAACCA AGCTTTCAGA CAAAATTATC AAGCTGGATC TGGTTGATGC CATGATTCAG GGAGCGGACC CGAAAGTCTC TGACGATCAA AGTAACCAGG TTGAACGCTC CGCCTGTCCA ACCTGCGGCT CCTGCTCCGG CATGTTTACC GCTAACTCCA TGAATTGCCT GACCGAAGCG CTGGGCCTGT CGCAGCCGGG CAACGGCTCG CTGCTGGCAA CTCACGCTGA CCGTAAGCAG TTGTTCCTCA ATGCCGGTAA GCGGATTGTT GAACTGACTA AACGCTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAACATCGCC AGCAAAGCCG CGTTTGAAAA CGCGATGACG CTGGATATCG CGATGGGCGG TTCGACCAAC ACCGTTCTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT ATCGACAAGC TGTCCCGCAA GGTGCCGCAG CTGTGTAAAG TGGCGCCAAG TACCCAGAAA TATCATATGG AAGATGTTCA CCGTGCCGGC GGTGTGCTGG GTATTTTAGG CGAGCTGGAT CGCGCCGGGC TGCTGAACTG CAACGTGAAA AACGTATTAG GCCTGACGCT GCCGCAAACG CTGGAACAGT ACGACATCAC GGTTACGCAG GACGAAGCGG TTAAAAAAAT GTTCCGTGCT GGCCCTGCCG GTATCCGTAC TACCCAGGCG TTCTCGCAGG ATTGTCGCTG GGATTCGCTG GATGACGACC GCGCAGCGGG TTGCATCCGC TCGCTGGAAT ATGCCTATAG CAAAGACGGC GGTCTGGCGG TGCTGTATGG CAACTTCGCC GAAAACGGCT GCATTGTGAA AACCGCAGGC GTGGATGACA GCATCCTTAA ATTTACCGGC CCGGCTAAAG TGTATGAAAG CCAGGACGAC GCGGTAGAGG CGATTCTCGG CGGCAAAGTA GTGGAAGGCG ATGTAGTCGT GATCCGCTAC GAAGGGCCGA AAGGCGGGCC GGGAATGCAG GAAATGCTCT ATCCGACCAG TTTCCTGAAG TCGATGGGGC TGGGCAAAGC CTGCGCGCTC ATCACCGATG GGCGTTTCTC CGGCGGTACT TCGGGTCTTT CCATCGGCCA CGTCTCGCCG GAAGCGGCCA GCGGCGGCAC TATTGCGTTG ATTGAAGATG GCGACACTAT TGCGATTGAT ATCCCGAACC GCAGCATTCA GTTGCAGTTG AACGAGGCTG AAATCGCCGC ACGCCGTGAG GCGCAGGAGG CTCGTGGCGA CAAAGCCTGG ACGCCGAAAA ATCGTCAGCG TCAGGTTTCG TTTGCCCTGC GTGCCTACGC CAGCCTGGCG ACCAGCGCCG ATAAAGGCGC GGTGCGCGAT AAATCGAAAC TGGGAGGTTG A
|
Protein sequence | MPKYRSATTT HGRNMAGARA LWRATGMTDS DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV AEQIEASGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDKII KLDLVDAMIQ GADPKVSDDQ SNQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVLGILGELD RAGLLNCNVK NVLGLTLPQT LEQYDITVTQ DEAVKKMFRA GPAGIRTTQA FSQDCRWDSL DDDRAAGCIR SLEYAYSKDG GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VEGDVVVIRY EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGTIAL IEDGDTIAID IPNRSIQLQL NEAEIAARRE AQEARGDKAW TPKNRQRQVS FALRAYASLA TSADKGAVRD KSKLGG
|
| |