Gene Information |
Locus tag | SbBS512_E4149 |
Symbol | ilvD |
ID | 6271981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3873591 |
End bp | 3875441 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641727976 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001882403 |
Protein GI | 187732909 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
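The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon (the listed gene sequence ends in TAA). The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3873591, 3875441)
print(length)                     # 1851
print(protein_length_aa(length))  # 616
```

Both values match the Gene Length and Protein Length fields in the record.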
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAGT ACCGTTCCGC CACCACCACT CATGGTCGTA ATATGGCGGG TGCTCGTGCG CTGTGGCGCG CCACCGGAAT GACCGACGCC GATTTTGGCA AGCCGATTAT CGCGGTTGTG AACTCGTTCA CCCAATTTGT ACCGGGTCAC GTCCATCTGC GCGATCTCGG TAAACTGGTC GCCGAACAAA TTGAAGCGGC TGGCGGCGTT GCCAAAGAGT TCAACACCAT TGCGGTGGAT GATGGGATTG CCATGGGCCA CGGGGGGATG CTTTATTCAC TTCCATCTCG CGAACTGATC GCTGATTCCG TTGAGTATAT GGTCAACGCC CACTGCGCCG ACGCCATGGT CTGCATCTCT AACTGCGACA AAATCACCCC GGGGATGCTG ATGGCTTCCC TGCGCCTGAA TATTCCGGTG ATCTTTGTTT CCGGCGGCCC GATGGAGGCC GGGAAAACCA AACTGTCCGA TCAGATAATC AAACTCGATC TGGTTGATGC GATGATCCAG GGCGCAGACC CGAAAGTTTC TGACTCCCAG AGCGATCAGG TTGAACGTTC CGCCTGCCCA ACCTGCGGTT CCTGCTCCGG GATGTTTACC GCTAACTCAA TGAACTGCCT GACCGAAGCG CTGGGTTTGT CGCAGCCAGG CAACGGCTCG CTGCTGGCAA CCCACGCGGA CCGTAAGCAG CTGTTCCTTA ATGCTGGTAA ACGCATTGTT GAATTGACCA AACGTTATTA CGAGCAAAAC GACGAAAGTG CACTGCCGCG TAATATCGCC AGTAAGGCGG CGTTTGAAAA CGCCATGACG CTGGATATCG CGATGGGTGG ATCGACTAAC ACCGTACTTC ACCTGCTGGC GGCGGCGCAG GAAGCGGAAA TCGACTTCAC CATGAGTGAT ATCGATAAGC TTTCCCGCAA GGTTCCACAG CTGTGTAAAG TTGCGCCGAG CACCCAGAAA TACCATATGG AAGATGTTCA CCGTGCTGGT GGTGTTATCG GTATTCTCGG CGAACTGGAT CGCGCGGGGT TACTGAACCG TGATGTGAAA AACGTGCTCG GCCTGACGTT GCCGCAAACG CTGGAGCAAT ACGACGTTAT GCTGACCCAG GATGACGCGG TAAAAAATAT GTTCCGCGCC GGCCCTGCGG GTATCCGTAC CACCCAGGCA TTCTCGCAAG ATTGCCGTTG GGATACGCTG GACGACGATC GCGCAAACGG CTGTATCCGC TCGCTGGAAC ACGCCTACAG CAAAGACGGC GGCCTGGCGG TGCTCTACGG TAATTTTGCT GAAAACGGCT GCATCGTGAA AACCGCGGGC GTCGATGACA GCATCCTCAA ATTCACCGGC CCGGCAAAAG TGTACGAAAG CCAGGACGAC GCGGTAGAAG CGATCCTCGG TGGCAAAGTT GTCGCGGGCG ATGTGGTGGT GATCCGCTAT GAAGGCCCGA AAGGCGGCCC TGGCATGCAG GAAATGCTCT ACCCAACCAG CTTCCTGAAA TCAATGGGTC TCGGTAAAGC CTGTGCGCTG ATCACCGACG GTCGTTTCTC TGGCGGCACC TCTGGTCTTT CCATCGGCCA CGTCTCACCG GAAGCGGCAA GCGGCGGCAG CATTGGTCTG ATTGAAGATG GTGACCTGAT CGCTATCGAC ATCCCGAACC GTGGCATTCA GTTACAGGTA AGCGATGCCG AACTGGCGGC GCGTCGTGAA GCGCAGGAAG CTCGGGGTGA CAAAGCCTGG ACGCCGAAAA ACCGTGAACG TCAGGTTTCC TTTGCCCTGC GCGCCTACGC CAGCCTGGCA 
ACCAGCGCCG ACAAAGGTGC GGTGCGCGAT AAATCGAAAC TGGGGGGTTA A
|
Protein sequence | MPKYRSATTT HGRNMAGARA LWRATGMTDA DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV AEQIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GADPKVSDSQ SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADRKQ LFLNAGKRIV ELTKRYYEQN DESALPRNIA SKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD IDKLSRKVPQ LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVK NVLGLTLPQT LEQYDVMLTQ DDAVKNMFRA GPAGIRTTQA FSQDCRWDTL DDDRANGCIR SLEHAYSKDG GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY EGPKGGPGMQ EMLYPTSFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGSIGL IEDGDLIAID IPNRGIQLQV SDAELAARRE AQEARGDKAW TPKNRERQVS FALRAYASLA TSADKGAVRD KSKLGG
|
| |
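As a spot check that the protein sequence corresponds to the gene sequence, the sketch below translates the first and last 15 bases of the listed gene sequence. It assumes the sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard amino-acid assignments, which translation table 11 shares with table 1 apart from start-codon handling. `CODON_TABLE` and `translate` are hypothetical helper names, not part of the source database.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces between blocks are ignored."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first = translate("ATGCCTAAGT ACCGT")   # first 15 bases of the gene sequence
last = translate("AAAC TGGGGGGTTA A")   # last 15 bases of the gene sequence
print(first)  # MPKYR
print(last)   # KLGG*
```

The results agree with the start (MPKYR) and end (KLGG) of the listed protein sequence, with `*` marking the TAA stop codon.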