Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_A0414 |
Symbol | aroE |
ID | 4679315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | - |
Start bp | 407465 |
End bp | 408343 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639844691 |
Product | shikimate 5-dehydrogenase |
Protein accession | YP_991764 |
Protein GI | 121599712 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.341202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGG CCGTCGAATC GGCAGCGCGC GCGCGCGATC GCTATGCGGT GATCGGCAAT CCGATCGCGC ACAGCAAATC GCCGTTCATC CATTCGCGTT TCGCCGAGCA GACGGGCGAG GCGATCGAAT ACACGCATCT GCTCGCGCCG CTCGACGGCT TTTCGGCCAC GGTGCGCGCG TTCATCGCGC AGGGCGGCCG CGGCGTGAAC GTCACGGTGC CGTTCAAGCT CGAGGCCTAT GCGCTCGCCG ATGCGCTGTC GCCGCGCGCG GCGGCGGCGG GCGCGGTCAA CACGCTGCGC TTCGACGCGG ACGGCGTTTT CGGCGACAAC ACCGACGGCG TCGGCCTCGT GCGCGACATC GAGGTGAATC TCGGCGTGAG CCTCACGGGC GCGCGGATCC TGCTGCTCGG CGCGGGCGGC GCGGCACGCG GCGTCGTGCT GCCGATGCTC GAGCGCGGGC CCGCGTCGCT CACGATCGTC AACCGGACCG CGAGCAAGGC GGAAGAACTC GTCGGCCAGT TCACGCAGGC GGCGCACGAC GCGGGCTGCG TGCTCGCCGG CGGCGGGCCC GAACGGATCG CGCGCGAGCC GTACGACGTG ATCGTCAACG CGACGGCGGG CAGCCTCGAC GCGGCGCTGC CCGAGTGCGA CGCGGCGGCC TTCGGCCCGG CGACGCTCGC GTACGACATG ATGTACGGCG CGCGCCCGAC CGTGTTCATG GAGCACGCCG CGGCGCTCGG CGCGCGCACG GCGGATGGCC TCGGAATGCT CGTCGAGCAG GCCGCGGAAT CGTTCCACGT CTGGCGAGGC GTGCGACCCG ACAGCGCGCC CGTGCTCGCC GCGCTGCGCG CCGCGCTCGC GGCGTCGGCC GCGCACTGA
|
Protein sequence | MSTAVESAAR ARDRYAVIGN PIAHSKSPFI HSRFAEQTGE AIEYTHLLAP LDGFSATVRA FIAQGGRGVN VTVPFKLEAY ALADALSPRA AAAGAVNTLR FDADGVFGDN TDGVGLVRDI EVNLGVSLTG ARILLLGAGG AARGVVLPML ERGPASLTIV NRTASKAEEL VGQFTQAAHD AGCVLAGGGP ERIAREPYDV IVNATAGSLD AALPECDAAA FGPATLAYDM MYGARPTVFM EHAAALGART ADGLGMLVEQ AAESFHVWRG VRPDSAPVLA ALRAALAASA AH
|
| |