Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0720 |
Symbol | aroA |
ID | 4882388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 698155 |
End bp | 699510 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640126648 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001057772 |
Protein GI | 126440301 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.105957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA GCGATCGCCT CCAGCCGTCG TTCGTCGAAG TGAAGAACAC CTCGACGCTC AGCGGCACCA TCGATCTTCC CGCGTCGAAG AGCTCGTCGA CGCGCGCGCT CCTCACCGCG GCGCTCACGC CCGGCATCAG CACGATCCGC AACGTCGCGA CGGGCTTCAA CTCGAACGCG ATGAAGCACA ACTGCGAGCG GCTCGGCGCG TCGTTCTCGA GCGAGGGCGA CACGACGGTC GTCAAGGGCG TCGACGTGAT GCACGTCGAT CGCGAGATCG TCTTCGACCC GGGCAACTCC GGCGTCGTGC TGCGCCTGCT GATGGGCGTC GCCGGCTACC TGCCGGACAC CCGGTTCGTC ACGCAATACC GCTATTCGCT CGGCGTGCGC TCGCAGGCGG AGATGGTCGC CGCGCTGCGC CGGCTGAACG TCGAATGCGA AGCGGTCGGC CCCGAGGCGC GGCTGCCGAT CAGCATGCGC TCGACGCGCG CGCTCGGCAA GCACACCGAA GTGTCGTGCA AGAAGAGCTC GCAGTTCCTG AGCGGCCTGC TCTATCTCGG CGCGATCGGC GAGCGCGATC TCGAGATCGA CGTGGTCGAT CACATCACCG CGCCGTCGAT GGTGCATACG ACGATCAACA ATCTCGCGCA CGCGGGCGTC GCCGTCGAAT ACGACGCGGC CTTTCGCCGC TTCTTCGTGC CGGGGCGCGA TCGCTTCAAG CCGTCCGAGT TCACGGTCGG CGCCGATCCG GCGAGCACGG CCGCGATCCT CGCGCTGTGC GGCTCGCTCG CGTCGGACGT CACGCTCAAC GGCTTCTTCG AGGAAGAGCT CGGCAGCGGC GCGGTGATCC GCTATCTGAC CGATACCGGC ACGCTGATCG ACGAGTTGCC CGGCAACCGC ATCCGCATCC GGGGCGGCGC GTCGATCCGC GCGCAGGATT TCGACGGCTC GCTCGCGCCG GACGCGGTGC CCGCGCTCGC CGGCCGCGCG GCGTTCGCCG AAGGCACGAG CACGTTCTAC AACATCGAGC ATATCCGCTA CAAGGAATCG GACCGCATCT CGGATTTCCG CCGCGAACTC GACAAGCTCG GCGTGCGCTC CGAGGAAAAG CTCGATCAGT TGATCATCCA CGGCAATCCG CGCGGCTACC GCGGCGGCGC GGTCGTCGAC GGCCACTACG ATCACGGGCT CATCATGGCG CTCACGACGA TCGGCCTGCA CTGCGAGCAT CCGGTGCTGA TCAAGGAGCC GCATCACGTC GGGCAGACGT ATCCCGATTA CTTCGCCGAC ATCGGCTCGA TCGGCGCGAA CGTCGACGAG CTGATCTACC CGAACGTCGC CGCGGCGCGC GCGTGA
|
Protein sequence | MTTSDRLQPS FVEVKNTSTL SGTIDLPASK SSSTRALLTA ALTPGISTIR NVATGFNSNA MKHNCERLGA SFSSEGDTTV VKGVDVMHVD REIVFDPGNS GVVLRLLMGV AGYLPDTRFV TQYRYSLGVR SQAEMVAALR RLNVECEAVG PEARLPISMR STRALGKHTE VSCKKSSQFL SGLLYLGAIG ERDLEIDVVD HITAPSMVHT TINNLAHAGV AVEYDAAFRR FFVPGRDRFK PSEFTVGADP ASTAAILALC GSLASDVTLN GFFEEELGSG AVIRYLTDTG TLIDELPGNR IRIRGGASIR AQDFDGSLAP DAVPALAGRA AFAEGTSTFY NIEHIRYKES DRISDFRREL DKLGVRSEEK LDQLIIHGNP RGYRGGAVVD GHYDHGLIMA LTTIGLHCEH PVLIKEPHHV GQTYPDYFAD IGSIGANVDE LIYPNVAAAR A
|
| |