Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3325 |
Symbol | aroG |
ID | 4902989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 3248127 |
End bp | 3249392 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640136551 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001067562 |
Protein GI | 126453129 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATGGCGC GCGAAGCGCA ACCGAATTCC CGAAAAACCG CCGGCGAACC CGGCGGTTTT TTTTCGCCCC GCCGGTTCGC AAGCAGGGAC GACGGGCCGA TCCACCGCTT TACCGAATCA CCGATTTGTC GAATCGAACC GCCAGCCGCA CCCGCGCACG CCGGGCGGCG CACCGAACCG GAGAACTCAA GCATGCCCCC GCACAATACC GACGACGTCC GCATCCGTGA ACTGAAGGAG CTGACTCCGC CCGCCCACCT GATCCGCGAA TTCGCGCTCG GCGAGGCGGT GTCGGAGCTC ATCTACAACG CGCGCCAGGC GATGCACCGG ATCCTGCACG GGATGGACGA TCGCCTGATC GTCATCATCG GGCCGTGCTC GATCCACGAC ACGAAGGCGG CGCTCGAATA CGCGGGCCGG CTCGTCCAGG AGCGCGAGCG CTTCGCAAGC GAACTCGAGA TCGTAATGCG CGTGTACTTC GAGAAGCCGC GCACGACGGT CGGCTGGAAG GGGCTCATCA ACGATCCGCA CCTGGATAAC AGCTTCAAGA TCAACGACGG CCTGCGCACC GCGCGCGAGC TGCTGCTGCA GATCAACGAG ATGGGGCTGC CCGCCGGCAC CGAATACCTC GACATGATCA GCCCGCAGTA CATCGCGGAC CTGATCTCGT GGGGCGCGAT CGGCGCGCGC ACGACCGAAT CGCAGGTGCA CCGCGAGCTC GCGTCGGGGC TGTCGTGCCC GGTCGGCTTC AAGAACGGCA CCGACGGCAA CGTGAAGATC GCGGTCGACG CGATCAAGGC CGCATCGCAG CCGCACCATT TCCTGTCGGT GACGAAGGGC GGCCATTCGG CGATCGTGTC GACGGCCGGC AACGAGGACT GCCACGTGAT CCTGCGCGGC GGCAAGGCGC CGAACTACGA TGCCGACAGC GTGAACGCCG CGTGCGCGGA CATCGGCAAG GCCGGCCTCG CCGCGCGCCT GATGATCGAC GCGAGCCATG CGAACAGCTC GAAGAAGCAC GAGAACCAGA TTCCGGTATG CGCGGACATC GGCCGCCAGA TCGCCGCGGG CGACGAGCGC ATCGTCGGCG TGATGGTCGA GTCGCACCTC GTCGAAGGCC GCCAGGACCT GAAGGAAGGC TGCCCGCTCA CGTACGGCCA GAGCATCACC GATGCATGCA TCAACTGGGA CGACAGCGTG AAGGTGCTCG AAGGGCTCGC CGAAGCGGTG AAGGCGCGGC GCGTCGCGCG CGGCAGCGGC AACTGA
|
Protein sequence | MMAREAQPNS RKTAGEPGGF FSPRRFASRD DGPIHRFTES PICRIEPPAA PAHAGRRTEP ENSSMPPHNT DDVRIRELKE LTPPAHLIRE FALGEAVSEL IYNARQAMHR ILHGMDDRLI VIIGPCSIHD TKAALEYAGR LVQERERFAS ELEIVMRVYF EKPRTTVGWK GLINDPHLDN SFKINDGLRT ARELLLQINE MGLPAGTEYL DMISPQYIAD LISWGAIGAR TTESQVHREL ASGLSCPVGF KNGTDGNVKI AVDAIKAASQ PHHFLSVTKG GHSAIVSTAG NEDCHVILRG GKAPNYDADS VNAACADIGK AGLAARLMID ASHANSSKKH ENQIPVCADI GRQIAAGDER IVGVMVESHL VEGRQDLKEG CPLTYGQSIT DACINWDDSV KVLEGLAEAV KARRVARGSG N
|
| |