Gene Information |
Locus tag | BMA10229_A1113 |
Symbol | aroG |
ID | 4792414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | + |
Start bp | 1141175 |
End bp | 1142440 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001027098 |
Protein GI | 124386426 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
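The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the usual 1-based, inclusive coordinate convention and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `protein_length` are hypothetical and do not come from the source database.

```python
# Values copied from the Gene Information record above.
START_BP = 1141175
END_BP = 1142440
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: 1142440 - 1141175 + 1 = 1266 bp, and 1266 / 3 - 1 = 421 aa.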
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.667849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATGGCGC GCGAAGCGCA ACCGAATTCC CGAAAAACCG CCGGCGAACC CGGCGGTTTT TTTTCGCACC GCCGGTTCGC AAGCAGGGAC GACGGGCCGA TCCACCGCTT TACCGAATCA CCGATTTGTC GAATCGAACC GCCAGCCGCA CCCGCGCACG CCGGGCGGCG CACCGAACCG GAGAACTCAA GCATGCCCCC GCACAATACC GACGACGTCC GCATCCGTGA ACTGAAGGAG CTGACTCCGC CCGCCCACCT GATCCGCGAA TTCGCGCTCG GCGAGGCGGT GTCGGAGCTC ATCTACAACG CGCGCCAGGC GATGCACCGG ATCCTGCACG GGATGGACGA TCGCCTGATC GTCATCATCG GGCCGTGCTC GATCCACGAC ACGAAGGCGG CGCTCGAATA CGCGGGCCGG CTCGTCCAGG AGCGCGAGCG CTTCGCAAGC GAACTCGAGA TCGTAATGCG CGTGTACTTC GAGAAGCCGC GCACGACGGT CGGCTGGAAG GGGCTCATCA ACGATCCGCA CCTGGATAAC AGCTTCAAGA TCAACGACGG CCTGCGCACC GCGCGCGAGC TGCTGCTGCA GATCAACGAG ATGGGGCTGC CCGCCGGCAC CGAATATCTC GACATGATCA GCCCGCAATA CATCGCGGAC CTGATCTCGT GGGGCGCGAT CGGCGCGCGC ACGACCGAAT CGCAGGTGCA CCGCGAGCTC GCGTCGGGGC TGTCGTGCCC GGTCGGCTTC AAGAACGGCA CCGACGGCAA CGTGAAGATC GCGGTCGACG CGATCAAGGC CGCATCGCAG CCGCACCATT TCCTGTCGGT GACGAAGGGC AACCATTCGG CGATCGTGTC GACGGCCGGC AACGAGGACT GCCACGTGAT CCTGCGCGGC GGCAAGGCGC CGAACTACGA TGCCGACAGC GTGAACGCCG CGTGCGCGGA CATCGGCAAG GCCGGCCTCG CCGCGCGCCT GATGATCGAC GCGAGCCATG CGAACAGCTC GAAGAAGCAC GAGAACCAGA TTCCGGTATG CGCGGACATC GGCCGCCAGA TCGCCGCGGG CGACGAGCGC ATCGTCGGCG TGATGGTCGA GTCGCACCTC GTCGAAGGCC GCCAGGACCT GAAGGAAGGC TGCCCGCTCA CGTACGGCCA GAGCATCACC GATGCATGCA TCAACTGGGA CGACAGCGTG AAGGTGCTCG AAGGGCTCGC CGAAGCGGTG AAGGCGCGGC ACGTCGCGCG CGGCAGCGGC AACTGA
|
Protein sequence | MMAREAQPNS RKTAGEPGGF FSHRRFASRD DGPIHRFTES PICRIEPPAA PAHAGRRTEP ENSSMPPHNT DDVRIRELKE LTPPAHLIRE FALGEAVSEL IYNARQAMHR ILHGMDDRLI VIIGPCSIHD TKAALEYAGR LVQERERFAS ELEIVMRVYF EKPRTTVGWK GLINDPHLDN SFKINDGLRT ARELLLQINE MGLPAGTEYL DMISPQYIAD LISWGAIGAR TTESQVHREL ASGLSCPVGF KNGTDGNVKI AVDAIKAASQ PHHFLSVTKG NHSAIVSTAG NEDCHVILRG GKAPNYDADS VNAACADIGK AGLAARLMID ASHANSSKKH ENQIPVCADI GRQIAAGDER IVGVMVESHL VEGRQDLKEG CPLTYGQSIT DACINWDDSV KVLEGLAEAV KARHVARGSG N
|
| |
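The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted initiation codons and is read as methionine when it starts a CDS. The following is a minimal sketch of that rule using only the Python standard library. The function name `translate_cds` is hypothetical, and the codon table is built from the standard NCBI base ordering (TCAG), not taken from the source page.

```python
from itertools import product

# Amino acids for table 11 in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS; whitespace in the input is ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate_cds("GTGATGGCGC GCGAAGCGCA ACCGAATTCC"))
```

Run on the first 30 bases of the gene sequence, the sketch reproduces the first ten residues of the listed protein, MMAREAQPNS.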