Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_2220 |
Symbol | aroG |
ID | 4893990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009080 |
Strand | + |
Start bp | 2192444 |
End bp | 2193706 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640150873 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001081749 |
Protein GI | 126448166 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCG AAGCGCAACC GAATTCCCGA AAAACCGCCG GCGAACCCGG CGGTTTTTTT TCGCACCGCC GGTTCGCAAG CAGGGACGAC GGGCCGATCC ACCGCTTTAC CGAATCACCG ATTTGTCGAA TCGAACCGCC AGCCGCACCC GCGCACGCCG GGCGGCGCAC CGAACCGGAG AACTCAAGCA TGCCCCCGCA CAATACCGAC GACGTCCGCA TCCGTGAACT GAAGGAGCTG ACTCCGCCCG CCCACCTGAT CCGCGAATTC GCGCTCGGCG AGGCGGTGTC GGAGCTCATC TACAACGCGC GCCAGGCGAT GCACCGGATC CTGCACGGGA TGGACGATCG CCTGATCGTC ATCATCGGGC CGTGCTCGAT CCACGACACG AAGGCGGCGC TCGAATACGC GGGCCGGCTC GTCCAGGAGC GCGAGCGCTT CGCAAGCGAA CTCGAGATCG TAATGCGCGT GTACTTCGAG AAGCCGCGCA CGACGGTCGG CTGGAAGGGG CTCATCAACG ATCCGCACCT GGATAACAGC TTCAAGATCA ACGACGGCCT GCGCACCGCG CGCGAGCTGC TGCTGCAGAT CAACGAGATG GGGCTGCCCG CCGGCACCGA ATACCTCGAC ATGATCAGCC CGCAATACAT CGCGGACCTG ATCTCGTGGG GCGCGATCGG CGCGCGCACG ACCGAATCGC AGGTGCACCG CGAGCTCGCG TCGGGGCTGT CGTGCCCGGT CGGCTTCAAG AACGGCACCG ACGGCAACGT GAAGATCGCG GTCGACGCGA TCAAGGCCGC ATCGCAGCCG CACCATTTCC TGTCGGTGAC GAAGGGCAAC CATTCGGCGA TCGTGTCGAC GGCCGGCAAC GAGGACTGCC ACGTGATCCT GCGCGGCGGC AAGGCGCCGA ACTACGATGC CGACAGCGTG AACGCCGCGT GCGCGGACAT CGGCAAGGCC GGCCTCGCCG CGCGCCTGAT GATCGACGCG AGCCATGCGA ACAGCTCGAA GAAGCACGAG AACCAGATTC CGGTATGCGC GGACATCGGC CGCCAGATCG CCGCGGGCGA CGAGCGCATC GTCGGCGTGA TGGTCGAGTC GCACCTCGTC GAAGGCCGCC AGGACCTGAA GGAAGGCTGC CCGCTCACGT ACGGCCAGAG CATCACCGAT GCATGCATCA ACTGGGACGA CAGCGTGAAG GTGCTCGAAG GGCTCGCCGA AGCGGTGAAG GCGCGGCACG TCGCGCGCGG CAGCGGCAAC TGA
|
Protein sequence | MAREAQPNSR KTAGEPGGFF SHRRFASRDD GPIHRFTESP ICRIEPPAAP AHAGRRTEPE NSSMPPHNTD DVRIRELKEL TPPAHLIREF ALGEAVSELI YNARQAMHRI LHGMDDRLIV IIGPCSIHDT KAALEYAGRL VQERERFASE LEIVMRVYFE KPRTTVGWKG LINDPHLDNS FKINDGLRTA RELLLQINEM GLPAGTEYLD MISPQYIADL ISWGAIGART TESQVHRELA SGLSCPVGFK NGTDGNVKIA VDAIKAASQP HHFLSVTKGN HSAIVSTAGN EDCHVILRGG KAPNYDADSV NAACADIGKA GLAARLMIDA SHANSSKKHE NQIPVCADIG RQIAAGDERI VGVMVESHLV EGRQDLKEGC PLTYGQSITD ACINWDDSVK VLEGLAEAVK ARHVARGSGN
|
| |