Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3699 |
Symbol | aroB |
ID | 4881745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3622034 |
End bp | 3623113 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640129627 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001060703 |
Protein GI | 126441391 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00532697 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACCG TCAACGTCGA CCTGGGCGAG CGCGCCTATC CGATCCACAT CGGCGCCGAT CTGATCGGCC GCACCGAGCT TTTCGCGCCG CACATCGCGG GCGCATCCGT CACGATCGTC ACGAACACCA CCGTCGAGCC GCTCTACGGC GACACGCTGC GCGCCGCGCT CGCGCCGCTC GGCAAGCGCG TGTCGACCGT CGTCCTGCCC GACGGCGAAG CGTACAAGAA CTGGGAAACG CTCAATCTGA TCTTCGATGG CCTGCTCGAG CAGCACGCCG ATCGCAAGAC GACGCTGATC GCGCTCGGCG GCGGCGTGAT CGGCGACATG ACGGGCTTCG CGGCCGCATG CTATATGCGC GGCGTGCCGT TCATCCAGGT GCCGACGACG CTCCTGTCGC AGGTTGATTC GTCGGTCGGC GGCAAGACGG GCATCAACCA TCCGCTCGGC AAGAACATGA TCGGCGCGTT CTATCAGCCG CAGGCGGTGA TCGCCGATAT CGGCGCGCTG TCGACGCTGC CCGATCGCGA GCTTGCCGCG GGCGTCGCCG AGATCGTCAA GACGGGCGCG ATCGCCGATG CCGCGTTCTT CGACTGGATC GAGGCGAACG TGGGCGCGCT CACTCGCCGC GATCCCGACG CGCTCGCGCA CGCGGTCAAG CGCTCGTGCG AGATCAAGGC GGGCGTCGTC GCGGCGGACG AGCGCGAGGG CGGTCTGCGC GCGATCCTTA ATTTTGGCCA TACGTTCGGG CACGCGATCG AAGCGGGGCT CGGCTACGGC GAGTGGCTGC ACGGCGAGGC GGTGGGCTGC GGCATGGTGA TGGCGGCCGA CCTGTCGGTG CGAACCGGCC ATCTCGACGA AGCGTCGCGC GCGCGGCTGT GCCGCGTCGT CGAGGCCGCG CATCTGCCGA CGCGCGCGCC GGATCTCGGC GACGCGCGTT ATGTCGAGCT GATGCGCGTC GACAAGAAGG CCGAGGCGGG CGCGATCAAG TTCATACTGC TCAAACGCTT CGGCGAAACG ATCATCACTC CGGCGCCCGA CGACGCCGTT CTCGCGACAC TGGCGGCAAC CACCCGGTAA
|
Protein sequence | MITVNVDLGE RAYPIHIGAD LIGRTELFAP HIAGASVTIV TNTTVEPLYG DTLRAALAPL GKRVSTVVLP DGEAYKNWET LNLIFDGLLE QHADRKTTLI ALGGGVIGDM TGFAAACYMR GVPFIQVPTT LLSQVDSSVG GKTGINHPLG KNMIGAFYQP QAVIADIGAL STLPDRELAA GVAEIVKTGA IADAAFFDWI EANVGALTRR DPDALAHAVK RSCEIKAGVV AADEREGGLR AILNFGHTFG HAIEAGLGYG EWLHGEAVGC GMVMAADLSV RTGHLDEASR ARLCRVVEAA HLPTRAPDLG DARYVELMRV DKKAEAGAIK FILLKRFGET IITPAPDDAV LATLAATTR
|
| |