Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMAA0533 |
Symbol | trpE |
ID | 3086560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006349 |
Strand | - |
Start bp | 542327 |
End bp | 543820 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637564451 |
Product | anthranilate synthase component I |
Protein accession | YP_105301 |
Protein GI | 53717534 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.137643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAC TCGAATTCCA ATCGCTTGCC AACGAGGGCT ACAACCGCAT TCCGCTCATC GCCGAAGCGC TGGCCGACCT CGAAACGCCG CTTTCACTGT ATCTGAAGCT CGCGCAGCCC GAACGCGGCG GCGCCAACTC GTTCCTGCTC GAATCGGTGG TGGGCGGCGA GCGCTTCGGA CGCTATTCGT TCATCGGCCT GCCCGCGCAT ACGCTGGTGC GCACGAAGAA CGGCGTGTCG GAGGTCGTGA CGGACGGCCA GGTCACCGAG ACCCACGACG GCGACCCGTT CGCGTTCATC GCGACATTCC AGAGCCGCTT CAAGGTCGCG CAGCGCCCCG GCCTGCCGCG CTTCTGCGGC GGCCTCGCCG GCTATTTCGG CTACGACGCG GTGCGCTACA TCGAGAAGAA GCTCGCGCAC ACCGCGCCGC GCGACGATCT CGGCCTGCCC GACATCCAGT TGCTGCTGAC CGAGGAAGTC GCCGTGATCG ACAACCTCGC CGGCAAGCTC TACCTGATCG TCTATGCCGA TCCGACGAAG CCCGAGGCGT ACACGAAAGC CAAGCAACGG CTGCGCGAGC TCAAGCAGCG GCTGCGCGCG AGCGTCGTGC CGCCCGTCAC GTCGGCGAGC GTGCGCACCG AGATATATCG CGAATTCAAG AAGGATGACT ATCTGGCCGC CGTGCGCACG GCGAAGGAAT ACATCGCGGC GGGCGAGCTG ATGCAGATCC AGGTCGGCCA GCGCCTGACG AAGCCGTATC GCGACAATCC GCTGTCGCTG TACCGCGCGC TGCGCTCGCT GAACCCGTCG CCATACATGT ATTACTACAA TTTCGGCGAA TTCCATGTCG TCGGCGCTTC GCCGGAGATT CTCGTGCGTC AGGAGAAGCG CGGCGACGAC CAGATCGTGA CGATCCGCCC GCTTGCCGGC ACGCGGCCGC GCGGCAACAC GCCCGAGCGC GACGCCGAGC TCGCGACCGA ACTGCTCAAC GACCCGAAGG AAATCGCCGA GCACGTGATG CTGATCGACC TCGCGCGCAA CGACGTCGGC CGCATCGCGG AAATCGGCTC GGTCCACGTG ACCGACAAGA TGGTGATCGA GAAATACTCG CACGTGCAGC ACATCGTGAG TTCGGTCGAG GGCAAGCTGA AGCCCGGCGT GACGAACTAT GACGTGCTGC GCGCGACGTT CCCGGCGGGC ACGCTGTCCG GCGCGCCGAA AGTCCGCGCG ATGGAGCTGA TCGACGAGCT CGAGCCGATC AAGCGCGGGC TGTACGGCGG CGCGGTCGGC TACCTGTCGT TCTCGGGCGA GATGGATCTC GCGATCGCGA TCCGCACGGG CCTCATCCAC AACGGCAATC TGTACGTGCA GGCGGCGGCG GGCATCGTCG CCGACTCGGT GCCCGAATCC GAATGGCAGG AGACCGAGAA CAAGGCGCGC GCGGTGCTGC GCGCGGCCGA ACAGGTACAA GACGGCCTCG ATTCCGATTT CTGA
|
Protein sequence | MTELEFQSLA NEGYNRIPLI AEALADLETP LSLYLKLAQP ERGGANSFLL ESVVGGERFG RYSFIGLPAH TLVRTKNGVS EVVTDGQVTE THDGDPFAFI ATFQSRFKVA QRPGLPRFCG GLAGYFGYDA VRYIEKKLAH TAPRDDLGLP DIQLLLTEEV AVIDNLAGKL YLIVYADPTK PEAYTKAKQR LRELKQRLRA SVVPPVTSAS VRTEIYREFK KDDYLAAVRT AKEYIAAGEL MQIQVGQRLT KPYRDNPLSL YRALRSLNPS PYMYYYNFGE FHVVGASPEI LVRQEKRGDD QIVTIRPLAG TRPRGNTPER DAELATELLN DPKEIAEHVM LIDLARNDVG RIAEIGSVHV TDKMVIEKYS HVQHIVSSVE GKLKPGVTNY DVLRATFPAG TLSGAPKVRA MELIDELEPI KRGLYGGAVG YLSFSGEMDL AIAIRTGLIH NGNLYVQAAA GIVADSVPES EWQETENKAR AVLRAAEQVQ DGLDSDF
|
| |