Gene BMAA0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA0533 
SymboltrpE 
ID3086560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp542327 
End bp543820 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content65% 
IMG OID637564451 
Productanthranilate synthase component I 
Protein accessionYP_105301 
Protein GI53717534 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.137643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC TCGAATTCCA ATCGCTTGCC AACGAGGGCT ACAACCGCAT TCCGCTCATC 
GCCGAAGCGC TGGCCGACCT CGAAACGCCG CTTTCACTGT ATCTGAAGCT CGCGCAGCCC
GAACGCGGCG GCGCCAACTC GTTCCTGCTC GAATCGGTGG TGGGCGGCGA GCGCTTCGGA
CGCTATTCGT TCATCGGCCT GCCCGCGCAT ACGCTGGTGC GCACGAAGAA CGGCGTGTCG
GAGGTCGTGA CGGACGGCCA GGTCACCGAG ACCCACGACG GCGACCCGTT CGCGTTCATC
GCGACATTCC AGAGCCGCTT CAAGGTCGCG CAGCGCCCCG GCCTGCCGCG CTTCTGCGGC
GGCCTCGCCG GCTATTTCGG CTACGACGCG GTGCGCTACA TCGAGAAGAA GCTCGCGCAC
ACCGCGCCGC GCGACGATCT CGGCCTGCCC GACATCCAGT TGCTGCTGAC CGAGGAAGTC
GCCGTGATCG ACAACCTCGC CGGCAAGCTC TACCTGATCG TCTATGCCGA TCCGACGAAG
CCCGAGGCGT ACACGAAAGC CAAGCAACGG CTGCGCGAGC TCAAGCAGCG GCTGCGCGCG
AGCGTCGTGC CGCCCGTCAC GTCGGCGAGC GTGCGCACCG AGATATATCG CGAATTCAAG
AAGGATGACT ATCTGGCCGC CGTGCGCACG GCGAAGGAAT ACATCGCGGC GGGCGAGCTG
ATGCAGATCC AGGTCGGCCA GCGCCTGACG AAGCCGTATC GCGACAATCC GCTGTCGCTG
TACCGCGCGC TGCGCTCGCT GAACCCGTCG CCATACATGT ATTACTACAA TTTCGGCGAA
TTCCATGTCG TCGGCGCTTC GCCGGAGATT CTCGTGCGTC AGGAGAAGCG CGGCGACGAC
CAGATCGTGA CGATCCGCCC GCTTGCCGGC ACGCGGCCGC GCGGCAACAC GCCCGAGCGC
GACGCCGAGC TCGCGACCGA ACTGCTCAAC GACCCGAAGG AAATCGCCGA GCACGTGATG
CTGATCGACC TCGCGCGCAA CGACGTCGGC CGCATCGCGG AAATCGGCTC GGTCCACGTG
ACCGACAAGA TGGTGATCGA GAAATACTCG CACGTGCAGC ACATCGTGAG TTCGGTCGAG
GGCAAGCTGA AGCCCGGCGT GACGAACTAT GACGTGCTGC GCGCGACGTT CCCGGCGGGC
ACGCTGTCCG GCGCGCCGAA AGTCCGCGCG ATGGAGCTGA TCGACGAGCT CGAGCCGATC
AAGCGCGGGC TGTACGGCGG CGCGGTCGGC TACCTGTCGT TCTCGGGCGA GATGGATCTC
GCGATCGCGA TCCGCACGGG CCTCATCCAC AACGGCAATC TGTACGTGCA GGCGGCGGCG
GGCATCGTCG CCGACTCGGT GCCCGAATCC GAATGGCAGG AGACCGAGAA CAAGGCGCGC
GCGGTGCTGC GCGCGGCCGA ACAGGTACAA GACGGCCTCG ATTCCGATTT CTGA
 
Protein sequence
MTELEFQSLA NEGYNRIPLI AEALADLETP LSLYLKLAQP ERGGANSFLL ESVVGGERFG 
RYSFIGLPAH TLVRTKNGVS EVVTDGQVTE THDGDPFAFI ATFQSRFKVA QRPGLPRFCG
GLAGYFGYDA VRYIEKKLAH TAPRDDLGLP DIQLLLTEEV AVIDNLAGKL YLIVYADPTK
PEAYTKAKQR LRELKQRLRA SVVPPVTSAS VRTEIYREFK KDDYLAAVRT AKEYIAAGEL
MQIQVGQRLT KPYRDNPLSL YRALRSLNPS PYMYYYNFGE FHVVGASPEI LVRQEKRGDD
QIVTIRPLAG TRPRGNTPER DAELATELLN DPKEIAEHVM LIDLARNDVG RIAEIGSVHV
TDKMVIEKYS HVQHIVSSVE GKLKPGVTNY DVLRATFPAG TLSGAPKVRA MELIDELEPI
KRGLYGGAVG YLSFSGEMDL AIAIRTGLIH NGNLYVQAAA GIVADSVPES EWQETENKAR
AVLRAAEQVQ DGLDSDF