Gene BMA10247_A1910 details

Gene Information

Locus tag: BMA10247_A1910
Symbol: trpE
ID: 4891286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10247
Kingdom: Bacteria
Replicon accession: NC_009079
Strand:
Start bp: 1841859
End bp: 1843427
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 66%
IMG OID: 640148175
Product: anthranilate synthase component I
Protein accession: YP_001079087
Protein GI: 126445607
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
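
The length fields in this record are consistent with the listed coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length = gene_length_bp(1841859, 1843427)
print(length, protein_length_aa(length))  # prints "1569 522"
```

Both results match the Gene Length and Protein Length values listed above.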


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGCTGCGGG CGCGAGCGCA GCGCCACGCG AAACCCGGCG CGCCGCGCCG ATCGTCCCGA 
CGACAGGACC GGAACATGAC TGAACTCGAA TTCCAATCGC TTGCCAACGA GGGCTACAAC
CGCATTCCGC TCATCGCCGA AGCGCTGGCC GACCTCGAAA CGCCGCTTTC ACTGTATCTG
AAGCTCGCGC AGCCCGAACG CGGCGGCGCC AACTCGTTCC TGCTCGAATC GGTGGTGGGC
GGCGAGCGCT TCGGACGCTA TTCGTTCATC GGCCTGCCCG CGCATACGCT GGTGCGCACG
AAGAACGGCG TGTCGGAGGT CGTGACGGAC GGCCAGGTCA CCGAGACCCA CGACGGCGAC
CCGTTCGCGT TCATCGCGAC ATTCCAGAGC CGCTTCAAGG TCGCGCAGCG CCCCGGCCTG
CCGCGCTTCT GCGGCGGCCT CGCCGGCTAT TTCGGCTACG ACGCGGTGCG CTACATCGAG
AAGAAGCTCG CGCACACCGC GCCGCGCGAC GATCTCGGCC TGCCCGACAT CCAGTTGCTG
CTGACCGAGG AAGTCGCCGT GATCGACAAC CTCGCCGGCA AGCTCTACCT GATCGTCTAT
GCCGATCCGA CGAAGCCCGA GGCGTACACG AAAGCCAAGC AACGGCTGCG CGAGCTCAAG
CAGCGGCTGC GCGCGAGCGT CGTGCCGCCC GTCACGTCGG CGAGCGTGCG CACCGAGATA
TATCGCGAAT TCAAGAAGGA TGACTATCTG GCCGCCGTGC GCACGGCGAA GGAATACATC
GCGGCGGGCG AGCTGATGCA GATCCAGGTC GGCCAGCGCC TGACGAAGCC GTATCGCGAC
AATCCGCTGT CGCTGTACCG CGCGCTGCGC TCGCTGAACC CGTCGCCATA CATGTATTAC
TACAATTTCG GCGAATTCCA TGTCGTCGGC GCTTCGCCGG AGATTCTCGT GCGTCAGGAG
AAGCGCGGCG ACGACCAGAT CGTGACGATC CGCCCGCTTG CCGGCACGCG GCCGCGCGGC
AACACGCCCG AGCGCGACGC CGAGCTCGCG ACCGAACTGC TCAACGACCC GAAGGAAATC
GCCGAGCACG TGATGCTGAT CGACCTCGCG CGCAACGACG TCGGCCGCAT CGCGGAAATC
GGCTCGGTCC ACGTGACCGA CAAGATGGTG ATCGAGAAAT ACTCGCACGT GCAGCACATC
GTGAGTTCGG TCGAGGGCAA GCTGAAGCCC GGCGTGACGA ACTATGACGT GCTGCGCGCG
ACGTTCCCGG CGGGCACGCT GTCCGGCGCG CCGAAAGTCC GCGCGATGGA GCTGATCGAC
GAGCTCGAGC CGATCAAGCG CGGGCTGTAC GGCGGCGCGG TCGGCTACCT GTCGTTCTCG
GGCGAGATGG ATCTCGCGAT CGCGATCCGC ACGGGCCTCA TCCACAACGG CAATCTGTAC
GTGCAGGCGG CGGCGGGCAT CGTCGCCGAC TCGGTGCCCG AATCCGAATG GCAGGAGACC
GAGAACAAGG CGCGCGCGGT GCTGCGCGCG GCCGAACAGG TACAAGACGG CCTCGATTCC
GATTTCTGA
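
A GC content figure like the one in this record can be recomputed from a sequence printed in 10-base blocks. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence. The record does not say how its value was calculated or rounded, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above (16 of 20 bases are G or C).
print(round(gc_percent("GTGCTGCGGG CGCGAGCGCA")))  # prints "80"
```

Passing the complete gene sequence instead of the two-block excerpt gives the whole-gene value.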
 
Protein sequence
MLRARAQRHA KPGAPRRSSR RQDRNMTELE FQSLANEGYN RIPLIAEALA DLETPLSLYL 
KLAQPERGGA NSFLLESVVG GERFGRYSFI GLPAHTLVRT KNGVSEVVTD GQVTETHDGD
PFAFIATFQS RFKVAQRPGL PRFCGGLAGY FGYDAVRYIE KKLAHTAPRD DLGLPDIQLL
LTEEVAVIDN LAGKLYLIVY ADPTKPEAYT KAKQRLRELK QRLRASVVPP VTSASVRTEI
YREFKKDDYL AAVRTAKEYI AAGELMQIQV GQRLTKPYRD NPLSLYRALR SLNPSPYMYY
YNFGEFHVVG ASPEILVRQE KRGDDQIVTI RPLAGTRPRG NTPERDAELA TELLNDPKEI
AEHVMLIDLA RNDVGRIAEI GSVHVTDKMV IEKYSHVQHI VSSVEGKLKP GVTNYDVLRA
TFPAGTLSGA PKVRAMELID ELEPIKRGLY GGAVGYLSFS GEMDLAIAIR TGLIHNGNLY
VQAAAGIVAD SVPESEWQET ENKARAVLRA AEQVQDGLDS DF
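
The gene sequence starts with GTG, yet the protein starts with M. This is expected under translation table 11, where GTG can serve as an initiator codon and is then read as methionine. The sketch below illustrates that relationship. It assumes the standard codon assignments (which table 11 shares with the standard code) and uses only a subset of the table 11 initiator codons. `translate_cds` and the constants are illustrative names, not a library API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated over T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of table 11 initiator codons, chosen for illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS; an initiator codon in first position is read as M."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for index in range(0, len(bases), 3):
        codon = bases[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGCTGCGGG CGCGAGCGCA GCGCCACGCG"))  # prints "MLRARAQRHA"
```

The output matches the first block of the protein sequence, and the final three codons of the gene (GAT TTC TGA) translate to the closing residues DF followed by a stop.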