Gene BURPS1710b_3575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3575 
SymboltrpE 
ID3690645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3898155 
End bp3899723 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content66% 
IMG OID637730030 
Productanthranilate synthase component I 
Protein accessionYP_334940 
Protein GI76811778 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.889198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGCGGG CGCGAGCGCA GCGCCACGCG AAACCCGGCG CGCCGCGCCG ATCGTCCCGA 
CGACAGGACC GGAACATGAC TGAACTCGAA TTCCAATCGC TTGCCAACGA GGGCTACAAC
CGCATTCCGC TCATCGCCGA AGCGCTGGCC GACCTCGAAA CGCCGCTTTC ACTGTATCTG
AAGCTCGCGC AGCCCGAACG CGGCGGCGCC AACTCGTTCC TGCTCGAATC GGTGGTGGGC
GGCGAGCGCT TCGGACGCTA TTCGTTCATC GGCCTGCCCG CGCATACGCT GGTGCGCACG
AAGAACGGCG TGTCGGAGGT CGTGACGGAC GGCCAGGTCA CCGAGACCCA CGACGGCGAC
CCGTTCGCGT TCATCGCGAC ATTCCAGAGC CGCTTCAAGG TCGCGCAGCG CCCCGGCCTG
CCGCGCTTCT GCGGCGGCCT CGCCGGCTAT TTCGGCTACG ACGCGGTGCG CTACATCGAG
AAGAAGCTCG CGCACACCGC GCCGCGCGAC GATCTCGGCC TGCCCGACAT CCAGTTGCTG
CTGACCGAGG AAGTCGCCGT GATCGACAAC CTCGCCGGCA AGCTCTACCT GATCGTCTAT
GCCGATCCGA CGAAGCCCGA GGCGTACACG AAAGCCAAGC AACGGCTGCG CGAGCTCAAG
CAGCGGCTGC GCGCGAGCGT CGTGCCGCCC GTCACGTCGG CGAGCGTGCG CACCGAGATC
TATCGCGAAT TCAAGAAGGA TGACTATCTG GCCGCCGTGC GCACGGCGAA GGAATACATC
GCGGCGGGCG AGCTGATGCA GATCCAGGTC GGCCAGCGCC TGACGAAGCC GTATCGCGAC
AATCCGCTGT CGCTGTACCG CGCGCTGCGC TCGCTGAACC CGTCGCCATA CATGTATTAC
TACAATTTCG GCGAATTCCA TGTCGTCGGC GCTTCGCCGG AGATTCTCGT GCGTCAGGAG
AAGCGCGGCG ACGACCAGAT CGTGACGATC CGCCCGCTTG CCGGCACGCG GCCGCGGGGC
AACACGCCCG AGCGCGACGC CGAGCTCGCG ACCGAACTGC TCAACGATCC GAAGGAAATC
GCCGAGCACG TGATGCTGAT CGACCTCGCG CGCAACGACG TCGGCCGCAT CGCGGAAATC
GGCTCGGTCC ACGTGACCGA CAAGATGGTG ATCGAGAAAT ACTCGCACGT GCAGCACATC
GTGAGTTCGG TCGAGGGCAA GCTGAAGCCC GGCGTGACGA ACTACGACGT GCTGCGCGCG
ACGTTCCCGG CGGGCACGCT GTCCGGCGCG CCGAAAGTCC GCGCGATGGA GCTGATCGAC
GAGCTCGAGC CGATCAAGCG CGGGCTGTAC GGCGGCGCGG TCGGCTACCT GTCGTTCTCG
GGCGAGATGG ATCTCGCGAT CGCGATCCGC ACGGGCCTCA TCCACAACGG CAATCTGTAC
GTGCAGGCGG CGGCGGGCAT CGTCGCCGAC TCGGTGCCCG AATCCGAATG GCAGGAGACC
GAGAACAAGG CGCGCGCGGT GCTGCGCGCG GCCGAACAGG TGCAAGACGG CCTCGATTCC
GATTTCTGA
 
Protein sequence
MLRARAQRHA KPGAPRRSSR RQDRNMTELE FQSLANEGYN RIPLIAEALA DLETPLSLYL 
KLAQPERGGA NSFLLESVVG GERFGRYSFI GLPAHTLVRT KNGVSEVVTD GQVTETHDGD
PFAFIATFQS RFKVAQRPGL PRFCGGLAGY FGYDAVRYIE KKLAHTAPRD DLGLPDIQLL
LTEEVAVIDN LAGKLYLIVY ADPTKPEAYT KAKQRLRELK QRLRASVVPP VTSASVRTEI
YREFKKDDYL AAVRTAKEYI AAGELMQIQV GQRLTKPYRD NPLSLYRALR SLNPSPYMYY
YNFGEFHVVG ASPEILVRQE KRGDDQIVTI RPLAGTRPRG NTPERDAELA TELLNDPKEI
AEHVMLIDLA RNDVGRIAEI GSVHVTDKMV IEKYSHVQHI VSSVEGKLKP GVTNYDVLRA
TFPAGTLSGA PKVRAMELID ELEPIKRGLY GGAVGYLSFS GEMDLAIAIR TGLIHNGNLY
VQAAAGIVAD SVPESEWQET ENKARAVLRA AEQVQDGLDS DF