Gene BURPS668_3556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3556 
SymbolpabB 
ID4882076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3489012 
End bp3490580 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content66% 
IMG OID640129484 
Productanthranilate synthase component I 
Protein accessionYP_001060561 
Protein GI126438699 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.317199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGCGGG CGCGAGCGCA GCGCCACGCG AAACCCGGCG CGCTGCGCCG ATCGTCCCGA 
CGACAGGACC GGAACATGAC TGAACTCGAA TTCCAATCGC TTGCCAACGA GGGCTACAAC
CGCATTCCGC TCATCGCCGA AGCGCTGGCC GACCTCGAAA CGCCGCTTTC ACTGTATCTG
AAGCTCGCGC AGCCCGAACG CGGCGGCGCC AACTCGTTCC TGCTCGAATC GGTGGTGGGC
GGCGAGCGCT TCGGACGCTA TTCGTTCATC GGCCTGCCCG CGCATACGCT GGTGCGCACG
AAGAACGGCG TGTCGGAGGT CGTGACGGAC GGCCAGGTCA CCGAGACCCA CGACGGCGAC
CCGTTCGCGT TCATCGCGAC ATTCCAGAGC CGCTTCAAGG TCGCGCAGCG CCCCGGCCTG
CCGCGCTTCT GCGGCGGCCT CGCCGGCTAT TTCGGCTACG ACGCGGTGCG CTACATCGAG
AAGAAGCTCG CGCACACCGC GCCGCGCGAC GATCTCGGCC TGCCCGACAT CCAGTTGCTG
CTGACCGAGG AAGTCGCCGT GATCGACAAC CTCGCCGGCA AGCTCTACCT GATCGTCTAT
GCCGATCCGA CGAAGCCCGA GGCGTACACG AAAGCCAAGC AACGGCTGCG CGAGCTCAAG
CAGCGGCTGC GCGCGAGCGT CGTGCCGCCC GTCACGTCGG CGAGCGTGCG CACCGAGATC
TATCGCGAAT TCAAGAAGGA TGACTATCTG GCCGCCGTGC GCACGGCGAA GGAATACATC
GCGGCGGGCG AGCTGATGCA GATCCAGGTC GGCCAGCGCC TGACGAAGCC GTATCGCGAC
AATCCGCTGT CGCTGTACCG CGCGCTGCGC TCGCTGAACC CGTCGCCATA CATGTATTAC
TACAATTTCG GCGAATTCCA TGTCGTCGGC GCTTCGCCGG AGATTCTCGT GCGTCAGGAG
AAGCGCGGCG ACGACCAGAT CGTGACGATC CGCCCGCTTG CCGGCACGCG GCCGCGCGGC
AACACGCCCG AGCGCGACGC CGAGCTCGCG ACCGAACTGC TCAACGACCC GAAGGAAATC
GCCGAGCACG TGATGCTGAT CGACCTCGCG CGCAACGACG TCGGCCGCAT CGCGGAAATC
GGCTCGGTCC ACGTGACCGA CAAGATGGTG ATCGAGAAAT ACTCGCACGT GCAGCACATC
GTGAGTTCGG TCGAGGGCAA GCTGAAGCCC GGCGTGACGA ACTACGACGT GCTGCGCGCG
ACGTTCCCGG CGGGCACGCT GTCCGGCGCG CCGAAAGTCC GCGCGATGGA GCTGATCGAC
GAGCTCGAGC CGATCAAGCG CGGGCTGTAC GGCGGCGCGG TCGGCTACCT GTCGTTCTCG
GGCGAGATGG ATCTCGCGAT CGCGATCCGC ACGGGCCTCA TCCACAACGG CAATCTGTAC
GTGCAGGCGG CGGCGGGTAT CGTCGCCGAC TCGGTGCCCG AATCCGAATG GCAGGAGACC
GAGAACAAGG CGCGCGCGGT GCTGCGCGCG GCCGAACAGG TGCAAGACGG CCTCGATTCC
GATTTCTGA
 
Protein sequence
MLRARAQRHA KPGALRRSSR RQDRNMTELE FQSLANEGYN RIPLIAEALA DLETPLSLYL 
KLAQPERGGA NSFLLESVVG GERFGRYSFI GLPAHTLVRT KNGVSEVVTD GQVTETHDGD
PFAFIATFQS RFKVAQRPGL PRFCGGLAGY FGYDAVRYIE KKLAHTAPRD DLGLPDIQLL
LTEEVAVIDN LAGKLYLIVY ADPTKPEAYT KAKQRLRELK QRLRASVVPP VTSASVRTEI
YREFKKDDYL AAVRTAKEYI AAGELMQIQV GQRLTKPYRD NPLSLYRALR SLNPSPYMYY
YNFGEFHVVG ASPEILVRQE KRGDDQIVTI RPLAGTRPRG NTPERDAELA TELLNDPKEI
AEHVMLIDLA RNDVGRIAEI GSVHVTDKMV IEKYSHVQHI VSSVEGKLKP GVTNYDVLRA
TFPAGTLSGA PKVRAMELID ELEPIKRGLY GGAVGYLSFS GEMDLAIAIR TGLIHNGNLY
VQAAAGIVAD SVPESEWQET ENKARAVLRA AEQVQDGLDS DF