Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3308 |
Symbol | pabB |
ID | 4902449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3225453 |
End bp | 3227462 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640136534 |
Product | para-aminobenzoate synthase, component I |
Protein accession | YP_001067545 |
Protein GI | 126454445 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACG AAGCGGGCGC GGTTTTCGCG CTCTTCGACG ATTGCGACTC GACCGCGGCG GCGCGCTCGA GTCGTTTGTA TTCGGGCTTC TCGCACGAGC GGGCCTGCGC CGATCCGGCC GGCCTCGACG CGGTGTGCGC GGCCGTCGCC GACGACGCGC GGCGCGGGCT GCACGCCGTC GTGCTCGGCG ACTACGAGTT CGGGCGCGAC CTGCAGCTCG GCAAGCGCGG CGGCGGCGCG TTGCGGTTCC TGCTGTTCGC CGAATGCAGG ATGCTCTCGC GCGAGGAGGC CGACGCGTGG CTCGCGTCGC GCGACGGCGG CTCGGCCGAG CCGTCGGCGG CGGGCGTCGC GCACGTGACG AAGAGCGTCG CGCGCGACGC GTTCGACGCG GCGATCGCCG CCGTGCACGA TGCGCTGCGC GCGGGCGATT CGTATCAGAT CAACTACACG TACCGGCTGC ATTTCGATGC GTTCGGCGCG CCGCTCGCGC TGTATCGCCG CCTGCGGGCG CGCCAGCCGG TGCGCTACGG CGCGCTCGTC GCGCTGCCGG GCGGCGCGTG GATCGTGTCG TGCTCGCCCG AGCTGTTCGT CGAGAAGACG GGCGCGTTGC TGCGCGCGCG GCCGATGAAA GGCACCGCGC CTCGCTCGGA CGATCCGCGC GAGGATGCGG CCGCCGCGCG CTTTCTCGCG AACGATGCGA AGAATCGCGC CGAGAACGTG ATGATCGTCG ATCTGCTGCG CAACGATCTC GCGCGCATCG CGCGCACCGG ATCGGTGACG GTGCCGGCGT TGTTCTCGGT CGAGCCGTAT GCGTCGGTGT GGCAGATGAC GTCGACGGTC GAGGCGGGCA TCGTCGAAGG CGCGACGTTC GCCGACGTGC TGCGCGCGCT CTTTCCGTGC GGCTCGATCA CGGGCGCGCC GAAGCACAGG ACGATGCAAC TGATCGATGC GCTCGAGACG ACGCCGCGCG GGCTCTATAC GGGCGCGATC GGCTGGCTCG ATGCGGAGGA GGCGCGTGCG GACGGCGCCG AGCCTGCCGC CGGCGCGTTC GCGCCGGCGC GTGCGGCGAG CCCGGCCGCA GGCACGGCGT CGGATGGTGC GTGCGCGGGC GAAATCGCGT CGAAGCAGGC GTCGAAGCGT GCCGGCGCGA CAAGCCCGGC CGGCGCGTGC GGCGATTTCT GCCTGTCGGT CGCGATCCGC ACGCTGACGC TCGACGCGCC GTCGGCCGGC GGCGAGCGGC GCGGCACGAT GGGCGTCGGC GCGGGCATCG TGCTCGACAG CGTTGCCGCC GACGAATACG CGGAGTGCGA ATTGAAGGCG CGTTTCCTGA CCGACGCCGA GCCCGGCTTC CAGTTGTTCG AGACGATGCT CGCCACGCGT GACGCCGGCG TGCGCCATCT CGACCGCCAT GTCGCGCGGC TGCGCGCGAG TGCCGACGCG CTCGGCTTCG CGTTCGACGA AGCGGCGGCC AGGCAGCGCA TCGGCGCGCG CTGCGTGCAG CTCGGCGACG GCGAGCATCG CTTGCGCGTC GCGCTCGCGA AGGACGGCGT GCTCGACATC GTGGCCGCGC CGCTCGCGCC GCTCGCGGGC GCCGCGGTGG CCGTGTTGCT CGCGCCCGAG CACGGGTTCG CGCCGATGCG CTCGGGCGAC TTCCTGCTCG CGCACAAGAC GACGCGCCGC GCCGATTACG ATCGCGCGTG GAAAGCCGCG CAGGCGTGCG GTGCGTTCGA CATGCTGTTC TTCAACGAAC GCGGCGAATT GACCGAAGGC GGGCGCACGA GCGTATTCGT GAAGCTGGCC GGGCGCTGGT TCACGCCGCC GCTGTCGTCG GGCGTGTTGC CGGGCGTGAT GCGCGGCGCG CTGCTCGACG ATCCGGCATG GCAGGCGTCC GAGCGTGTAA TGACGCTGGA CGACGTGCTG CGCGCCGACG CGCTGATGCT GACGAACGCA TTGCGCGGCG CGATGCCCGC GCAACTCGTG CGGGCCGCCG GCGCGGCGGC ACCGCGATAG
|
Protein sequence | MTDEAGAVFA LFDDCDSTAA ARSSRLYSGF SHERACADPA GLDAVCAAVA DDARRGLHAV VLGDYEFGRD LQLGKRGGGA LRFLLFAECR MLSREEADAW LASRDGGSAE PSAAGVAHVT KSVARDAFDA AIAAVHDALR AGDSYQINYT YRLHFDAFGA PLALYRRLRA RQPVRYGALV ALPGGAWIVS CSPELFVEKT GALLRARPMK GTAPRSDDPR EDAAAARFLA NDAKNRAENV MIVDLLRNDL ARIARTGSVT VPALFSVEPY ASVWQMTSTV EAGIVEGATF ADVLRALFPC GSITGAPKHR TMQLIDALET TPRGLYTGAI GWLDAEEARA DGAEPAAGAF APARAASPAA GTASDGACAG EIASKQASKR AGATSPAGAC GDFCLSVAIR TLTLDAPSAG GERRGTMGVG AGIVLDSVAA DEYAECELKA RFLTDAEPGF QLFETMLATR DAGVRHLDRH VARLRASADA LGFAFDEAAA RQRIGARCVQ LGDGEHRLRV ALAKDGVLDI VAAPLAPLAG AAVAVLLAPE HGFAPMRSGD FLLAHKTTRR ADYDRAWKAA QACGAFDMLF FNERGELTEG GRTSVFVKLA GRWFTPPLSS GVLPGVMRGA LLDDPAWQAS ERVMTLDDVL RADALMLTNA LRGAMPAQLV RAAGAAAPR
|
| |