Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1646 |
Symbol | |
ID | 4887222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1576949 |
End bp | 1580425 |
Gene Length | 3477 bp |
Protein Length | 1158 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640131585 |
Product | putative non-ribosomal peptide synthase/polyketide synthase |
Protein accession | YP_001062642 |
Protein GI | 126443980 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.169853 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAAT CTGTTTTCGC CCGGTTCGAG CAAGTGTGCC GCAGCAATGC GCAAGCGGTC GCGCTGGAGA GCGCCGAGCA CCGTCTCACG TACGCGCAGC TGCATGACCG GGTCGCGTCG ATCGGCGCGC GTCTCGCCGC GGCGGGCGTC ACGCCCGGCA CGCTGCTCGG CATCTTCCTG CCGCGCGACG TGCGGCTGCC GGCCGCGCTC CTGGCGTCGC TCGGAAGCGG CGCCGTATAC GTGCCGCTCA CCGAGAAGTA TCCGCCCGAG CGCCTGCGCG AGATCATCGA GACGCACGGC ATCGAGCACG TCGTCACGAC CGAGGCGCTC GCCTCGCAGT TGCCCGCGTC GTGCGGCAAG ATCGTGCTGC CCGCCGAGGG TTCGGAGACG GCCGGCCCGC GGCCCGCCGG CGAGCGCGCC GACTGGCGGC CCGCGGGCGA GCGGGCGCGG AACGCGCCCG TCTACGTCGT GTTCACGTCC GGCTCGACGG GCACGCCGAA GGGCGTGCTG ATCGGCGAGC GCAATCTCGG CAATCTGATC GACTGGTACG CGGCGTCCTT TAGCGCCGAG CAGCGGCGCT CGGTGCTCGC GTCGACGCAG ATCACGTTCG ACCTGTCGGT GTTCGAGCTG ATCTGCACGC TCTGCACGGG CAGCAAGGTC GTGATCGTCG AGAACGTGCT GCAGTTGCTC GACGAAGGCG CGCCGTGCGA CGTGAGCCTG ATCAACACGG TGCCGTCGGC GGCGCGCGAG CTGGTGCGCC ACCGCAAGTT CCCGGCGGCC GCGCGCGTCG TGAATCTGGC GGGCGAGGCG CTGTATCAGG ATCTCGTCGA CGACATCTAC GAGGCCGCGC CGCAGCTCGA GCAGGTGTTC AATCTGTACG GCCCGTCCGA GGACACGACG TACTCGACGG GCCATGCGGT GCCGCGCGGC GGCGCGAGCC GCACGGTCAG CATCGGCCGC TCGCTGCCGG GCAAGCGTGC GCACATCCTG TCCGACGCGC TCACGCCCGT CGCGCCGGGC GAGGTCGGCG AAATCTGCCT GAGCGGCGAA GGCGTCGCGC TCGGCTATCT GAACGACGCG ACGCTGACGG CCGAGAAATT TCCGACGATC GGGCACGGCC CGCTCGCCGG CGAGCGTATC TACCGCACGG GCGACCTCGG CAGCATCGAC GGCGACGGCC TGCTGCGCTA TCTCGGGCGC GCGGACCGGC AGGTGAAGGT GCGCGGCGTG CGCATCGAGC CCGGCGAGGT CGAGGTCGCG CTGCGCAGCA TCGACGGCAT CGCGGATGCG GCGGTGGTGA AGATCGCCGA TGCGGCGAAC AACGATCAGC TCGTCGCGCT CGTGGTCGCG CAGCCGTCGT GCCCGGCCGA GCATGCGATC CTGGACCGGC TGCAGGCGCT GATCCCGGCG TTCATGGTGC CCTCGCGCGT CGAGCGCATC GACGCGATCC CGCTGAACGG CAACGGCAAG ACCGATCGCA CGAAGCTCGA GCAGATCGCC GGCGCGCTGT TCGGCGCGGC GCCGCCCGCC GACGACATCC AGGCGCGCGT CGCGCAGATC GTCGCCAAGC TGATGTCTCG AACCGACGTG GCGGCCGATG CCGATTTCTT TCGGATCGGC GGCAACTCGC TGCTGAGCGC GCAGCTCACG TTCATGCTGC AGAAGCAGTT CTCGGTGACG CTCAGCATTG CCGACGTGTT TCGCCATCGG ACGATCGACG CGCTCGCGGC GATCATTCGC GAACGAACCC ACAAGGCCGC GCAGACGGCG CCTGCGGCCG CCGGCCAACA GCCGGCGGCG GTGCGATCGA CCTCCGGCGG CGCAGGCGCC GGGCAGCCGG TGGTGTTCGC CACGCCCGCG CAACGCGGAA TCTGGCTGCT CGAAAACAGC CCCGGCGGCC GGGCGGTGTC GAACGCGCCG CTCGTGTTCG AGTATGCGGG CACGCTCGAG CGCGCGCTGC TCGAGCGGTG CGTGACGCAA CTGCTCGAGC GCCACGCGAT CCTGCGCAGC AACTATGTGT GGGAGGACGG CAGCCTGCGG ATCAAGTGCA ACGCGCCCGT GCCGTTTCGC GTCGAGACGG TCGATCTCGG TTCGCTGGCG CCCGACGCCC AGCAGGCGAA GGCGCGCGAG CTGGTCGCGC GGGAGGCCAT GCGCCCGTTC GATCTGCGCC GCGATCCGAT GCTGCGCGTG ACCGACATCG CGTTCGGCGA GCGCGCGGGC CAACTGGTGT TCGTGTTCCA TCACATCGCG GTCGACGATC GGGCGCTGAA CATCGTGTTC TCCGAATTGC AGCGGCTCTA CGCCGCGGGC GGCGACCCGG CGGCGATCGG CGCCGCGCCG GCCCGGCAGT TCGCCGATTA CGCGGCGGCG GTGCGCGAGG CGAGCGCGCG GCTCGGCGAG CATCTCGATT ACTGGCGGCA CAAGCTGGCC GATTACCGCG GCGCGACGCC GTATCTGGTC GACCCGCAGG CGAAGCCCGC CGCGCGCTTC GCCGGACGGC TGCACCTGCA TCGCGTGGCG CGGCACGTGA GCCGGCAGCT CGACGCGGCC GCCGCGCGCC GCGGGCTCAC GCCGTTCGCG CTGTTCGCGG CCGCCGTGGC GTGCATCGTG CACCGGGCGT CCGGCAGCGA CGACGTGACG ATCGGCACGT TCTTCTCGAA TCGCGATCAT TTCCAGGACA ACGATCTCGT CGGCTTCTTC GTCAACACGC TGCCGCTGCG CGTGCGGATC GACGCCGGCT GGGACGTCGA TCAGCTCGCG CAGGCGATGT CGGCGACGCT CGCGCAAGCG CATGCGCACC GCGACGTGAC GACCGAAGAC GTGTTCGACG CGTTGCAGGC CAACAACGCG CTGCGCCGCG CGGCATTCCG CGTGATGGTC AACCTCGAGC CGGAGCAGGC GGAGGTGCTG ACGATGGGCG CGCTGAGCGC GCGCCGGATG CTGCTCGACC GGCATGTCGC GAAGTACGAC CTGCTGTTCT CGCTGCGCAA GGAGGACGGC GACTATCGCG TGCTCGTCGA ATACAACACG GAGCTCTACG ACGGCGACGT CATGGCGGGC GTCTGCGCCA ATCTCGACCG CTCGCTCGCG GCGCTGACAG GCACGGCGAG CGAGCGGCTC GACGCGATCG CGCTGCCGGA TCCGCTCGAG CGCGGCGCGC CGGGCGATGC CGCCGCCGGC CCGGCCGACG CCGATGCGCC GGCGGCCGCG GCGGCCGATA TCGATCTGCG CGAGGTGGTG CGCGACATCT GGGCGAGCGA GGTCGGCCAC CGGACGTTCG GCGACGACGA CAACTTCTTC GACATCGGCG GCAGCTCGCT CAAGATCATG ACGGTCTACG AGAAGCTGTC GAACTTCCTG AAGCGCCACG GCATCGAGAA GGAAATCGAC ATCGTCGACC TGTTCGAGCA CGTGAGCGTC GCGGCGCTGA CCGATTTCCT CGGAGCCTTG ATCGAACATG ACGCAATCGC ACAGTGA
|
Protein sequence | MTQSVFARFE QVCRSNAQAV ALESAEHRLT YAQLHDRVAS IGARLAAAGV TPGTLLGIFL PRDVRLPAAL LASLGSGAVY VPLTEKYPPE RLREIIETHG IEHVVTTEAL ASQLPASCGK IVLPAEGSET AGPRPAGERA DWRPAGERAR NAPVYVVFTS GSTGTPKGVL IGERNLGNLI DWYAASFSAE QRRSVLASTQ ITFDLSVFEL ICTLCTGSKV VIVENVLQLL DEGAPCDVSL INTVPSAARE LVRHRKFPAA ARVVNLAGEA LYQDLVDDIY EAAPQLEQVF NLYGPSEDTT YSTGHAVPRG GASRTVSIGR SLPGKRAHIL SDALTPVAPG EVGEICLSGE GVALGYLNDA TLTAEKFPTI GHGPLAGERI YRTGDLGSID GDGLLRYLGR ADRQVKVRGV RIEPGEVEVA LRSIDGIADA AVVKIADAAN NDQLVALVVA QPSCPAEHAI LDRLQALIPA FMVPSRVERI DAIPLNGNGK TDRTKLEQIA GALFGAAPPA DDIQARVAQI VAKLMSRTDV AADADFFRIG GNSLLSAQLT FMLQKQFSVT LSIADVFRHR TIDALAAIIR ERTHKAAQTA PAAAGQQPAA VRSTSGGAGA GQPVVFATPA QRGIWLLENS PGGRAVSNAP LVFEYAGTLE RALLERCVTQ LLERHAILRS NYVWEDGSLR IKCNAPVPFR VETVDLGSLA PDAQQAKARE LVAREAMRPF DLRRDPMLRV TDIAFGERAG QLVFVFHHIA VDDRALNIVF SELQRLYAAG GDPAAIGAAP ARQFADYAAA VREASARLGE HLDYWRHKLA DYRGATPYLV DPQAKPAARF AGRLHLHRVA RHVSRQLDAA AARRGLTPFA LFAAAVACIV HRASGSDDVT IGTFFSNRDH FQDNDLVGFF VNTLPLRVRI DAGWDVDQLA QAMSATLAQA HAHRDVTTED VFDALQANNA LRRAAFRVMV NLEPEQAEVL TMGALSARRM LLDRHVAKYD LLFSLRKEDG DYRVLVEYNT ELYDGDVMAG VCANLDRSLA ALTGTASERL DAIALPDPLE RGAPGDAAAG PADADAPAAA AADIDLREVV RDIWASEVGH RTFGDDDNFF DIGGSSLKIM TVYEKLSNFL KRHGIEKEID IVDLFEHVSV AALTDFLGAL IEHDAIAQ
|
| |