Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0877 |
Symbol | |
ID | 4886521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 851552 |
End bp | 856057 |
Gene Length | 4506 bp |
Protein Length | 1501 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640130817 |
Product | non-ribosomal peptide synthase |
Protein accession | YP_001061876 |
Protein GI | 126444056 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.198561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAGT CCGCTGTCGA TTCCATCTCC GAACTTTCCG CTTCTTCTTC CGACGTTTTT TTCCGTTCCC GGCTCGTCGC GCTGCTCGCC GAGTTGCTCG GCGAGCCCGC CGACGACATC GCCGCGCTGG GCGACGATGA GGATCTGCTG AGCTACGGCG TCGATTCCAT CCGCCTGATG TACGTGCAGA CGCGCTTGAG CCGCATGGGC CATGCGCTCG CATTCGACGC GCTCGCGCGC ACGCCGACGC TCGGCGCGTG GACCGCGCTG CTCGCGCAGA CGATGCGTGC CGAGCCGGCC GCGCAGGGCA CGGATCGCGC GGCGCGCGCC GATGTCGTCA CCAACGCCGA CGCCGACGCC GACGCCGACG CCGACGCCAA CGCCAACGCC AACGCCAACG CCAACGCCAA CGCCAACGCC AACGCCAACG CCAACGCCGA CATCGACATC GACGTGCACG CGGAATTCGA ACTATCCGCG GTTCAGCAGG CCTATTGGCT CGGCCGCGGC GCGGGTGAGG TGCTCGGCAA CGTGAGTTGC CATGCGTTCC TCGAATTCCG CAGCCGCGGG ATCGATCCGC AGCGTCTCGC CGCGGCGTGC CGGCTCGTGC GCGCTCGTCA CCCGATGCTG CGCGCGCGCT TTGTCGGCGG GCGGCAACAG ATCGTCGCCG CGCCCGATGC GCCCGTATTC GATTACCGCG ACTGGCGCGG CAGGGCGCCG GCGCAAGCCG AAGCCGAATG GGCCGCGCTG CGCGCGTTTC GCTCGCACGA ATGCCTCGAT ATCGCGCACG CGCAGGTTTT CATCGCCGGA CTCGCGCAGA TGCCGGACGG CGAGGATCGC GTGTGGCTGA GCATCGATCT GCTCGCCGCC GATGTCGACA GCGTGCGGCT CCTGATGAAC GAGATCGGCG CCGCGTACGC ATCGCCCGCG TCGCTGCCCG ATGCGCCGGC GACGTGGTTT CCCGTTTATC TCGCGCAGCG CGCGGCCGCC ACGCGCGCGG CCCGTGAGGC CGCGCGCGGG CACTGGCAAG CGCGGCTCGC CGACTTGCCG GACGGCCCGG CGCTGCCGCT CGCGTGCGCG CCCGAATCAA TCCGCGCGCC GCGCTTCAGC CGGCGCGCGC ACACGCTGAG CGCCGCCGAG CTCGCGCGCC TGCGGCAGCG CGCGGCGCAG CATCGCGTGA CGCTGCCGTC GGTGTTCGGC TATGCGTTCG CCGCGGTGCT CGCGCGCTGG AGCGGCCAGC ACGCGTTCGT GCTGAACGTG CCGCTGTTCG ACCGGCACGG CGAGGCGCCC GATCTGGCCG CGATGATCGC GGATTTCACG ACGCTGCTGC TCGTCGAGTG CGAGGTGCGG CCGCACGCGT GCGTCGCCGA TGCGGTGCGC GCGTTCCAGG CGTGCCTGCA CGGCGCGATC GCGCATGCGG CGTATCCGGC GCTCGAGGTG CTGCGCGATG CGCGCCGTCA GGGCGCGCCG CGCGCGGCGC CCGTCGTGTT CTCGAGCAAT CTCGGCGACG AGCCGTTCGT GCCGGCCGCG TTTCGCGAGG CGTTCGGCGA TCTGCACGAC ATGATTTCGC AGACGCCGCA AGTGTGGCTC GACCATCAGC TCTATCGCGT GACGGATGGC GTGCTGCTCG CGTGGGACAG TGTCGACGGT CTGTTTCCGG ACGGCATGCT CGATGCGATG TTCGACGCGT ACATCGCGTT CGTGCAGGCG CTGTGCGATC GCGACTGGCG GCAGCCGGCC GCGGTGGCGC TGCCGCCGGC GCAGCGCCGC GTGCGCGATG CGCTGAACGC CGTGCCCGCC CCCGGCCGGC CGCGCACGCT GCACGGCGAT TTCTTCGCGC TCGCCGCGCG CGAGCCGGCC GCCGTCGCGT TGTGGTGCGG CGAGCGCGCG ATCACGCGCG GCGAGCTCGC CGCGCAGGCG CTCGCGATCG CGGCGGGCCT GCGCGCGGCG GGCGTCGGCC ACGGCGAGGC GGTCGAGATC AGTTTGCCGC GCGGACCGGC GCAGATCGCC GCGGCGTTCG GCGTGCTCGC GGCGGGCGCG TGCTATGTGC CCGTCGACGT CGCGCAGCCG CCCGCGCGGC GCGCGTTGAT CGAGCAGGCG GCGGGCATCC GCGCGGTGAT CGGCGTGACG CCGGAGCCGG CCGCCACGCC GCCGCGCCTG GACGCGGCCG CGCTCGCGCG CAGCGCGCCG CTCGCCGCGC CGCGGCCGGT CGCGCCGCGC AGCACCGCTT ACGTGATCTA CACGTCGGGC TCGACGGGCG TGCCGAAGGG CGTCGAGATG ACGCACGAGG CGGCGATGAA CACGATCGAC GCGATCAACC CGCTGCTCGG CGTGAGCGCC GACGACCGGT TGCTCGCGGT ATCGGCGCTC GACTTCGATC TGTCGGTGTA CGACTTGTTC GGGGTGCTCG GCGCGGGCGG CGCGCTCGTA TTGCCGACGC AGGACGAGGC GCGCGACGCG GCGCGCTGGA TCGAATTGAT CGAGCGGCAT CGCGTGACGC TGTGGAACTC GGCCCCGGCG CTGCTCGAGA TGGCGCTCGC CGCGCCGGGC GCCGCCGGCG CGTGCCGCAG CGTGCGCGCG GTGCTCGCGT CCGGCGACTG GATCGCGCTC GATCTGCCGG CGCGATTGCG CGCGCGTTGC GGCGGCGCAT GCGCGTTCCA TGCGCTCGGC GGCGCGACGG AGGCCGGCAT CTGGTCGAAC CTGCAGACAG TCGACGCGGT GCCGCCGCAC TGGCGCTCGA TTCCATACGG CCGGCCGTTG CCGGGGCAGG CGTATCGCGT CGTCGACGAC AGCGGCCGCG ATGCGCCCGA CCATGTCGCG GGCGAGCTGC TGATCGGCGG CGCGAGCCTC GCGCGCGGCT ACCGGAACGA TCCGGTGCTG AGCGCGGCGC GCTTCGTCGA ATCCGATACG GGCCGCTGGT ATCGCACGGG CGATCGCGGC CGCTACTGGC CGGACGGCAC GCTGGAGTTT CTCGGCCGCG CGGACCGGCA GGTGAAGGTG CGCGGCCACC GGATCGAGCT CGGCGAGATC GAGGCCGCGT TGAGCGCGCA TCCGCAAGTG GAGGGCGCGT GCGCGAGCGT CGTGTCGGGC GATGCCGCGC ACGTCGTCGC GGCGTTCGTG CCGGTTGACG TCGCGCTCGA TCCGGCGTCG GCCGGCGCGC TCGCGTATCG GCCGGCGGCG GACACCGTGC AGGCGCAAGC CGCCGTGACG CGCGCCGTCC TGAGCCGCGT GCTCGACGGC GGCGCGCGCG TGCCGGCGCC CGTGCGCGCG CGTTGGGACG CATGGCTCGC GCGGGCGTCG CAGCCGCACG CGATTGCGCT CGAAGCCGCG CTCGAGGCGC TCGACTGGCC CGCCGCGCGG CTCGACGCGT GCGCGGCCGC GCTGCGCGCG CTCGTCGACG ATCCGCACGG CTGCGCGCCG CGCGTGCTGC TCGATGCGCA GCTCGCGCCG CAGGCGCTCG CGTCGGGCCT GCCCGACGGC GTGCGCGCGA TCGGGCAGAT CGGCGCGGCG TTGCGAACGC TCGCCGATGC GCATGCTCGC GTGGTGCGCG TCGCGGTGCT CGATGCGCGC GCCGGCCAAC TGTTCGCGCA CGGGCTTCGG CTGCTCGACG ATCCGCGCTT CGCGCTCACG CTGTTCGACG CGTCGCCGGG CCTGCTGCGC GACGCGCAAT CCCGCTTCGC GCGAACGTCG CCGGCGATGC ACGCGATGCC GGACGGTTTG CTGCCCGCTC GGTACCTGGG CCAGTTCGAT TGCGTCGTGA GCTTTGCCGC CGCGCATCTT CGCGACGATC CGCGCGATAC GTTCCGGCTC GCGGCCGCGT TGCTCGCGCG GGACGGGCAC GCGTTCGTCG CGGACGTGCT GCGCGATTCA CCGCTGCGCG AGCTGACGGC CGCGCTGCTC GGCGACGCAT CGCCGCCCCG GCTCGTTTCC GGCGAGGCGC TCGCGGCGGC CGCGCGCGCG TGCGGCTTCG CGCCCGATGC GCAGAGCTGG CGCTCGGACG CGTTCGCGCT GATCGCGGCG CGCGCGCGCG CCGAGCCGCT CACGCACGCG CGTCTCGCCG GCTGGCTGCG CGAGCGCCTG CCGGACGCGA TGCGGCCCGA GCGGCTCTGG TGCGCGCCGC GCTGGCCGCT CAACGGCAAC GGCAAGATCG ACCGCCGGGC GATCGGCGAT GCGCTGGCGC GCACGCTCGG CGACGCGCCG GCGGCGCACG CCGCGTTCGC GCCGGCCGAC GAACGGCAGG CGACGCTGCT CGCGTGCTGG GAGCAGGCGC TGGGTCGCCC TGCCGATGCG CGCGACGCCA CGTTCTTCGC GCTCGGCGGC GACAGCCTGC TCGCGACGCG GCTGCTCGCG CAATTGCGCG AGCGGCTCGG CGTGCGGATC GGCATGGCCG AGTTCTACCG CGAGCCGACG CTCGCGGGCC TCGCGGCGAA ACTGGCCGGC GCGGCGGCGG CCGTGCGCGG GCACCGCGCG GCACACGCCG CGGCGATGGA GGAGGGCGTG CTATGA
|
Protein sequence | MSQSAVDSIS ELSASSSDVF FRSRLVALLA ELLGEPADDI AALGDDEDLL SYGVDSIRLM YVQTRLSRMG HALAFDALAR TPTLGAWTAL LAQTMRAEPA AQGTDRAARA DVVTNADADA DADADANANA NANANANANA NANANADIDI DVHAEFELSA VQQAYWLGRG AGEVLGNVSC HAFLEFRSRG IDPQRLAAAC RLVRARHPML RARFVGGRQQ IVAAPDAPVF DYRDWRGRAP AQAEAEWAAL RAFRSHECLD IAHAQVFIAG LAQMPDGEDR VWLSIDLLAA DVDSVRLLMN EIGAAYASPA SLPDAPATWF PVYLAQRAAA TRAAREAARG HWQARLADLP DGPALPLACA PESIRAPRFS RRAHTLSAAE LARLRQRAAQ HRVTLPSVFG YAFAAVLARW SGQHAFVLNV PLFDRHGEAP DLAAMIADFT TLLLVECEVR PHACVADAVR AFQACLHGAI AHAAYPALEV LRDARRQGAP RAAPVVFSSN LGDEPFVPAA FREAFGDLHD MISQTPQVWL DHQLYRVTDG VLLAWDSVDG LFPDGMLDAM FDAYIAFVQA LCDRDWRQPA AVALPPAQRR VRDALNAVPA PGRPRTLHGD FFALAAREPA AVALWCGERA ITRGELAAQA LAIAAGLRAA GVGHGEAVEI SLPRGPAQIA AAFGVLAAGA CYVPVDVAQP PARRALIEQA AGIRAVIGVT PEPAATPPRL DAAALARSAP LAAPRPVAPR STAYVIYTSG STGVPKGVEM THEAAMNTID AINPLLGVSA DDRLLAVSAL DFDLSVYDLF GVLGAGGALV LPTQDEARDA ARWIELIERH RVTLWNSAPA LLEMALAAPG AAGACRSVRA VLASGDWIAL DLPARLRARC GGACAFHALG GATEAGIWSN LQTVDAVPPH WRSIPYGRPL PGQAYRVVDD SGRDAPDHVA GELLIGGASL ARGYRNDPVL SAARFVESDT GRWYRTGDRG RYWPDGTLEF LGRADRQVKV RGHRIELGEI EAALSAHPQV EGACASVVSG DAAHVVAAFV PVDVALDPAS AGALAYRPAA DTVQAQAAVT RAVLSRVLDG GARVPAPVRA RWDAWLARAS QPHAIALEAA LEALDWPAAR LDACAAALRA LVDDPHGCAP RVLLDAQLAP QALASGLPDG VRAIGQIGAA LRTLADAHAR VVRVAVLDAR AGQLFAHGLR LLDDPRFALT LFDASPGLLR DAQSRFARTS PAMHAMPDGL LPARYLGQFD CVVSFAAAHL RDDPRDTFRL AAALLARDGH AFVADVLRDS PLRELTAALL GDASPPRLVS GEALAAAARA CGFAPDAQSW RSDAFALIAA RARAEPLTHA RLAGWLRERL PDAMRPERLW CAPRWPLNGN GKIDRRAIGD ALARTLGDAP AAHAAFAPAD ERQATLLACW EQALGRPADA RDATFFALGG DSLLATRLLA QLRERLGVRI GMAEFYREPT LAGLAAKLAG AAAAVRGHRA AHAAAMEEGV L
|
| |