Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0787 |
Symbol | |
ID | 4903972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 777245 |
End bp | 781714 |
Gene Length | 4470 bp |
Protein Length | 1489 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640143893 |
Product | non-ribosomal peptide synthase |
Protein accession | YP_001074823 |
Protein GI | 126456962 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAGT CCGCTGTCGA TTCCATCTCC GAACTTTCCG CTTCTTCTTC CGACGTTTTT TTCCGTTCCC GGCTCGTCGC GCTGCTCGCC GAGTTGCTCG GCGAGCCCGC CGCCGACATC GCCGCGCTGG GCGACGATGA GGATCTGCTG AGCTACGGCG TCGATTCCAT CCGCCTGATG TACATGCAGA CGCGCTTGAG CCGCATGGGC CATGCGCTCG CATTCGACGC GCTCGCGCGC ACGCCGACGC TCGGCGCGTG GACCGCGCTG CTCGCGCGGG CGATGCGTGC CGAGCCGGCC GCGCAGGGCA CGGATCGCGC GGCGCGCGCC GATGTCGTCA CCAACGCCGA CGCCGACGCC GACGCCAACG CCAACGCCAA CGCCAACGCC AACGCCAACG CCAACGCCGA CATCGACGTG CACGCGGAAT TCGAACTATC CGCGGTTCAG CAGGCCTATT GGCTCGGCCG CGGCGCGGGT GAGGTGCTCG GCAACGTGAG TTGCCATGCG TTCCTCGAAT TCCGCAGCCG CGGGATCGAT CCGCAGCGTC TCGCCGCGGC GTGCCGGCTC GTGCGCGCTC GTCACCCGAT GCTGCGCGCG CGCTTTGTCG GCGGGCGGCA ACAGATCGTC GCCGCGCCCG ATGCGCCCGT ATTCGATTAC CGCGACTGGC GCGGCAGGGC GCCGGCGCAA GCCGAAGCCG AATGGGCCGC GCTGCGCGCG TTTCGCTCGC ACGAATGCCT CGATATCGCG CACGCGCAGG TTTTCATCGC CGGACTCGCG CAGATGCCGG ACGGCGAGGA TCGCGTGTGG CTGAGCATCG ATCTGCTCGC CGCCGATGTC GACAGCGTGC GGCTCCTGAT GAACGAGATC GGCGCCGCGT ACGCATCGCC CGCGTCGCTG CCCGATGCGC CGGCGACGTG GTTTCCCGTT TATCTCGCGC AGCGCGCGGC CGCCACGCGC GCGGCCCGTG AGGCCGCGCG CGGGCACTGG CAAGCGCGGC TCGCCGACTT GCCGGACGGC CCGGCGCTGC CGCTCGCGTG CGCGCCCGAA TCGATCCGCG CGCCGCGCTT CAGCCGGCGC GCGCACACGC TGAGCGCCGC CGAGCTCGCG CGCCTGCGGC AGCGCGCGGC GCAGCATCGC GTGACGCTGC CGTCGGTGTT CGGCTATGCG TTCGCCGCGG TGCTCGCGCG CTGGAGCGGC CAGCACGCGT TCGTGCTGAA CGTGCCGCTG TTCGACCGGC ACGGCGAGGC GCCCGATCTG GCCGCGATGA TCGCGGATTT CACGACGCTG CTGCTCGTCG AGTGCGAGGT GCGGCCGCAC GCGTGCGTCG CCGATGCGGT GCGCGCGTTC CAGGCGTGCC TGCACGGCGC GATCGCGCAT GCGGCGTATC CGGCGCTCGA GGTGCTGCGC GATGCGCGCC GTCAGGGCGC GCCGCGCGCG GCGCCCGTCG TGTTCTCGAG CAATCTCGGC GACGAGCCGT TCGTGCCGGC CGCGTTTCGC GAGGCGTTCG GCGATCTGCA CGACATGATT TCGCAGACGC CGCAAGTGTG GCTCGACCAT CAGCTCTATC GCGTGACGGA TGGCGTGCTG CTCGCGTGGG ACAGTGTCGA CGGTCTGTTT CCGGACGGCA TGCTCGATGC GATGTTCGAC GCGTACATCG CGTTCGTGCA GGCGCTGTGC GATCGCGACT GGCGGCAGCC GGCCGCGGTG GCGCTGCCGC CGGCGCAGCG CCGCGTGCGC GATGCGCTGA ACGCCGTGCC CGCCCCCGGC CGGCCGCGCA CGCTGCACGG CGATTTCTTC GCGCTCGCCG CGCGCGAGCC GGCCGCCGTC GCGTTGTGGT GCGGCGAGCG CGCGATCACG CGCGGCGAGC TCGCCGCGCA GGCGCTCGCG ATCGCGGCGG GCCTGCGCGC GGCGGGCGTC GGCCACGGCG AGGCGGTCGA GATCAGTTTG CCGCGCGGAC CGGCGCAGAT CGCCGCGGCG TTCGGCGTGC TCGCGGCGGG CGCGTGCTAT GTGCCCGTCG ACGTCGCGCA GCCGCCCGCG CGGCGCGCGT TGATCGAGCA GGCGGCGGGC ATCCGCGCGG TGATCGGCGT GACGCCGGAG CCGGCCGCCA CGCCGCCGCG CCTGGACGCG GCCGCGCTCG CGCGCAGCGC GCCGCTCGCC GCGCCGCGGC CGGTCGCGCC GCGCAGCACC GCTTACGTGA TCTACACGTC GGGCTCGACG GGCGTGCCGA AGGGCGTCGA GATGACGCAC GAGGCGGCGA TGAACACGAT CGACGCGATC AACCCGCTGC TCGGCGTGAG CGCCGACGAC CGGTTGCTCG CGGTATCGGC GCTCGACTTC GATCTGTCGG TGTACGACTT GTTCGGGGTG CTCGGCGCGG GCGGCGCGCT CGTATTGCCG ACGCAGGACG AGGCGCGCGA CGCGGCGCGC TGGATCGAAC TGATCGAGCG GCATCGCGTG ACGCTGTGGA ACTCGGCCCC GGCGCTGCTC GAGATGGCGC TCGCCGCGCC GGGCGCCGCC GGCGCGTGCC GCAGCGTGCG CGCGGTGCTC GCGTCCGGCG ACTGGATCGC GCTCGATCTG CCGGCGCGAT TGCGCGCGCG TTACGGCGGC GCATGCGCGT TCCATGCGCT CGGCGGCGCG ACGGAGGCCG GCATCTGGTC GAACCTGCAG ACAGTCGACG CGGTGCCGCC GCACTGGCGC TCGATTCCAT ACGGCCGGCC GTTGCCGGGG CAGGCGTATC GCGTCGTCGA CGACAGCGGC CGCGATGCGC CCGACCATGT CGCGGGCGAG CTGCTGATCG GCGGCGCGAG CCTCGCGCGC GGCTACCGGA ACGATCCGGT GCTGAGCGCG GCGCGCTTCG TCGAATCCGA TACGGGCCGC TGGTATCGCA CGGGCGATCG CGGCCGCTAC TGGCCGGACG GCACGCTGGA GTTTCTCGGC CGCGCGGACC GGCAGGTGAA GGTGCGCGGC CACCGGATCG AGCTCGGCGA GATCGAGGCC GCGTTGAGCG CGCATCCGCA AGTGGAGGGC GCGTGCGCGA GCGTCGTGTC GGGCGATGCC GCGCACGTCG TCGCGGCGTT CGTGCCGGTT GACGTCGCGC TCGATCCGGC GTCGGCCGGC GCGCTCGCGT ATCGGCCGGC GGCGGACACC GTGCAGGCGC AAGCCGCCGT GACGCGCGCC GTCCTGAGCC GCGTGCTCGA CGGCGGCGCG CGCGTGCCGG CGCCCGTGCG CGCGCGTTGG GACGCATGGC TCGCGCGGGC GTCGCAGCCG CACGCGATTG CGCTCGAAGC CGCGCTCGAG GCGCTCGACT GGCCCGCCGC GCGGCTCGAC GCGTGCGCGG CCGCGCTGCG CGCGCTCGTC GACGATCCGC ACGGCTGCGC GCCGCGCGTG CTGCTCGATG CGCAGCTCGC GCCGCAGGCG CTCGCGTCGG GCCTGCCCGA CGGCGTGCGC GCGATCGGGC AGATCGGCGC GGCGTTGCGA ACGCTCGCCG ATGCGCATGC TCGCGTGGTG CGCGTCGCGG TGCTCGATGC GCGCGCCGGC CAACTGTTCG CGCACGGGCT TCGGCTGCTC GACGATCCGC GCTTCGCGCT CACGCTGTTC GACGCGTCGC CGGGCCTGCT GCGCGACGCG CAATCCCGCT TCGCGCGAAC GTCGCCGGCG ATGCACGCGA TGCCGGACGG TTTGCTGCCC GCTCGGTACC TGGGCCAGTT CGATTGCGTC GTGAGCTTTG CCGCCGCGCA TCTTCGCGAC GATCCGCGCG ATACGTTCCG GCTCGCGGCC GCGTTGCTCG CGCGGGACGG GCACGCGTTC GTCGCGGACG TGCTGCGCGA TTCGCCGTTG CGCGAGCTGA CGGCCGCGCT GCTCGGCGAC GCATCGCCGC CCCGGCTCGT TTCCGGCGAG GCGCTCGCGG CGGCCGCGCG CGCGTGCGGC TTCGCGCCCG ATGCGCAGAG CTGGCGCTCG GACGCGTTCG CGCTGATCGC GGCGCGCGCG CGCGCCGAGC CGCTCACGCA CGCGCGTCTC GCCGGCTGGC TGCGCGAGCG CCTGCCGGAC GCGATGCGGC CCGAGCGGCT CTGGTGCGCG CCGCGCTGGC CGCTCAACGG CAACGGCAAG ATCGACCGCC GGGCGATCGG CGATGCGCTG GCGCGCACGC TCGGCGACGC GCCGGCGGCG CACGCCGCGT TCGCGCCGGC CGACGAACGG CAGGCGACGC TGCTCGCGTG CTGGGAGCAG GCGCTGGGTC GCCCTGCCGA TGCGCGCGAC GCCACGTTCT TCGCGCTCGG CGGCGACAGC CTGCTCGCGA CGCGGCTGCT CGCGCAATTG CGCGAGCGGC TCGGCGTGCG GATCGGCATG GCCGAGTTCT ACCGCGAGCC GACGCTCGCG GGCCTCGCGG CGAAACTGGC CGGCGCGGCG GCGGCCGTGC GCGGGCACCG CGCGGCACAC GCCGCGGCGA TGGAGGAGGG CGTGCTATGA
|
Protein sequence | MSQSAVDSIS ELSASSSDVF FRSRLVALLA ELLGEPAADI AALGDDEDLL SYGVDSIRLM YMQTRLSRMG HALAFDALAR TPTLGAWTAL LARAMRAEPA AQGTDRAARA DVVTNADADA DANANANANA NANANADIDV HAEFELSAVQ QAYWLGRGAG EVLGNVSCHA FLEFRSRGID PQRLAAACRL VRARHPMLRA RFVGGRQQIV AAPDAPVFDY RDWRGRAPAQ AEAEWAALRA FRSHECLDIA HAQVFIAGLA QMPDGEDRVW LSIDLLAADV DSVRLLMNEI GAAYASPASL PDAPATWFPV YLAQRAAATR AAREAARGHW QARLADLPDG PALPLACAPE SIRAPRFSRR AHTLSAAELA RLRQRAAQHR VTLPSVFGYA FAAVLARWSG QHAFVLNVPL FDRHGEAPDL AAMIADFTTL LLVECEVRPH ACVADAVRAF QACLHGAIAH AAYPALEVLR DARRQGAPRA APVVFSSNLG DEPFVPAAFR EAFGDLHDMI SQTPQVWLDH QLYRVTDGVL LAWDSVDGLF PDGMLDAMFD AYIAFVQALC DRDWRQPAAV ALPPAQRRVR DALNAVPAPG RPRTLHGDFF ALAAREPAAV ALWCGERAIT RGELAAQALA IAAGLRAAGV GHGEAVEISL PRGPAQIAAA FGVLAAGACY VPVDVAQPPA RRALIEQAAG IRAVIGVTPE PAATPPRLDA AALARSAPLA APRPVAPRST AYVIYTSGST GVPKGVEMTH EAAMNTIDAI NPLLGVSADD RLLAVSALDF DLSVYDLFGV LGAGGALVLP TQDEARDAAR WIELIERHRV TLWNSAPALL EMALAAPGAA GACRSVRAVL ASGDWIALDL PARLRARYGG ACAFHALGGA TEAGIWSNLQ TVDAVPPHWR SIPYGRPLPG QAYRVVDDSG RDAPDHVAGE LLIGGASLAR GYRNDPVLSA ARFVESDTGR WYRTGDRGRY WPDGTLEFLG RADRQVKVRG HRIELGEIEA ALSAHPQVEG ACASVVSGDA AHVVAAFVPV DVALDPASAG ALAYRPAADT VQAQAAVTRA VLSRVLDGGA RVPAPVRARW DAWLARASQP HAIALEAALE ALDWPAARLD ACAAALRALV DDPHGCAPRV LLDAQLAPQA LASGLPDGVR AIGQIGAALR TLADAHARVV RVAVLDARAG QLFAHGLRLL DDPRFALTLF DASPGLLRDA QSRFARTSPA MHAMPDGLLP ARYLGQFDCV VSFAAAHLRD DPRDTFRLAA ALLARDGHAF VADVLRDSPL RELTAALLGD ASPPRLVSGE ALAAAARACG FAPDAQSWRS DAFALIAARA RAEPLTHARL AGWLRERLPD AMRPERLWCA PRWPLNGNGK IDRRAIGDAL ARTLGDAPAA HAAFAPADER QATLLACWEQ ALGRPADARD ATFFALGGDS LLATRLLAQL RERLGVRIGM AEFYREPTLA GLAAKLAGAA AAVRGHRAAH AAAMEEGVL
|
| |