Gene BURPS1106A_A0787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0787 
Symbol 
ID4903972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp777245 
End bp781714 
Gene Length4470 bp 
Protein Length1489 aa 
Translation table11 
GC content74% 
IMG OID640143893 
Productnon-ribosomal peptide synthase 
Protein accessionYP_001074823 
Protein GI126456962 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGT CCGCTGTCGA TTCCATCTCC GAACTTTCCG CTTCTTCTTC CGACGTTTTT 
TTCCGTTCCC GGCTCGTCGC GCTGCTCGCC GAGTTGCTCG GCGAGCCCGC CGCCGACATC
GCCGCGCTGG GCGACGATGA GGATCTGCTG AGCTACGGCG TCGATTCCAT CCGCCTGATG
TACATGCAGA CGCGCTTGAG CCGCATGGGC CATGCGCTCG CATTCGACGC GCTCGCGCGC
ACGCCGACGC TCGGCGCGTG GACCGCGCTG CTCGCGCGGG CGATGCGTGC CGAGCCGGCC
GCGCAGGGCA CGGATCGCGC GGCGCGCGCC GATGTCGTCA CCAACGCCGA CGCCGACGCC
GACGCCAACG CCAACGCCAA CGCCAACGCC AACGCCAACG CCAACGCCGA CATCGACGTG
CACGCGGAAT TCGAACTATC CGCGGTTCAG CAGGCCTATT GGCTCGGCCG CGGCGCGGGT
GAGGTGCTCG GCAACGTGAG TTGCCATGCG TTCCTCGAAT TCCGCAGCCG CGGGATCGAT
CCGCAGCGTC TCGCCGCGGC GTGCCGGCTC GTGCGCGCTC GTCACCCGAT GCTGCGCGCG
CGCTTTGTCG GCGGGCGGCA ACAGATCGTC GCCGCGCCCG ATGCGCCCGT ATTCGATTAC
CGCGACTGGC GCGGCAGGGC GCCGGCGCAA GCCGAAGCCG AATGGGCCGC GCTGCGCGCG
TTTCGCTCGC ACGAATGCCT CGATATCGCG CACGCGCAGG TTTTCATCGC CGGACTCGCG
CAGATGCCGG ACGGCGAGGA TCGCGTGTGG CTGAGCATCG ATCTGCTCGC CGCCGATGTC
GACAGCGTGC GGCTCCTGAT GAACGAGATC GGCGCCGCGT ACGCATCGCC CGCGTCGCTG
CCCGATGCGC CGGCGACGTG GTTTCCCGTT TATCTCGCGC AGCGCGCGGC CGCCACGCGC
GCGGCCCGTG AGGCCGCGCG CGGGCACTGG CAAGCGCGGC TCGCCGACTT GCCGGACGGC
CCGGCGCTGC CGCTCGCGTG CGCGCCCGAA TCGATCCGCG CGCCGCGCTT CAGCCGGCGC
GCGCACACGC TGAGCGCCGC CGAGCTCGCG CGCCTGCGGC AGCGCGCGGC GCAGCATCGC
GTGACGCTGC CGTCGGTGTT CGGCTATGCG TTCGCCGCGG TGCTCGCGCG CTGGAGCGGC
CAGCACGCGT TCGTGCTGAA CGTGCCGCTG TTCGACCGGC ACGGCGAGGC GCCCGATCTG
GCCGCGATGA TCGCGGATTT CACGACGCTG CTGCTCGTCG AGTGCGAGGT GCGGCCGCAC
GCGTGCGTCG CCGATGCGGT GCGCGCGTTC CAGGCGTGCC TGCACGGCGC GATCGCGCAT
GCGGCGTATC CGGCGCTCGA GGTGCTGCGC GATGCGCGCC GTCAGGGCGC GCCGCGCGCG
GCGCCCGTCG TGTTCTCGAG CAATCTCGGC GACGAGCCGT TCGTGCCGGC CGCGTTTCGC
GAGGCGTTCG GCGATCTGCA CGACATGATT TCGCAGACGC CGCAAGTGTG GCTCGACCAT
CAGCTCTATC GCGTGACGGA TGGCGTGCTG CTCGCGTGGG ACAGTGTCGA CGGTCTGTTT
CCGGACGGCA TGCTCGATGC GATGTTCGAC GCGTACATCG CGTTCGTGCA GGCGCTGTGC
GATCGCGACT GGCGGCAGCC GGCCGCGGTG GCGCTGCCGC CGGCGCAGCG CCGCGTGCGC
GATGCGCTGA ACGCCGTGCC CGCCCCCGGC CGGCCGCGCA CGCTGCACGG CGATTTCTTC
GCGCTCGCCG CGCGCGAGCC GGCCGCCGTC GCGTTGTGGT GCGGCGAGCG CGCGATCACG
CGCGGCGAGC TCGCCGCGCA GGCGCTCGCG ATCGCGGCGG GCCTGCGCGC GGCGGGCGTC
GGCCACGGCG AGGCGGTCGA GATCAGTTTG CCGCGCGGAC CGGCGCAGAT CGCCGCGGCG
TTCGGCGTGC TCGCGGCGGG CGCGTGCTAT GTGCCCGTCG ACGTCGCGCA GCCGCCCGCG
CGGCGCGCGT TGATCGAGCA GGCGGCGGGC ATCCGCGCGG TGATCGGCGT GACGCCGGAG
CCGGCCGCCA CGCCGCCGCG CCTGGACGCG GCCGCGCTCG CGCGCAGCGC GCCGCTCGCC
GCGCCGCGGC CGGTCGCGCC GCGCAGCACC GCTTACGTGA TCTACACGTC GGGCTCGACG
GGCGTGCCGA AGGGCGTCGA GATGACGCAC GAGGCGGCGA TGAACACGAT CGACGCGATC
AACCCGCTGC TCGGCGTGAG CGCCGACGAC CGGTTGCTCG CGGTATCGGC GCTCGACTTC
GATCTGTCGG TGTACGACTT GTTCGGGGTG CTCGGCGCGG GCGGCGCGCT CGTATTGCCG
ACGCAGGACG AGGCGCGCGA CGCGGCGCGC TGGATCGAAC TGATCGAGCG GCATCGCGTG
ACGCTGTGGA ACTCGGCCCC GGCGCTGCTC GAGATGGCGC TCGCCGCGCC GGGCGCCGCC
GGCGCGTGCC GCAGCGTGCG CGCGGTGCTC GCGTCCGGCG ACTGGATCGC GCTCGATCTG
CCGGCGCGAT TGCGCGCGCG TTACGGCGGC GCATGCGCGT TCCATGCGCT CGGCGGCGCG
ACGGAGGCCG GCATCTGGTC GAACCTGCAG ACAGTCGACG CGGTGCCGCC GCACTGGCGC
TCGATTCCAT ACGGCCGGCC GTTGCCGGGG CAGGCGTATC GCGTCGTCGA CGACAGCGGC
CGCGATGCGC CCGACCATGT CGCGGGCGAG CTGCTGATCG GCGGCGCGAG CCTCGCGCGC
GGCTACCGGA ACGATCCGGT GCTGAGCGCG GCGCGCTTCG TCGAATCCGA TACGGGCCGC
TGGTATCGCA CGGGCGATCG CGGCCGCTAC TGGCCGGACG GCACGCTGGA GTTTCTCGGC
CGCGCGGACC GGCAGGTGAA GGTGCGCGGC CACCGGATCG AGCTCGGCGA GATCGAGGCC
GCGTTGAGCG CGCATCCGCA AGTGGAGGGC GCGTGCGCGA GCGTCGTGTC GGGCGATGCC
GCGCACGTCG TCGCGGCGTT CGTGCCGGTT GACGTCGCGC TCGATCCGGC GTCGGCCGGC
GCGCTCGCGT ATCGGCCGGC GGCGGACACC GTGCAGGCGC AAGCCGCCGT GACGCGCGCC
GTCCTGAGCC GCGTGCTCGA CGGCGGCGCG CGCGTGCCGG CGCCCGTGCG CGCGCGTTGG
GACGCATGGC TCGCGCGGGC GTCGCAGCCG CACGCGATTG CGCTCGAAGC CGCGCTCGAG
GCGCTCGACT GGCCCGCCGC GCGGCTCGAC GCGTGCGCGG CCGCGCTGCG CGCGCTCGTC
GACGATCCGC ACGGCTGCGC GCCGCGCGTG CTGCTCGATG CGCAGCTCGC GCCGCAGGCG
CTCGCGTCGG GCCTGCCCGA CGGCGTGCGC GCGATCGGGC AGATCGGCGC GGCGTTGCGA
ACGCTCGCCG ATGCGCATGC TCGCGTGGTG CGCGTCGCGG TGCTCGATGC GCGCGCCGGC
CAACTGTTCG CGCACGGGCT TCGGCTGCTC GACGATCCGC GCTTCGCGCT CACGCTGTTC
GACGCGTCGC CGGGCCTGCT GCGCGACGCG CAATCCCGCT TCGCGCGAAC GTCGCCGGCG
ATGCACGCGA TGCCGGACGG TTTGCTGCCC GCTCGGTACC TGGGCCAGTT CGATTGCGTC
GTGAGCTTTG CCGCCGCGCA TCTTCGCGAC GATCCGCGCG ATACGTTCCG GCTCGCGGCC
GCGTTGCTCG CGCGGGACGG GCACGCGTTC GTCGCGGACG TGCTGCGCGA TTCGCCGTTG
CGCGAGCTGA CGGCCGCGCT GCTCGGCGAC GCATCGCCGC CCCGGCTCGT TTCCGGCGAG
GCGCTCGCGG CGGCCGCGCG CGCGTGCGGC TTCGCGCCCG ATGCGCAGAG CTGGCGCTCG
GACGCGTTCG CGCTGATCGC GGCGCGCGCG CGCGCCGAGC CGCTCACGCA CGCGCGTCTC
GCCGGCTGGC TGCGCGAGCG CCTGCCGGAC GCGATGCGGC CCGAGCGGCT CTGGTGCGCG
CCGCGCTGGC CGCTCAACGG CAACGGCAAG ATCGACCGCC GGGCGATCGG CGATGCGCTG
GCGCGCACGC TCGGCGACGC GCCGGCGGCG CACGCCGCGT TCGCGCCGGC CGACGAACGG
CAGGCGACGC TGCTCGCGTG CTGGGAGCAG GCGCTGGGTC GCCCTGCCGA TGCGCGCGAC
GCCACGTTCT TCGCGCTCGG CGGCGACAGC CTGCTCGCGA CGCGGCTGCT CGCGCAATTG
CGCGAGCGGC TCGGCGTGCG GATCGGCATG GCCGAGTTCT ACCGCGAGCC GACGCTCGCG
GGCCTCGCGG CGAAACTGGC CGGCGCGGCG GCGGCCGTGC GCGGGCACCG CGCGGCACAC
GCCGCGGCGA TGGAGGAGGG CGTGCTATGA
 
Protein sequence
MSQSAVDSIS ELSASSSDVF FRSRLVALLA ELLGEPAADI AALGDDEDLL SYGVDSIRLM 
YMQTRLSRMG HALAFDALAR TPTLGAWTAL LARAMRAEPA AQGTDRAARA DVVTNADADA
DANANANANA NANANADIDV HAEFELSAVQ QAYWLGRGAG EVLGNVSCHA FLEFRSRGID
PQRLAAACRL VRARHPMLRA RFVGGRQQIV AAPDAPVFDY RDWRGRAPAQ AEAEWAALRA
FRSHECLDIA HAQVFIAGLA QMPDGEDRVW LSIDLLAADV DSVRLLMNEI GAAYASPASL
PDAPATWFPV YLAQRAAATR AAREAARGHW QARLADLPDG PALPLACAPE SIRAPRFSRR
AHTLSAAELA RLRQRAAQHR VTLPSVFGYA FAAVLARWSG QHAFVLNVPL FDRHGEAPDL
AAMIADFTTL LLVECEVRPH ACVADAVRAF QACLHGAIAH AAYPALEVLR DARRQGAPRA
APVVFSSNLG DEPFVPAAFR EAFGDLHDMI SQTPQVWLDH QLYRVTDGVL LAWDSVDGLF
PDGMLDAMFD AYIAFVQALC DRDWRQPAAV ALPPAQRRVR DALNAVPAPG RPRTLHGDFF
ALAAREPAAV ALWCGERAIT RGELAAQALA IAAGLRAAGV GHGEAVEISL PRGPAQIAAA
FGVLAAGACY VPVDVAQPPA RRALIEQAAG IRAVIGVTPE PAATPPRLDA AALARSAPLA
APRPVAPRST AYVIYTSGST GVPKGVEMTH EAAMNTIDAI NPLLGVSADD RLLAVSALDF
DLSVYDLFGV LGAGGALVLP TQDEARDAAR WIELIERHRV TLWNSAPALL EMALAAPGAA
GACRSVRAVL ASGDWIALDL PARLRARYGG ACAFHALGGA TEAGIWSNLQ TVDAVPPHWR
SIPYGRPLPG QAYRVVDDSG RDAPDHVAGE LLIGGASLAR GYRNDPVLSA ARFVESDTGR
WYRTGDRGRY WPDGTLEFLG RADRQVKVRG HRIELGEIEA ALSAHPQVEG ACASVVSGDA
AHVVAAFVPV DVALDPASAG ALAYRPAADT VQAQAAVTRA VLSRVLDGGA RVPAPVRARW
DAWLARASQP HAIALEAALE ALDWPAARLD ACAAALRALV DDPHGCAPRV LLDAQLAPQA
LASGLPDGVR AIGQIGAALR TLADAHARVV RVAVLDARAG QLFAHGLRLL DDPRFALTLF
DASPGLLRDA QSRFARTSPA MHAMPDGLLP ARYLGQFDCV VSFAAAHLRD DPRDTFRLAA
ALLARDGHAF VADVLRDSPL RELTAALLGD ASPPRLVSGE ALAAAARACG FAPDAQSWRS
DAFALIAARA RAEPLTHARL AGWLRERLPD AMRPERLWCA PRWPLNGNGK IDRRAIGDAL
ARTLGDAPAA HAAFAPADER QATLLACWEQ ALGRPADARD ATFFALGGDS LLATRLLAQL
RERLGVRIGM AEFYREPTLA GLAAKLAGAA AAVRGHRAAH AAAMEEGVL