Gene BURPS1106A_A0178 details


Gene Information

Locus tag: BURPS1106A_A0178
Symbol:
ID: 4904510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 167806
End bp: 170964
Gene Length: 3159 bp
Protein Length: 1052 aa
Translation table: 11
GC content: 68%
IMG OID: 640143285
Product: putative non-ribosomal peptide synthase
Protein accession: YP_001074221
Protein GI: 126457274
COG category:
    [I] Lipid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID:
    [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
    [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases
TIGRFAM ID:
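
The coordinate and length fields in this record agree with one another, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 167806
end_bp = 170964
gene_length_bp = 3159
protein_length_aa = 1052

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # coordinates match the gene length
print(span_bp % 3 == 0)                           # the CDS is a whole number of codons
print(expected_protein_aa == protein_length_aa)   # codons minus the stop match the protein length
```

Under these assumptions all three checks should print True for the values listed here.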


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCAGTTCG TCGGCATGGA TAAAACAATC GAATCATCCG GCATTTTTTG TCGATGGATT 
CAGTGGATCG TTTCGGAATC ACGTTCGCCT TCGTGCGTGA TTCCGGAGAA ATCGGGCATG
CCGGAGCGCA GCGGCGCGCG TGATCGGATT CGCTCGAACA ATCCCGCTTT TTACCGGGCA
GGCCGCCGGT TCGTTTCATG CAGTCGAGTG TCAATCGGCC GTCGACGCTC GCGAGAGCGC
AACGGCGGCC AGTCAGGTAG GGATTCACCC GCGATCGTCA ACCGACGCGT TCGGGCGCGC
GTGCCCGGAC GTTCGTTCCC ACTTGCACCA TCAGCATGGA GTACCTGCAC CATGACGGCA
TCCACCCTCG ATTTACCGCG CGATTGCGAA CACGCATTGC GCGCCGCTTC GCCGCCAAAC
ATCGTCGACC TGCTGTTGCG GGCCGCACGG CTGCATCCGC ATACGGGCGT GCGCTTCATC
GCCGCCGAAT CCGAACACAA GGGCGCCTTC GTCACATATC CCGAGCTGCT CGACGAGGCG
CGCCGCATCC TGGGCGGCCT GCGCGCGCGC GGCTATCGGT CCGGCATGAA GGTCGCGCTG
CTGCTCGAGC ACGCGAGCGA TTTCATTCCG GCGTTCTGGG CCTGCGCGCT CGGCGGCTTC
GTGCCGTGCC CGCTCGTGCC GATCCGCAAC GATCCCGAGC GCTGGGCGAA GCACCTCGCG
CACGTCGATA CGCTGCTCGA CCATCCGCTG CTCGTCACCA CCGAAGCGCT GAACAACGAT
CTGCCGGGCG GCGCGTCGGC CGTCAACCTG AACGCGCTGC GCGCGAGCCT GCCCGATGCG
TCGACGCACG TCGCGCAACC GTCGGACCCG GCGGTTTTCG TGCTCACGTC GGGCTCCACC
GGCAATTCGA AGGCGGTCGT GCTCACGCAC GGCAACCTGC TCGCGTCGAT GGCGGGCAAG
AACGACCGGC AGCAGCTCGC GGGCGCGGAC GTCACGCTCA ACTGGATCTC GTTCGACCAC
GTCGCCGCGC TGCTCGAAGC GCACCTGCTG CCGCTGTACG TCGGCGCCGT GCAGCTTCAC
GTCGAAGCCG CGGCGGTTCT CACCGATCCG CTGCGCTTCT TGCGGCTCGT CAGCCGCTAT
CGCGTGACGA TGACGTTCTC GCCGAACTTC CTGTTCGGGC AACTGAACGC CGCGCTCGAA
GCGATGGGCG ACGAGGCGCT CGCCGCGTGG CGCGGCGCGG TGGATCTGTC GTCGCTGCGG
CACGTCGTGT CGGGCGGCGA GGCGATCGTC GTCGCGACCG GGCAGCGTTT TCTCGATCTG
CTCGCGCCGT GCGGCCTCGC GCGCGATGCG CTGTGGCCCG CGTTCGGGAT GACGGAGACG
TGCGCCGGCT CCGTGTATTC GCGCGAGTTC CCGGAAGGCG ACGCGGGCCG CGAGTTCGCA
TCGCTCGGCC TGCCGGTGGC CGGGCTGCAG ATGCGCATCG CGGACGACCG CAACAACGTG
CTGCCGGAAG GCGAGGCGGG CGAGTTCCAG GTGCGCGGCC CGATGATCTT CCAGCGCTAT
CACAACAATG CCGAGGCGAC GCGCGCGGCG TTCACGAGCG ACGGCTGGTT CCGCACGGGC
GACCTCGGGC GCATCGAGCG CGGCCGGCTG TGGCTCGTCG GCCGCAGCAA GGACAGCATC
ATCGTCAACG GCGTCAATTA CTTCAGCCAC GAGCTGGAGA CGACTCTCGA GGCGCTCGAC
GGCGTCAAGC CCTCGTTCGT CGCGGCGTTT CCGACGCGCG GGGCCGGCGA CGAATCCGAG
CAACTCGTCG TCACGTTCAC GCCGTCGTTT CCGCTCGACG ACGAGGACGC GCTGTATCGC
CTCGTCATCG CGATCCGCAA CAGCACGATC CTGCTGTGGG GCTTCCGGCC CGCGCTGATC
CTGCCGCTGC CGGAGGACGA ATTTCCGAAG ACGAGCCTCG GCAAGACCCA GCGCGCGATC
ATGCGCAAGC GCCTCGAAGC GGGCAGCTAC GACGGCTACA AGGCGCGCGT CGCCGATCTC
GCGAACCGGC AGATGGGCGG CTATGTCGCG CCCGACGGGC AGACCGAGGC CGCGGTGGCC
GCGATCTTCG CGCGGATGTT CCAGGTCGCG CCCGAGGCGA TCAGCGCGAC CGCGAGCTTC
TTCGATCTCG GCGGCACGTC GCTCGACATC CTGAAGCTCA AGCGCCACGT CGAACAGCGG
CTCGGCGTGA TCGACCTGCC GATCGTGACG ATCCTCCAGA ACCCGAGCGT GCGCGCGCTG
GCCGCGCGTC TCGCCCCGGG CGAGCGCGTG ACGGCGGGCG AATACGATCC GGTCGTGCCG
TTGCAGCTCA CCGGCGGCAA GACGCCGCTG TTCTGCGTGC ACCCCGGCGT CGGCGAGGTG
CTCGTGTTCG TCAACCTCGC GAAGTACTTC GTCAACGAGC GCCCGTTCTA TGCATTGCGC
GCGCGCGGCT TCAACGAAGG GGAGACGTAT TTCTCCAGCT TCGACGAAAT GGTGAACACG
TATGTCGACG CGATCCGCAA GCGGCAGCCG CACGGGCCGT ACGCGGTGGC CGGCTATTCG
TACGGCGGCG CGGTCGCGTT CGAGATCGCG AAGGTGCTCG AAGCGCAGGG CGAGCGGGTG
GATTTCGTCG GCAGCTTCAA TCTGCCGCCG CACATCAAGT ACCGGATGGA CGAGCTCGAC
GAGGTGGAGG GCGCGGTCAA CCTCGCGTTC TTCCTGTCGC TGATCGACAA GCAGCAGTCG
CTCACGCTGC CGCCGCAACT GCGCGCGGCG ATGCCGGAGC AAGACCCGCT CGCGTACCTG
ATCGACCACG CGCCGCCCGG GCGGCTCGTC GAGCTCGACC TCGATCTGCC GAAATTCCGC
GCGTGGGCGG GGCTCGCGCA ATCGCTGCTC ACGCTCGGGC GTTCGTACGC GCCGTCGGGC
AGCGTGCGGG CGATGTCGAT CTTCTATGCG ATTCCGCTGC GCGGCACGAA GGACGACTGG
CTGAACAAGG AACTGCGCCG CTGGGACGAG TTCACGCGCG CGCCGAACCG CTATATCGAC
GTGGCGGGCG AACACTACAC GCTGATGGGG CCCGCGCACG TCGCGACGTT CCAGGCGGTG
CTGCGGGCCG AGCTCGATCG CGCGCTCGGC GGCAAATGA
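
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before it can be measured. The sketch below shows one way to do that and to compute a G+C fraction for comparison with the GC content field. It assumes that field is the rounded percentage of G and C bases over the whole gene, which the record does not state; `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence, used only as a small demonstration.
first_line = "GTGCAGTTCG TCGGCATGGA TAAAACAATC GAATCATCCG GCATTTTTTG TCGATGGATT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

A single line is not representative of the whole gene. To compare against the GC content field, pass the complete gene sequence to `clean_sequence` first.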
 
Protein sequence
MQFVGMDKTI ESSGIFCRWI QWIVSESRSP SCVIPEKSGM PERSGARDRI RSNNPAFYRA 
GRRFVSCSRV SIGRRRSRER NGGQSGRDSP AIVNRRVRAR VPGRSFPLAP SAWSTCTMTA
STLDLPRDCE HALRAASPPN IVDLLLRAAR LHPHTGVRFI AAESEHKGAF VTYPELLDEA
RRILGGLRAR GYRSGMKVAL LLEHASDFIP AFWACALGGF VPCPLVPIRN DPERWAKHLA
HVDTLLDHPL LVTTEALNND LPGGASAVNL NALRASLPDA STHVAQPSDP AVFVLTSGST
GNSKAVVLTH GNLLASMAGK NDRQQLAGAD VTLNWISFDH VAALLEAHLL PLYVGAVQLH
VEAAAVLTDP LRFLRLVSRY RVTMTFSPNF LFGQLNAALE AMGDEALAAW RGAVDLSSLR
HVVSGGEAIV VATGQRFLDL LAPCGLARDA LWPAFGMTET CAGSVYSREF PEGDAGREFA
SLGLPVAGLQ MRIADDRNNV LPEGEAGEFQ VRGPMIFQRY HNNAEATRAA FTSDGWFRTG
DLGRIERGRL WLVGRSKDSI IVNGVNYFSH ELETTLEALD GVKPSFVAAF PTRGAGDESE
QLVVTFTPSF PLDDEDALYR LVIAIRNSTI LLWGFRPALI LPLPEDEFPK TSLGKTQRAI
MRKRLEAGSY DGYKARVADL ANRQMGGYVA PDGQTEAAVA AIFARMFQVA PEAISATASF
FDLGGTSLDI LKLKRHVEQR LGVIDLPIVT ILQNPSVRAL AARLAPGERV TAGEYDPVVP
LQLTGGKTPL FCVHPGVGEV LVFVNLAKYF VNERPFYALR ARGFNEGETY FSSFDEMVNT
YVDAIRKRQP HGPYAVAGYS YGGAVAFEIA KVLEAQGERV DFVGSFNLPP HIKYRMDELD
EVEGAVNLAF FLSLIDKQQS LTLPPQLRAA MPEQDPLAYL IDHAPPGRLV ELDLDLPKFR
AWAGLAQSLL TLGRSYAPSG SVRAMSIFYA IPLRGTKDDW LNKELRRWDE FTRAPNRYID
VAGEHYTLMG PAHVATFQAV LRAELDRALG GK
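
The protein sequence can be related back to the gene sequence by translating it with the table named in the Translation table field. The sketch below builds the codon table for NCBI translation table 11 and translates a coding sequence. It assumes the first codon is a valid initiator and is therefore read as M, which is how the GTG at the start of this gene lines up with the leading M of the protein. `translate_cds` is a hypothetical helper name, not part of any library.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, ordered with T, C, A, G
# cycling over the first, then the second, then the third codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str, initiator: bool = True) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Assumption: when initiator is True, the first codon is a valid start
    # codon and is read as methionine regardless of its usual meaning.
    if initiator and residues:
        residues[0] = "M"
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCAGTTCG TCGGCATGGA TAAAACAATC"))  # MQFVGMDKTI
```

The printed residues match the first ten residues of the protein sequence. The same function can be applied to the full gene sequence to compare against the full protein.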