Gene BURPS668_A0269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0269 
Symbol 
ID4887708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp236719 
End bp239526 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content69% 
IMG OID640130210 
Productputative non-ribosomal peptide synthase 
Protein accessionYP_001061275 
Protein GI126442941 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.977076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCAT CCGCCCTCGA TTTACCGCGC GATTGCGAAC ACGCATTGCG CGCCGCTTCG 
CCGCCAAACA TCGTCGACCT GCTGTTGCGG GCCGCACGGC TGCATCCGCA TACGGGCGTG
CGCTTCATCG CCGCCGAATC CGAACACAAG GGCGCCTTCG TCACGTATCC CGAGCTGCTC
GACGAGGCGC GCCGCATCCT GGGCGGCCTG CGCGCGCGCG GCTATCGGTC CGGCATGAAG
GTCGCGCTGC TGCTCGAGCA CGCGAGCGAT TTCATTCCGG CGTTCTGGGC CTGCGCGCTC
GGCGGCTTCG TGCCGTGCCC GCTCGTGCCG ATCCGCAACG ATCCCGAGCG CTGGGCGAAG
CACCTCGCGC ACGTCGATAC GCTGCTCGAC CATCCGCTGC TCGTCACCAC CGAAGCGCTG
AACAACGATC TGCCGGGCGG CGCGTCGGCC GTCAACCTGA ACGCGCTGCG CGCGAGCCTG
CCCGATGCGT CGACGCACGT CGCGCAACCG TCGGACCCGG CGGTTTTCGT GCTCACGTCG
GGCTCCACCG GCAATTCGAA GGCGGTCGTG CTCACGCACG GCAACCTGCT CGCGTCGATG
GCGGGCAAGA ACGATCGGCA GCAGCTCGCG GGCGCGGACG TCACGCTCAA CTGGATCTCG
TTCGACCACG TCGCCGCGCT GCTCGAAGCG CACCTGCTGC CGCTGTACGT CGGCGCCGTG
CAGCTTCACG TCGAAGCCGC GGCGGTTCTC ACCGATCCGC TGCGCTTCTT GCGGCTCGTC
AGCCGCTATC GCGTGACGAT GACGTTCTCG CCGAACTTCC TGTTCGGGCA ACTGAACGCC
GCGCTCGAAG CGATGGGCGA CGAGGCGCTC GCCGCGTGGC GCGGCGCGGT GGATCTGTCG
TCGCTGCGGC ACGTCGTGTC GGGCGGCGAG GCGATCGTCG TCGCGACCGG GCAGCGTTTT
CTCGATCTGC TCGCGCCGTG CGGCCTCGCG CGCGATGCGC TGTGGCCCGC GTTCGGGATG
ACGGAGACGT GCGCCGGCTC CGTGTATTCG CGCGAGTTCC CGGAAGGCGA CGCGGGCCGC
GAGTTCGCAT CGCTCGGCCT GCCGGTGGCC GGGCTGCAGA TGCGCATCGC GGACGACCGC
AACAACGTGC TGCCGGAAGG CGAGGCGGGC GAGTTCCAGG TGCGCGGCCC GATGATCTTC
CAGCGCTATC ACAACAATGC CGAGGCGACG CGCGCGGCGT TCACGAGCGA CGGCTGGTTC
CGCACGGGCG ACCTCGGGCG CATCGAGCGC GGCCGGCTGT GGCTCGTCGG CCGCAGCAAG
GACAGCATCA TCGTCAACGG CGTCAATTAC TTCAGCCACG AGCTGGAGAC GACTCTCGAG
GCGCTCGACG GCGTCAAGCC CTCGTTCGTC GCGGCGTTTC CGACGCGCGG GGCCGGCGAC
GAATCCGAGC AACTCGTCGT CACGTTCACG CCGTCGTTTC CGCTCGACGA CGAGGACGCG
CTGTATCGCC TCGTCATCGC GATCCGCAAC AGCACGATCC TGCTGTGGGG CTTCCGGCCC
GCGCTGATCC TGCCGCTGCC GGAGGACGAA TTCCCGAAGA CGAGCCTCGG CAAGACCCAG
CGCGCGATCA TGCGCAAGCG CCTCGAAGCG GGCAGCTACG ACGGCTACAA GGCGCGCGTC
GCCGATCTCG CGAACCGGCA GATGGGCGGC TATGTCGCGC CCGACGGGCA GACCGAGGCC
GCGGTGGCCG CGATCTTCGC GCGGATGTTC CAGGTCGCGC CCGAGGCGAT CAGCGCGACC
GCGAGCTTCT TCGATCTCGG CGGCACGTCG CTCGACATCC TGAAGCTCAA GCGCCACGTC
GAACAGCGGC TCGGCGTGAT CGACCTGCCG ATCGTGACGA TCCTCCAGAA CCCGAGCGTG
CGCGCGCTGG CCGCGCGTCT CGCCCCGGGC GAGCGCGTGG CGGCGGGCGA ATACGATCCG
GTCGTGCCGT TGCAGCTCAC CGGCGGCAAG ACGCCGCTAT TCTGCGTGCA CCCCGGCGTC
GGCGAGGTGC TCGTGTTCGT CAACCTCGCG AAGTACTTCG TCAACGAGCG CCCGTTCTAC
GCATTGCGCG CGCGCGGCTT CAACGAAGGG GAGACGTATT TCTCCAGCTT CGACGAAATG
GTGAACACGT ATGTCGACGC GATCCGCAAG CGGCAGCCGC ACGGGCCGTA CGCGGTGGCC
GGCTATTCGT ACGGCGGCGC GGTCGCGTTC GAGATCGCGA AGGTGCTCGA AGCGCAGGGC
GAGCGGGTGG ATTTCGTCGG CAGCTTCAAT CTGCCGCCGC ACATCAAGTA CCGGATGGAC
GAGCTCGACG AGGTGGAGGG CGCGGTCAAC CTCGCGTTCT TCCTGTCGCT GATCGACAAG
CAGCAGTCGC TCACGCTGCC GCCGCAACTG CGCGCGGCGA TGCCGGAGCA AGACCCGCTC
GCGTACCTGA TCGACCACGC GCCGCCCGGG CGGCTCGTCG AGCTCGACCT CGATCTGCCG
AAATTCCGCG CGTGGGCGGG GCTCGCGCAA TCGCTGCTCA CGCTCGGGCG TTCGTACGCG
CCGTCGGGCA GCGTGCGGGC GATGTCGATC TTCTATGCGA TTCCGCTGCG CGGCACGAAG
GACGACTGGC TGAACAAGGA ACTGCGCCGC TGGGACGAGT TCACGCGCGC GCCGAACCGC
TATATCGACG TGGCGGGCGA ACACTACACG CTGATGGGGC CCGCGCACGT CGCGACGTTC
CAGGCGGTGC TGCGGGCCGA GCTCGATCGC GCGCTCGGCG GCAAATGA
 
Protein sequence
MTASALDLPR DCEHALRAAS PPNIVDLLLR AARLHPHTGV RFIAAESEHK GAFVTYPELL 
DEARRILGGL RARGYRSGMK VALLLEHASD FIPAFWACAL GGFVPCPLVP IRNDPERWAK
HLAHVDTLLD HPLLVTTEAL NNDLPGGASA VNLNALRASL PDASTHVAQP SDPAVFVLTS
GSTGNSKAVV LTHGNLLASM AGKNDRQQLA GADVTLNWIS FDHVAALLEA HLLPLYVGAV
QLHVEAAAVL TDPLRFLRLV SRYRVTMTFS PNFLFGQLNA ALEAMGDEAL AAWRGAVDLS
SLRHVVSGGE AIVVATGQRF LDLLAPCGLA RDALWPAFGM TETCAGSVYS REFPEGDAGR
EFASLGLPVA GLQMRIADDR NNVLPEGEAG EFQVRGPMIF QRYHNNAEAT RAAFTSDGWF
RTGDLGRIER GRLWLVGRSK DSIIVNGVNY FSHELETTLE ALDGVKPSFV AAFPTRGAGD
ESEQLVVTFT PSFPLDDEDA LYRLVIAIRN STILLWGFRP ALILPLPEDE FPKTSLGKTQ
RAIMRKRLEA GSYDGYKARV ADLANRQMGG YVAPDGQTEA AVAAIFARMF QVAPEAISAT
ASFFDLGGTS LDILKLKRHV EQRLGVIDLP IVTILQNPSV RALAARLAPG ERVAAGEYDP
VVPLQLTGGK TPLFCVHPGV GEVLVFVNLA KYFVNERPFY ALRARGFNEG ETYFSSFDEM
VNTYVDAIRK RQPHGPYAVA GYSYGGAVAF EIAKVLEAQG ERVDFVGSFN LPPHIKYRMD
ELDEVEGAVN LAFFLSLIDK QQSLTLPPQL RAAMPEQDPL AYLIDHAPPG RLVELDLDLP
KFRAWAGLAQ SLLTLGRSYA PSGSVRAMSI FYAIPLRGTK DDWLNKELRR WDEFTRAPNR
YIDVAGEHYT LMGPAHVATF QAVLRAELDR ALGGK