Gene BMASAVP1_0118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_0118 
SymboldhbF 
ID4678157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008784 
Strand
Start bp110678 
End bp113893 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content73% 
IMG OID639842651 
Productnonribosomal peptide synthetase DhbF 
Protein accessionYP_989735 
Protein GI121596979 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATGC GCGATACATC GCTTCGCCCG CTGCTGCCCG CCCAGCGGGA GATCTGGCTC 
GCCGAGCAGT TGCGCCCGGG CACGGGTGTC TACAACACGG CGGGCTACGC CGACATCGTC
GGCGCCGTCG ATCACGGCGC GCTGCGGGAC GCGTTCGGCC ACGCGATGCG CGAGGCCGAC
GCCGTGCGCG CGACGTTCTC CGCGCACGGC GACGACGCGT GGCAGACGCC GCGCGATGCG
CCGCACGCCG ACGTGCCGCT CGTCGACGTG AGCGGCCAGG CCGATCCCGC CGGCGTCGCG
CTCGCATGGA TGATGCGCGA CATGCGCACG CCGCCCGATT TCGCGGCGGG CCCGCTCGCG
CGCAGCGCGC TCTTGCGGGT GGGGGCGGCG CGCTTTTTCT GGTACGTCTG CGCGCATCAC
ATCGTCACCG ACGCGTACGG CACGGCGCTC GTCGTGCAAC GCACGGCGCA GCTCTACGGC
GCGCGGGTGC GCGGCGCGGC GCCGCCGCCC GCGTGGTTCG GCACGCTGGA CGCGCTCATC
GACGAGGACC GCGCGTATCG TGAATCGGCC GCGTTCGCCG ACGATCGCGA TTACTGGCGC
GCCCGCCTGG CGGCGGCGCC GGAGCCGCGC AGCCTGTCGC CCGTCGCGCT GCCGGGGCAG
TTCGCGCCGG CCGGCGACTT TCACCGGCAA TGGGGCGAGC TCGACGCCGC GACGAGCGAC
GGGCTGCGCG CGATGGCGCG CGCCGGCGCA CAAGGCATGC CGCCGCTCGT CGCGGCGATG
ATGGCCGCCT ATGTGCACCG GTTCACGCAC GAGCGCGACG TCGTGCTCGG CTTGCCCGTC
ATGGCGCGGC TAAGCCGCGC CACGCGCAGG ACGCCGGGCA TGGTCGCGAA CGTGCTGCCG
CTGCGCTTCG CGTTCGATCG CGGGACGACG TTCGCGGCGC TCCACGCGCA ATGCGCGGCC
GAGATGCGCG CGTCGCTGCG GCACCAGCGC TATCGCGGCG AGGCTCTGCT GCGCGATGCG
CAGCAGAGCC GGCCGGCCGA GCGCCTTCAC GCGCAGAACG TGAACGTGAT GGCGTTCGAG
CGGGCGCTCG CGTTCGGCGA TTGCCCGGCG CGCTCGCACA GCCTGTCGAA CGGCCCCGTC
GACGATTTTT CGATCACCGT CTACGACGAC GGCGCGAGGC AACCGATCCG CATCGCGTTC
GACGCGAACG CGGCGCACTA TGCAAGCGGC GATCTGGCCG CGCATCGCGA GCGCTTTCTG
CGCCTTGGCG TCGCGCTCGC GAACGCGGCG CAGCGGCCGA TCGCCGACGC GCAGTGGCTC
ACGCCCGCCG AGCGCCGGAC GCTGGTCGGC GAACGCGGCA CGCCCGCGCC GGATGCGGGC
GTGCCCTTCG ACACCATCGC CGCGCGCTTC GCGGCGCGGG CCGCCGAGCG CCCGGACGCG
ATCACGCTCG TCGATCGCGG GCGGCGAATC ACGTACGGCG AGCTGAACGC GCGCGCGAAT
CGGCTCGCGC ACGTGCTGAT CGAGGCGGGC GTCGGGCCGG AGGCGCTGGT CGGCCTGCAC
ATGCCGCGCT CGGCCGAGCT CGTCGTCGGC ATGCTCGCGA TATTGAAGGC GGGCGGCGCG
TACGTGCCGC TCGACCCCGC CTATCCGGCC TCGCGCATCG AATTCATGGT GGCCGACGCG
CGGCCGATGC TGTCGATCAC GACGGGCGAG CACGCGGCGC AACTGCCGGC CCGCACGCCG
ACGATCGTGC TCGACGCCGC CGACGCGCAG GCGGCGTTGC GGCGCGCGCC CGCGCACGAC
CCGGTGCGGC CGGCGCCGCT CGATCGCGAG CACGCGGCGT ACGTGATCTA CACGTCCGGC
TCCACCGGCA AGCCGAAGGG CGCGGTGGTG TCGCATCGCA ACGTGATCCG CCTGCTGGAC
GGCACGCGCG GCTGGTTCGA TTTCGGCGCG GCGCATACGT GGACGCTGTT CCATTCGTTC
GCATTCGATT TCTCGGTGTG GGAGTGCTGG GGCGCGCTGC TCACGGGCGG GCGGCTCGTC
GTCGTGCCTT ACGACGTGAG CCGCTCGCCC GCCGAGTTCC TGAAGCTGCT GGTCGACGAG
CGCGTGACGG TGCTCAATCA GACGCCGTCG GCGTTTCGCC AGCTGATGCA GGCCGACGAG
GCGCACGCGG ACTTGAGCGC GCGGCTCGCG CTGCGCTACG TGGTGTTCGG CGGCGAGGCG
CTCGACGCGC GCAGCCTCGC GCGCTGGTAT GAGCGGCACG CGGACACCGC GCCCCGGCTC
GTCAACATGT ACGGCATCAC CGAGACGACG GTCCACGTCA GCTATCTGGC GCTCAGCCGC
GCGATCGCCG GGATGCCGGC GAACAGCCTG ATCGGCCGGC CGCTTCCCGA CTTGCGCGTG
TATGTGCTCG ACGCCGCGCT GCGCCCCGTG CCGGCGGGCG TGCCGGGCGA GATGTACGTC
GCCGGCGCGG GGCTCGCGCG CGGCTATCTG CGCCGGCCGT CGCTCACCGC GCAGCGCTTC
ATCGCCGATC CGTTCGGGCC GCCGGGCACG CGCATGTACC GCACGGGCGA CGTCGCGCGA
TGGCGGGCCG ACGGCGGGCT CGATTTCATC GGGCGGGCCG ACGAGCAGGT GAAAGTGCGC
GGCTTTCGCG TGGAGCTCGG CGAAATCGCC GCGCGGCTCG CGTGCGATCC GTCGGTCGCG
CAGGCGCAGG CCGTCGTGCG GCAGGACGGG CCTGCGCACG AGCGGCTCGT CGCGTACGTC
GTGCCGCGCG CCGGCGCGAC GATCGACGTT TGCGCGCTGC GCGCGTCGCT CGCGGCCGAG
ATGCCGGAAT ACATGGTGCC CGCGGCCATC GTCGCGCTCG ATGCGATGCC GCTCACGCCG
AACGGCAAAC TCGACCGCGC GGCGCTGCCG GCGCCGATCG TGACGGGCAC GAGCCGGCGC
GCGCCGGAAA ACCGCATCGA GCAGCAGGTG TGCGCGATGT TCGCGGAACT GCTCGATGCG
CAGACGCTCG GCGCCGAGGA CAATTTTTTC GAGCTCGGCG GCGATTCGCT GCTGGCGATG
CGCGCGATCA ACAAGCTGCG GCAGACGTTC GATGTCGAGC TGACGATTCG CGATCTGTTT
TCCGCGCCGA CTGTCGCGGC GCTCAGCATG CGGCTCGATG CGCAATTGGC GGCGCGCCGC
GCTCACGCGG ACGGCGCTGA GCTGCCCGCC GGGTAG
 
Protein sequence
MNMRDTSLRP LLPAQREIWL AEQLRPGTGV YNTAGYADIV GAVDHGALRD AFGHAMREAD 
AVRATFSAHG DDAWQTPRDA PHADVPLVDV SGQADPAGVA LAWMMRDMRT PPDFAAGPLA
RSALLRVGAA RFFWYVCAHH IVTDAYGTAL VVQRTAQLYG ARVRGAAPPP AWFGTLDALI
DEDRAYRESA AFADDRDYWR ARLAAAPEPR SLSPVALPGQ FAPAGDFHRQ WGELDAATSD
GLRAMARAGA QGMPPLVAAM MAAYVHRFTH ERDVVLGLPV MARLSRATRR TPGMVANVLP
LRFAFDRGTT FAALHAQCAA EMRASLRHQR YRGEALLRDA QQSRPAERLH AQNVNVMAFE
RALAFGDCPA RSHSLSNGPV DDFSITVYDD GARQPIRIAF DANAAHYASG DLAAHRERFL
RLGVALANAA QRPIADAQWL TPAERRTLVG ERGTPAPDAG VPFDTIAARF AARAAERPDA
ITLVDRGRRI TYGELNARAN RLAHVLIEAG VGPEALVGLH MPRSAELVVG MLAILKAGGA
YVPLDPAYPA SRIEFMVADA RPMLSITTGE HAAQLPARTP TIVLDAADAQ AALRRAPAHD
PVRPAPLDRE HAAYVIYTSG STGKPKGAVV SHRNVIRLLD GTRGWFDFGA AHTWTLFHSF
AFDFSVWECW GALLTGGRLV VVPYDVSRSP AEFLKLLVDE RVTVLNQTPS AFRQLMQADE
AHADLSARLA LRYVVFGGEA LDARSLARWY ERHADTAPRL VNMYGITETT VHVSYLALSR
AIAGMPANSL IGRPLPDLRV YVLDAALRPV PAGVPGEMYV AGAGLARGYL RRPSLTAQRF
IADPFGPPGT RMYRTGDVAR WRADGGLDFI GRADEQVKVR GFRVELGEIA ARLACDPSVA
QAQAVVRQDG PAHERLVAYV VPRAGATIDV CALRASLAAE MPEYMVPAAI VALDAMPLTP
NGKLDRAALP APIVTGTSRR APENRIEQQV CAMFAELLDA QTLGAEDNFF ELGGDSLLAM
RAINKLRQTF DVELTIRDLF SAPTVAALSM RLDAQLAARR AHADGAELPA G