Gene BURPS1106A_A0784 details


Gene Information

Locus tag: BURPS1106A_A0784
Symbol: dhbE
ID: 4903593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 773882
End bp: 775525
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 72%
IMG OID: 640143890
Product: 2,3-dihydroxybenzoate-AMP ligase
Protein accession: YP_001074820
Protein GI: 126457899
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1021] Peptide arylation enzymes
TIGRFAM ID: [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase
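
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical, not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 773882
END_BP = 775525
GENE_LENGTH_BP = 1644
PROTEIN_LENGTH_AA = 547


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(START_BP, END_BP)
protein_len = coding_codons(gene_len)
print(gene_len, protein_len)  # 1644 547
```

Both computed values match the Gene Length and Protein Length fields of the record.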


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCCGCCG CCGCGCGCAA CGCCGCGTCG AGCGACGCGC CCGACTGGCC CGACGACTTC 
GCGCGCCGCT ACCGCGCCGC CGGCTACTGG CGAGACGAGA CCTTCTACGG CGCGCTCGCC
GATCGCGCGC GGCGCAGCCC CGCCGCGACC GCGGTCGTCG ACGGCGAGCG CCGCGTCAGC
TACCGCGCGC TGCTCGAGCG CATCCGGCGT CTCGCCGGCG GCTTGCGCCG CCTGGGCCTC
GCGCGCGGCG ACACGGCCGT CGTGCATCTG CCGAACGGCG TGCGCTTCAT CGAAGCGTGC
TTCGCGTTGT TCCAGCTCGG CGTGCGCCCG GTGCTCGCGC TGCCCGCGCA TCGGCAACAC
GAGATCGGCG CGTTCTGCCG CTTCACGAAT GCCCGCGCAT ATCTCGGCGC GGCCCGGCTC
GGCGAGTTCG ACTGCCGCCC GCTCGCTCAC GCACTGCGGG CGAGCTGCCC GGCGCTCGCG
CACATCGTCG TCGCGGGCGA CGATCATGCC TTCTTGCATT TCGATTCGCT GTACGACGCG
CCGCCCGTCG CCGACTGCGC GGCGCGCGCG GACGACATCG CGTGCTTCCA GCTCTCGGGC
GGCACGACGG GCACGCCGAA ACTGATTCCG CGCCGCCATC GCGAGTACCT GTACAACGTG
CGCGCGTGCT CGGACGCGAG CGGCTTCGGC GCCGACACCG TCTATCTGGC CGCGCTGCCG
ATGGCGCACA ACTTCACGCT GTGCTGCCCC GGCGTGATCG GCGCGCTGCT CGCGGGCGGC
CGCGTCGTCG CGACCGAGCA CCCCGAGCCG GAGTGCGGCT TTGCGCTGAT CGCGCGCGAG
CGGGTCACGC ATACGGCGCT CGTCCCGCCG CTCGCGCTAC TGTGGCTGGA CGCGCAGCGC
GAGCGCCGAG CGGACCTGTC GAGCCTGCGC GTGCTGCAGG TCGGCGGCGC GCGGCTGATG
GACCACGCGG CCGAACGCGT GACGCCCGTC CTCGGCTGCC GGCTGCAACA GGTATTCGGA
ATGGCCGAAG GCCTCATCTG CTGCACGCGG CTCGACGATC CGCCCGCGCG CATCGCGCGT
ACGCAGGGCC GGCCGGTGTC GCCCGCCGAC GAAGTGCGCA TCGTCGACGA GGCGGGCCGC
GCCGTCGCGC CTGGCGAGAT CGGCGAATTG CAGGTGCGCG GGCCCTATAC GATCCGCGGC
TACTACCGGC TCGCCGAGCA TCATGCGGCC GCGTTCACGG CCGACGGCTT CTATCGCAGC
GGCGACCGCG TGCGCCGCAC CGAAGAAGGC GACCTCGTCG TCGAGGGCCG CGACAAGGAC
CAGATCAATC GCGGCGGCGA AAAAGTATCC GCCGAGGAAG TCGAAAACCT GCTGCTCGCG
CATTCGCAAA TCCGCGACGC GGCGCTCGTC GCGATGTCCG ATCCGCTGCT CGGCGAGCGC
ACGTGCGCGT TCGTCGTCGC GCGCCCGCCC GCGCCGACCT CGCTCGCGCT GAAGCGGCAT
TTGCGCGACC ACGGGCTCGC CGCGTTCAAG ATTCCCGATC GCATCGAATT CGTGCCGAGC
TTTCCGGAAA CCGGCATCGG CAAGACCAGC AAGAAATCGC TGCGCGACCT GCTGCGCCGC
CGGCTCGAAG CGGCGCGCGC ATGA
 
Protein sequence
MPAAARNAAS SDAPDWPDDF ARRYRAAGYW RDETFYGALA DRARRSPAAT AVVDGERRVS 
YRALLERIRR LAGGLRRLGL ARGDTAVVHL PNGVRFIEAC FALFQLGVRP VLALPAHRQH
EIGAFCRFTN ARAYLGAARL GEFDCRPLAH ALRASCPALA HIVVAGDDHA FLHFDSLYDA
PPVADCAARA DDIACFQLSG GTTGTPKLIP RRHREYLYNV RACSDASGFG ADTVYLAALP
MAHNFTLCCP GVIGALLAGG RVVATEHPEP ECGFALIARE RVTHTALVPP LALLWLDAQR
ERRADLSSLR VLQVGGARLM DHAAERVTPV LGCRLQQVFG MAEGLICCTR LDDPPARIAR
TQGRPVSPAD EVRIVDEAGR AVAPGEIGEL QVRGPYTIRG YYRLAEHHAA AFTADGFYRS
GDRVRRTEEG DLVVEGRDKD QINRGGEKVS AEEVENLLLA HSQIRDAALV AMSDPLLGER
TCAFVVARPP APTSLALKRH LRDHGLAAFK IPDRIEFVPS FPETGIGKTS KKSLRDLLRR
RLEAARA
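
The gene sequence begins with TTG, yet the protein sequence begins with M. This is expected under translation table 11, where TTG is one of the permitted start codons and an initiating codon is read as methionine. A minimal sketch of that rule follows, applied to the first 60 bases of the gene sequence above. The helper `translate_cds` is a hypothetical name written for this illustration; the codon table and start-codon set are those of NCBI translation table 11.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments for table 11
# are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (or a 5' fragment of one) under table 11.

    The first codon must be a permitted start codon and is read as M.
    Translation stops at the first stop codon, if any.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_60 = "TTGCCCGCCG CCGCGCGCAA CGCCGCGTCG AGCGACGCGC CCGACTGGCC CGACGACTTC"
print(translate_cds(first_60))  # MPAAARNAASSDAPDWPDDF
```

The output matches the first 20 residues of the protein sequence listed above.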