Gene BURPS668_A0874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0874 
SymboldhbE 
ID4887254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp848201 
End bp849844 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content72% 
IMG OID640130814 
Product2,3-dihydroxybenzoate-AMP ligase 
Protein accessionYP_001061873 
Protein GI126444853 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCCGCCG CCGCGCGCAA CGCCGCGTCG AGCGACGCGC CCGACTGGCC CGACGACTTC 
GCGCGCCGCT ACCGCGACGC CGGCTACTGG CGAGACGAGA CCTTCTACGG CGCGCTCGCC
GAGCGCGCGC GGCGCAGCCC CACCGCGACC GCGGTCGTCG ACGGCGAGCG CCGCGTCAGC
TACCGCGCGC TGCTCGAGCG CATCCGGCGT CTCGCCGGCG GCTTGCGCCG CCTGGGCCTC
GCGCGCGGCG ACACGGCCGT CGTGCATCTG CCGAACGGCG TGCGCTTCAT CGAAGCGTGC
TTCGCGCTGT TCCAGCTCGG CGTGCGCCCG GTGCTCGCGC TGCCCGCGCA TCGGCAACAC
GAGATCGGCG CGTTCTGCCG CTTCACGAAT GCCCGCGCAT ATCTCGGCGC GGCCCGGCTC
GGCGAGTTCG ACTGCCGCCC GCTCGCTCAC GCGCTGCGGG CGAGCTGCCC GGCGCTCGCG
CACATCGTCG TCGCGGGCGA CGATCATGCC TTCTTGCATT TCGATTCGCT GTACGACACG
CCGCCCGTCG CCGACTGCGC GGCGCGCGCG GACGACATCG CGTGCTTCCA GCTCTCGGGC
GGCACGACGG GCACGCCGAA ACTGATTCCG CGCCGCCATC GCGAGTACCT GTACAACGTG
CGCGCGTGCT CGGACGCGAG CGGCTTCGGC GCCGACACCG TCTATCTGGC CGCGCTGCCG
ATGGCGCACA ACTTCACGCT GTGCTGCCCC GGCGTGATCG GCGCGCTGCT CGCGGGCGGC
CGCGTCGTCG CGACCGAGCA CCCCGAGCCG GAGTGCGGCT TTGCGCTGAT CGCGCGCGAG
CGGGTCACGC ATACGGCGCT CGTCCCGCCG CTCGCGCTAC TGTGGCTGGA CGCGCAGCGC
GAGCGCCGAG CGGACCTGTC GAGCCTGCGC GTGCTGCAGG TCGGCGGCGC GCGGCTGATG
GACCACGCGG CCGAACGCGT GACGCCCGTC CTCGGCTGCC GGCTGCAACA GGTATTCGGA
ATGGCCGAAG GCCTCATCTG CTGCACGCGG CTCGACGATC CGCCCGCGCG CATCGCGCGT
ACGCAGGGCC GGCCGGTGTC GCCCGCCGAC GAAGTGCGCA TCGTCGACGA GGCGGGCCGC
GCCGTCGCGC CTGGCGAGAT CGGCGAATTG CAGGTGCGCG GGCCCTATAC GATCCGCGGC
TACTACCGGC TCGCCGAGCA TCATGCGGCC GCGTTCACGG CCGACGGCTT CTATCGCACC
GGCGACCGCG TGCGCCGCAC CGAAGAAGGC GACCTCGTCG TCGAGGGCCG CGACAAGGAC
CAGATCAATC GCGGCGGCGA AAAAGTATCC GCCGAGGAAG TCGAGAACCT GCTGCTCGCG
CATTCGCAAA TCCGCGACGC GGCGCTCGTC GCGATGTCCG ATCCGCTGCT CGGCGAGCGC
ACGTGCGCGT TCGTCGTCGC GCGCCCGCCC GCGCCGACCT CGCTCGCGCT GAAACGGCAT
TTGCGCGACC ACGGGCTCGC CGCGTTCAAG ATTCCCGATC GCATCGAATT CGTGCCGAGC
TTTCCGGAAA CCGGCATCGG CAAGACCAGC AAGAAATCGC TGCGCGACCT GCTGCGCCGC
CGGCTCGAAG CGGCGCGCGC ATGA
 
Protein sequence
MPAAARNAAS SDAPDWPDDF ARRYRDAGYW RDETFYGALA ERARRSPTAT AVVDGERRVS 
YRALLERIRR LAGGLRRLGL ARGDTAVVHL PNGVRFIEAC FALFQLGVRP VLALPAHRQH
EIGAFCRFTN ARAYLGAARL GEFDCRPLAH ALRASCPALA HIVVAGDDHA FLHFDSLYDT
PPVADCAARA DDIACFQLSG GTTGTPKLIP RRHREYLYNV RACSDASGFG ADTVYLAALP
MAHNFTLCCP GVIGALLAGG RVVATEHPEP ECGFALIARE RVTHTALVPP LALLWLDAQR
ERRADLSSLR VLQVGGARLM DHAAERVTPV LGCRLQQVFG MAEGLICCTR LDDPPARIAR
TQGRPVSPAD EVRIVDEAGR AVAPGEIGEL QVRGPYTIRG YYRLAEHHAA AFTADGFYRT
GDRVRRTEEG DLVVEGRDKD QINRGGEKVS AEEVENLLLA HSQIRDAALV AMSDPLLGER
TCAFVVARPP APTSLALKRH LRDHGLAAFK IPDRIEFVPS FPETGIGKTS KKSLRDLLRR
RLEAARA