Gene BURPS1710b_A2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2033 
Symbol 
ID3691988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2472907 
End bp2474532 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content71% 
IMG OID637732287 
Product2-aminobenzoate-CoA ligase 
Protein accessionYP_337184 
Protein GI76817725 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.951217 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAACT TTTGCCGCAC GAATCTGCCT GCACCGACCG ATCTGCCGGA GTTCGTCTTC 
GAGTTGCCGG GCCTGCAGTA TCCGGCGCGC ATCAACTGCG CGGCGGCGCT GCTCGACGAC
GCGGTGACCC GCCGGGGCTG GGGCGAGCGC GTCGCGATCA GGACCGAGTC CGGTGCCGCC
TGGTCGTATC GCGCGCTGTT CGAGCTGAGC AACCGGATCG CCAACCTGCT GGTGCGCGAC
GGCGGGCTCG TGCCGGGCAA CCGGGTGCTG CTGCACGGAA CCAATCATCC GTTTCTCGCC
GCCGCATGGT TCGCGATCGT CAAGGCGGGC GGCGTCGTGG TGACGACGAT GCCGCTGCTG
CGCGCGGGCG AGCTGTCGAA AGTCATCGCG CAGGCGCAGG TCACGCACGC GCTGTGCGAG
GCGGCGGTGT CCGCCGAGTT GCGCGCCGCG ATGGCGGCGG CGCCGGGCGT CGCGTTCGTG
CGGTACTACG AGACCGACGA CGCGGCCGCG TTCGAGCCGC TGCTGCACGC GTGCCCGCGC
ACGTTCGAGC CGGTCGATAC GCGCGCCGAC GAGCCGTGCA TCGTCGCGTT CACGTCGGGC
ACGACGGGGC GCCCGAAGGC GACCGTGCAT TTTCATCGCG ACGTGATGGC GATCTGCCAT
TGCTTTCCGC AGCACGTGCT GAAGCCGAAC GCCGACGACG TGTTCTGCGG CTCGCCGCCG
CTCGCGTTCA CGTTCGGGCT CGGCGCGCTG CTGCTGTTTC CGCTGAGCGT CGGCGCGAGC
GTCGTGCTGC TGCAGCGGGC GAAGCCGCAG CGGCTGCTCG CCGCGATCGG CGCGCATCGC
GTGAGCATCC TCTTCACCGC GCCGGCCGCG TATCGCGCGA TGCTCGACGA GCTCGGCGCG
CACGACATCG CCAGCCTGCG CAAGTGCGTG TGCGCGGGCG AGGCGCTGCC GGCGCCGACG
CGCAACGCGT GGCTCGCGCG CACGGGCATT CGCATCATCG ACGGCATCGG CGCGACCGAG
ATGCTGCACA TCTTCGCGTC CGCGGACGAA ACGCAGGCGA AGGAAGGCGC GATCGGCAAG
GCGGTGCCCG GCTACCGGCT CGCGATCCTC GACGAGCGCG GCGAGCGCCT GCCGCCGTAT
CACGTCGGCC GTCTCGCGGT GCAGGGGCCG ACCGGCTGCC GCTACCTGAA CGATGCGCGG
CAGCGCGATT ACGTGCGGCA CGGCTGGAAC CTGACGGGCG ACGCCGCCTA CCTCGACGAG
GACGGCTACC TGTTCTACCA GTCGCGCGCC GACGACCTGA TCATCAGCCT CGGCTACACC
ATCTCGCCCG CCGAGGTGGA GGAGGCGCTG CTGAGCCACG CGGACGTGCT CGAGTGCGGT
GTTGTCGGCG CGCCCGACGG GCGAGGCGGC ACGCTCGTGT GCGCGCACGT GGTGCCGCGG
CCCGGCGTGC ACGGCTGCGA TGCGCTGACG GCCGCGTTGC AGCAGCACGT GAAGGCGCGG
ATCGCGCCGT ACAAGTATCC GCGGCGCATC GAGTATCACG CGGCCGGGCT GCCGCGCAAC
GATTCCGGCA AGCTGCAGCG CTTCAAGCTG CGGCAGGCGG CCGAGGAAGA CGTGCAGGCG
GCCTGA
 
Protein sequence
MDNFCRTNLP APTDLPEFVF ELPGLQYPAR INCAAALLDD AVTRRGWGER VAIRTESGAA 
WSYRALFELS NRIANLLVRD GGLVPGNRVL LHGTNHPFLA AAWFAIVKAG GVVVTTMPLL
RAGELSKVIA QAQVTHALCE AAVSAELRAA MAAAPGVAFV RYYETDDAAA FEPLLHACPR
TFEPVDTRAD EPCIVAFTSG TTGRPKATVH FHRDVMAICH CFPQHVLKPN ADDVFCGSPP
LAFTFGLGAL LLFPLSVGAS VVLLQRAKPQ RLLAAIGAHR VSILFTAPAA YRAMLDELGA
HDIASLRKCV CAGEALPAPT RNAWLARTGI RIIDGIGATE MLHIFASADE TQAKEGAIGK
AVPGYRLAIL DERGERLPPY HVGRLAVQGP TGCRYLNDAR QRDYVRHGWN LTGDAAYLDE
DGYLFYQSRA DDLIISLGYT ISPAEVEEAL LSHADVLECG VVGAPDGRGG TLVCAHVVPR
PGVHGCDALT AALQQHVKAR IAPYKYPRRI EYHAAGLPRN DSGKLQRFKL RQAAEEDVQA
A