Gene BURPS668_2296 details

Gene Information

Locus tag: BURPS668_2296
Symbol:
ID: 4884916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2276000
End bp: 2277640
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 68%
IMG OID: 640128224
Product: long-chain-fatty-acid--CoA ligase
Protein accession: YP_001059331
Protein GI: 126441386
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
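
The length fields above agree with the coordinates if Start bp and End bp are 1-based and inclusive, and if the final codon of the CDS is a stop codon. Both points are assumptions, since the record does not state them. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2276000, 2277640)
print(length, protein_length(length))  # 1641 546
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.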


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAAGC CGTTGCTGGG CCAGATGATG GATATGCCGC TGCTCGTGTC GTCGCTGATC 
GCGCATGCGG CGCGGCACGC GGGTGACGTC GAGATCGTGT CCCGGCGCGT CGAGGGCGAC
ATTCACCGCT ATACCTATCG CGATTGCGAA GCGCGTTCCA AACGGCTCGC GCAGGCGCTG
ACCGGCCTGG GCGTGGGCGT GGGCGAGCGC ATTGGAACGC TCGCGTGGAA CGGCTACCGG
CACGTCGAGG CGTATTACGG CATCAGCGGG ATGGGCGCCG TCTGCCATAC GATCAACCCG
CGCCTCTTTC CCGAGCAGAT CGCGTACATC GTCAACCACG CCGAGGATCG ATACGTGCTG
TTCGACCTCA CCTTCGCGCC GCTCGTCGAT CAGCTCGCGC CGCAATGCCC GAACGTGAAG
GGCTGGATCG CGATGACGGA CGACGCGCAT CTGCCGAAGG GCGCGACGCC TTATCTCTGT
TACGAGACAC TCGTCGGCGC GCAGGACGGC GATTACGCCT GGCCGCTGCT CGACGAGCGG
CAGGCGTCGT CGCTCTGCTA CACGTCCGGC ACGACGGGCC ACCCGAAGGG CGCGCTCTAT
TCGCACCGCT CGACGGTGCT GCACGCGTAC GGCGCGGCGC TGCCCGACGC GATGGGGCTG
TCCTCGCGCG ACGCGGCGCT GCCCGTCGTG CCGATGTTCC ACGTCAACGC ATGGGGGCTG
CCGTACACGG CGGCGCTCAC CGGCACGAAG CTCGTGCTGC CCGGCAAGGA CCTCGACGGC
AAATCGCTGT ACGAGCTGAT CGAAAGCGAG CGCGTGACGT TCTCGGCGGG CGTGCCCACC
GTCTGGCTCG GTCTGCTCGG GTACATGCGC GAGGCCGGCG TGCGCTTCTC GACGCTCGAG
CGCACGGTGA TCGGCGGCTC CGCGTGCCCG CCTTCGATGC TCGAGACGTT CGAGGACGTC
TACGACGTGC GTGTGATCCA TGCATGGGGG ATGACCGAGC TGTCGCCGCT CGGCACGCTG
GCCAAGCTCA ATTGGGCGCA GTCGCAGCGC GGCATCGGCG AGCAGCGGCG GCTCCTCGAG
AAGCAGGGCC GGGTGATCTA CGGGATCGAT ATGCGCATCG TCGGCGAGGA CGGCCGCGAG
TTGCCGTGGG ACGGCGTCGC GTTCGGCGAC TTGCAGGTGC GCGGGCCGTG GGTGATCGAC
CGCTATTTCG GAATCGACGC GTCGCCGCTC GTCGACGGCT GGTTCCCGAC GGGCGACGTC
GCGACGATCG ACGCCGACGG CTTCCTGCAG ATCACCGATC GCAGCAAGGA CGTGATCAAG
TCGGGCGGCG AATGGATCAG CTCGATCGAC GTCGAGAACG TCGCGGTGGC GCACCCGGCC
GTCGCGGAGG CCGCGTGCAT CGCATGCGCG CATCCGAAAT GGACCGAGCG GCCGCTCCTC
GTCGTCGTCA AGCGCGCCGG CATGGACGTG ACGCGCGACG AACTGCTCGC GTTCTACGAG
GGCAAGGTCG CGAAATGGTG GATTCCGGAC GATGTCGCGT TCGTCGACGC GCTGCCGCAC
ACCGCGACGG GCAAGCTGCA AAAGCTCAAG CTGCGCGAGC AATTCCGGGG CCACGTGCTG
CCGACGGCCG TGGACGCGTA G
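
The GC content field can be rechecked from the gene sequence once the spaces between 10-base groups and the line breaks are stripped. The sketch below assumes GC content means the percentage of G and C bases over the total length; the record does not state its definition or rounding, and the helper names are illustrative.

```python
def clean_sequence(block: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the bases."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as displayed in the record.
first_row = "ATGGGTAAGC CGTTGCTGGG CCAGATGATG GATATGCCGC TGCTCGTGTC GTCGCTGATC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence block to `clean_sequence` and then `gc_percent` gives a value to compare against the GC content field.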
 
Protein sequence
MGKPLLGQMM DMPLLVSSLI AHAARHAGDV EIVSRRVEGD IHRYTYRDCE ARSKRLAQAL 
TGLGVGVGER IGTLAWNGYR HVEAYYGISG MGAVCHTINP RLFPEQIAYI VNHAEDRYVL
FDLTFAPLVD QLAPQCPNVK GWIAMTDDAH LPKGATPYLC YETLVGAQDG DYAWPLLDER
QASSLCYTSG TTGHPKGALY SHRSTVLHAY GAALPDAMGL SSRDAALPVV PMFHVNAWGL
PYTAALTGTK LVLPGKDLDG KSLYELIESE RVTFSAGVPT VWLGLLGYMR EAGVRFSTLE
RTVIGGSACP PSMLETFEDV YDVRVIHAWG MTELSPLGTL AKLNWAQSQR GIGEQRRLLE
KQGRVIYGID MRIVGEDGRE LPWDGVAFGD LQVRGPWVID RYFGIDASPL VDGWFPTGDV
ATIDADGFLQ ITDRSKDVIK SGGEWISSID VENVAVAHPA VAEAACIACA HPKWTERPLL
VVVKRAGMDV TRDELLAFYE GKVAKWWIPD DVAFVDALPH TATGKLQKLK LREQFRGHVL
PTAVDA
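
The protein sequence should follow from the gene sequence under translation table 11. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G) and assumes that table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not remapped to M here;
    this gene starts with ATG, so that case does not arise.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence; the trailing "GG" is ignored.
print(translate("ATGGGTAAGC CGTTGCTGGG"))  # MGKPLL
```

The first six residues match the start of the listed protein sequence, and the last three codons of the gene (GAC GCG TAG) give D, A and stop, matching its final two residues.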