Gene BURPS1106A_2335 details

Gene Information

Locus tag: BURPS1106A_2335
Symbol:
ID: 4900827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2309661
End bp: 2311301
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 68%
IMG OID: 640135564
Product: long-chain-fatty-acid--CoA ligase
Protein accession: YP_001066599
Protein GI: 126451552
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
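
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with each other if the coordinates are read as inclusive and 1-based, and if the coding sequence ends in a single stop codon that is not counted in the protein. The following is a minimal Python sketch of that arithmetic under those assumptions; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2309661, 2311301)
print(length, protein_length(length))  # 1641 546
```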


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAAGC CGTTGCTGGG CCAGATGATG GATATGCCGC TGCTCGTGTC GTCGCTGATC 
GCGCATGCGG CGCGACACGC GGGTGACGTC GAGATCGTGT CCCGGCGCGT CGAGGGCGAC
ATTCACCGCT ATACCTATCG CGATTGCGAA GCGCGTTCCA AACGGCTCGC GCAGGCGCTG
ACCGGCCTGG GCGTGGGCGT GGGCGAGCGC ATTGGAACGC TCGCGTGGAA CGGCTACCGG
CACGTCGAGG CGTATTACGG CATCAGCGGG ATGGGCGCCG TCTGCCATAC GATCAACCCG
CGCCTCTTTC CCGAGCAGAT CGCGTACATC GTCAACCACG CCGAGGATCG ATACGTGCTG
TTCGACCTCA CCTTCGCGCC GCTCGTCGAT CAGCTCGCGC CGCAATGCCC GAATGTGAAG
GGCTGGATCG CGATGACGGA CGACGCGCAT CTGCCGAAGG GCGCGACGCC TTATCTCTGT
TACGAGACGC TCGTCGGCGC GCAGGACGGC GATTACGCCT GGCCGCTGCT CGACGAGCGG
CAGGCGTCGT CGCTCTGCTA CACGTCCGGC ACGACGGGCC ACCCGAAGGG CGCGCTCTAT
TCGCACCGCT CGACGGTGCT GCACGCGTAC GGCGCGGCGC TGCCCGACGC GATGGGGCTG
TCCTCGCGCG ACGCGGCGCT GCCCGTCGTG CCGATGTTCC ACGTCAACGC ATGGGGGCTG
CCGTACACGG CGGCGCTCAC CGGCACGAAG CTCGTGCTGC CCGGCAAGGA CCTCGACGGC
AAATCGCTGT ACGAGCTGAT CGAAAGCGAG CGCGTGACGT TCTCGGCGGG CGTGCCCACC
GTCTGGCTCG GTCTGCTCGC GTACATGCGC GAGGCCGGCG TGCGCTTCTC GACGCTCGAG
CGCACGGTGA TCGGCGGCTC CGCGTGCCCG CCTTCGATGC TCGAGACGTT CGAGGACGTC
TACGACGTGC GTGTGATCCA TGCATGGGGG ATGACCGAGC TGTCGCCGCT CGGCACGCTG
GCCAAGCTCA ATTGGGCGCA GTCGCAGCGC GGCATCGGCG AGCAGCGGCG GCTCCTCGAG
AAGCAGGGCC GGGTGATCTA CGGGATCGAC ATGCGCATCG TCGGCGAGGA CGGCCGCGAG
CTGCCGTGGG ACGGCGTCGC GTTCGGCGAC TTGCAGGTGC GCGGGCCGTG GGTGATCGAC
CGCTATTTCG GAATCGACGC GTCGCCGCTC GTCGACGGCT GGTTCCCGAC GGGCGACGTC
GCGACGATCG ACGCCGACGG CTTCCTGCAG ATCACCGATC GCAGCAAGGA CGTGATCAAG
TCGGGCGGCG AATGGATCAG CTCGATCGAC GTCGAGAACG TCGCGGTGGC GCACCCGGCC
GTCGCGGAGG CCGCGTGCAT CGCATGCGCG CATCCGAAAT GGACCGAGCG GCCGCTCCTC
GTCGTTGTCA AGCGCGCCGG CATGGACGTG ACGCGCGACG AACTGCTCGC GTTCTACGAG
GGCAAGGTCG CGAAATGGTG GATTCCGGAC GATGTCGCGT TCGTCGACGC GCTGCCGCAC
ACCGCGACGG GCAAGCTGCA AAAGCTCAAG CTGCGCGAGC AATTCCGGGG CCACGTGCTG
CCGACGGCCG TGGACGCGTA G
 
Protein sequence
MGKPLLGQMM DMPLLVSSLI AHAARHAGDV EIVSRRVEGD IHRYTYRDCE ARSKRLAQAL 
TGLGVGVGER IGTLAWNGYR HVEAYYGISG MGAVCHTINP RLFPEQIAYI VNHAEDRYVL
FDLTFAPLVD QLAPQCPNVK GWIAMTDDAH LPKGATPYLC YETLVGAQDG DYAWPLLDER
QASSLCYTSG TTGHPKGALY SHRSTVLHAY GAALPDAMGL SSRDAALPVV PMFHVNAWGL
PYTAALTGTK LVLPGKDLDG KSLYELIESE RVTFSAGVPT VWLGLLAYMR EAGVRFSTLE
RTVIGGSACP PSMLETFEDV YDVRVIHAWG MTELSPLGTL AKLNWAQSQR GIGEQRRLLE
KQGRVIYGID MRIVGEDGRE LPWDGVAFGD LQVRGPWVID RYFGIDASPL VDGWFPTGDV
ATIDADGFLQ ITDRSKDVIK SGGEWISSID VENVAVAHPA VAEAACIACA HPKWTERPLL
VVVKRAGMDV TRDELLAFYE GKVAKWWIPD DVAFVDALPH TATGKLQKLK LREQFRGHVL
PTAVDA
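
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch, assuming the blocks contain only sequence letters and whitespace; `clean_sequence` and `gc_fraction` are hypothetical helper names, not functions of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# The first two ten-base blocks of the gene sequence, used only as a short example.
fragment = clean_sequence("ATGGGTAAGC CGTTGCTGGG")
print(len(fragment), gc_fraction(fragment))  # 20 0.6
```

Applied to the full gene sequence, the cleaned length and the GC fraction can be compared with the Gene Length and GC content fields of the record.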