Gene BURPS1106A_2579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2579 
Symbol 
ID4901111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2541654 
End bp2543492 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content73% 
IMG OID640135806 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001066833 
Protein GI126452346 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0745582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTTC ACGACATGAC TCACCCGGGC GAGCTCGCAC CGGTGCGAGC GGACCGATTC 
GGCCGGCGGT TCGCGACGGC GGCCGCGGCC GAGATCGCCG CCGCCGATTC GCTGACGGCG
CGATTCGCCG ACGTGGCCGC GCGCCGGGGC GATGCGCTCG CGGTGACGTG CGGCGACGAA
CGCTGGACGT ACGGCGCGCT TGCGGCACGC GCGCGCCGCA TCGCCGAAGC GGTTCGCGCG
GCGGGCGAGG CGGGCGGCGA ACCGGTCGCG CTCCTCTATC CGCACGGCGC GCCGATGATC
GCCGCGATGT TCGGCGTGCT CGGCGCCGGC AAGTTCTACG TCCCGCTGAT CGCCGACCAT
CCGCTGCCGC ACCTGCAATC GATCGTGCGC GAGTGCGGCT GCCGGCTCGT GCTCGCCGCG
CCCGAGCTCG CCGAGACGGC CGCGCGCCTC GGCGTCTCGG CGCGCGTCAT CGACGACGCG
CGCCTGCCGC CGGCGCACGG CCCGTTCGAC GCGCGCGGCG GCGACGCGGT TTCGTATCTG
CTGTTCACGT CGGGCACGAC CGGCGTGCCG AAGGGCGTGA TGCAGTGCGA CCGCAATGTG
CTGCACCACG CCGCGTGCTA CGCGGCGTCG ATCGGCCTTG ACGACGACGA CCGGATGACG
CTGCTGCCGT ACTACGGCTT CGACGCGTCG GTGATGGACA TCTTCGCCAC GCTGCTGACG
GGCGCGAGCC TGCATCTGTG GGACGTGCGC GAGCGCGGCG TCGACGGCAT CGGTGAATGG
CTCGCGCGCG AGCGCATGAC GATCTGGCAT TCGACGCCGA GCGTGCTGCG CGCCACGTTT
GCCGCCTTCG CGCGGCCGGC CGCGCTGCGC TGGGTCGTGC TGGGCGGCGA GGCCGCGACG
GGCGGCGACG TCGCGCTCGT CGCGCGGCAC GGCGGCCCGC GATGCCGGCT GCTCAACGGC
CTGGGGCCGA CCGAGTGCAC GACGGCGCTG CAATACGTCG CCGATCCCGC GGCCGATGCG
AGTGTCGCGC GTCTGCCCGT CGGCCGGCCG GTGCCGGGCG TCGAGGTCGA GCTCGCCGAT
GCGCGGGGCG AAGCCTGCGC GACCGAGGGC GAACTCGTGA TCGTCAGCCC CTTCGTCGCG
CTCGGGTACT GGGGTCGCGC GGAACTGAGC GCCGAGCGCT TTCGCCAGAC GGCGCGCCCG
GACGGCGCGC GGCGGTATCG CACGGGCGAT CTGCTGCGGA TCGACGCGCG CGGCTGCTAC
GAACATCTGA CGCGGGTCGA CGATCAGATC AAGATTCGCG GCCTGCGCGT CGAACTCGGC
GAAATCCAGG CGACGCTCGC CGCGCATGAC GACGTGCTTC AGGCGGTCGT GCTGCCGCGC
CTTGACGAAG CCACGCAGCA GCAGACGATC GTCGCGTACG TCGTGCCGCG CGCGGCGTCG
GCCGACGTCG CGGCGCTGCG CGAATACGTC GCGAGCCGTT TGCCCGCGCA CATGGTGCCG
CGCGCGATCG TTCGCGTCGA TGCGATGCCG TTGCTGCCGA ACGGCAAGCT GAATCGCCGC
GCGCTGCCGG CGCCGCCGCG CGCGGAAGTG GCGGCGGGCG AGCGCAAGGC GCCGCGCACG
CCGTTGCACC GGCTGCTCGC CGCATGCTGG GCCGACGTGC TGCGGCGCGA CGCGGTGGGC
ATCGACGAGA ACTTCTTCGA ACTCGGCGGC GATTCGCTGC TCGGCGCGCA ACTGCTGTCG
CGCGTGAAGC GCGACCTCGA GCTCGACGCG CGGCTCGGCG ACCTGTTTCG CCATCCGACA
GTCGAGTCGC TGGCCGAGCA TCTGCTGGCT TGCCGATGA
 
Protein sequence
MSVHDMTHPG ELAPVRADRF GRRFATAAAA EIAAADSLTA RFADVAARRG DALAVTCGDE 
RWTYGALAAR ARRIAEAVRA AGEAGGEPVA LLYPHGAPMI AAMFGVLGAG KFYVPLIADH
PLPHLQSIVR ECGCRLVLAA PELAETAARL GVSARVIDDA RLPPAHGPFD ARGGDAVSYL
LFTSGTTGVP KGVMQCDRNV LHHAACYAAS IGLDDDDRMT LLPYYGFDAS VMDIFATLLT
GASLHLWDVR ERGVDGIGEW LARERMTIWH STPSVLRATF AAFARPAALR WVVLGGEAAT
GGDVALVARH GGPRCRLLNG LGPTECTTAL QYVADPAADA SVARLPVGRP VPGVEVELAD
ARGEACATEG ELVIVSPFVA LGYWGRAELS AERFRQTARP DGARRYRTGD LLRIDARGCY
EHLTRVDDQI KIRGLRVELG EIQATLAAHD DVLQAVVLPR LDEATQQQTI VAYVVPRAAS
ADVAALREYV ASRLPAHMVP RAIVRVDAMP LLPNGKLNRR ALPAPPRAEV AAGERKAPRT
PLHRLLAACW ADVLRRDAVG IDENFFELGG DSLLGAQLLS RVKRDLELDA RLGDLFRHPT
VESLAEHLLA CR