Gene BURPS1106A_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_4040 
Symbol 
ID4901015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3945825 
End bp3947555 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content67% 
IMG OID640137266 
ProductAMP-binding domain protein 
Protein accessionYP_001068259 
Protein GI126453323 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCAG ATTCAGGCAT GGGCGCGCTG ATCGCGCCGA TGCACGGGCT TTCGTACGTA 
CGCGGCCCGG CGGATGTCCC ATTGAGCGAC GCGACGATCG GGCAGTTCCT GCTCGATACG
GTCGCCCGCT TTCCCGATCG CGCGGCGGTC GTGTTCCGCG AGCAGGGCGT GCGCTGGACG
TGGCGCGAGT TCGCGGACGA AGTCGACGCG CTCGCCGCCG CGCTGATCGA GCTCGGCATC
GCGCGCGGTG ACCGGGTCGG CATCTGGTCG CCGAACCGCG CCGAATGGCT GCTCACGCAA
TTTGCAACCG CGCGTATCGG CGCGGTGCTC GTCAATGTCA ATCCCGCCTA TCGGCTGGCC
GAGCTCGAAT ACGCGCTGAA CAAGGTCGGC TGCAAGCTGT TGATCGCCGC GGAGCGCTTC
AAGACGTCCG CGTACGCGGA GATGATCGCG GAAATCGCGC CGGAGCTCGC GACGACGCGC
GCGGGCGACG TGCTGTGCGC CGCGCGCGTG CCGAGTTTGC GCACGGTCGT GACGATGAGC
GATGCGGCGC ACGCGGGCAT GCTGAGCTTC GCGGACGTGC TCGCGCGCGG GCGGGCGGCG
CTCGCTTCCG CGCGGCTCGA CGCGATCGGC GCGACGCTCG ATTGCCGCGA TCCGATCAAC
ATCCAGTTCA CGAGCGGCAC GACGGGCAGC CCGAAGGGCG CGACGCTCAC GCACCGCAAC
GTCGTCAACA ACGCGCGCTC GATCGCGAAC GTGATGCGGC TGACCGAGGC CGATGCGATG
TGCATTCCGG TGCCGCTCTA TCACTGCTTC GGGATGGTGC TGTCGGTGCT CGCGTGCGTA
TCGGCGGGCG CGAAGATGGT GTTTCCCGGC GCGGCGTTCG AGCCGGGTGC GACGCTCGCG
GCGGTGTCCG ACGAGCGCTG CACCGCGCTG CAGGGCGTGC CGACGATGTT CATCGCCGAG
CTCGATCATC CGGATTTCGA CCGTTTCGAC CTGAGCACGC TGCGCACGGG CATCATGGCG
GGTTCGCCGT GCCCGATCGA GACGATGAAG CGCGTGGTCG CGAAGATGCA CATGTCCGAG
GTGACGATCG CCTACGGGAT GACGGAGACG AGCCCCGTGT CGTTCCAGAG CGCGACGACG
GATTCGCTCG AGAAGCGCAC GACGACGGTC GGCCGGATCC AGCCGCATCT GGAGGCGAAG
ATCGTCGACG CGACGGGCGC GATCGTGCCC GTCGGCGAGA CGGGCGAGCT GTGCACGCGC
GGCTATTCGG TGATGCTCGG CTATTGGGAC GACGAGGCCA GAACGCGCGA GGCGGTGGTC
GATGGCTGGA TGCGCACGGG CGACCTCGCG ACGCTCGACG AAGAAGGCTT TTGCAACATC
GTCGGGCGCC TGAAGGACAT GCTGATTCGC GGCGGCGAGA ACGTGTACCC GCGCGAGATC
GAGGAGTTTC TGTTCCGGCA TCCGAAGATC CAGAGCGTGC AGGTGTTCGG CGTGCCCGAT
TCGAAGTACG GCGAGGAAGT ATGCGCGTGG ATCGTGCTGC GCGCGGGCGA GACGATGACG
GACGACGAGC TGCGCGAGTT CTGCAGCGGC CAGATCGCGC ACTACAAGGT GCCGCGCTAC
GTGCGCTTCG TCGACGAACT GCCGATGACC GTGACGGGGA AGGTGCAGAA GTTCGTGATG
CGCGAACGAA TGATCGACGA ACTTGGTTTG AGCGTGCAGC AGACGGCTTG A
 
Protein sequence
MAADSGMGAL IAPMHGLSYV RGPADVPLSD ATIGQFLLDT VARFPDRAAV VFREQGVRWT 
WREFADEVDA LAAALIELGI ARGDRVGIWS PNRAEWLLTQ FATARIGAVL VNVNPAYRLA
ELEYALNKVG CKLLIAAERF KTSAYAEMIA EIAPELATTR AGDVLCAARV PSLRTVVTMS
DAAHAGMLSF ADVLARGRAA LASARLDAIG ATLDCRDPIN IQFTSGTTGS PKGATLTHRN
VVNNARSIAN VMRLTEADAM CIPVPLYHCF GMVLSVLACV SAGAKMVFPG AAFEPGATLA
AVSDERCTAL QGVPTMFIAE LDHPDFDRFD LSTLRTGIMA GSPCPIETMK RVVAKMHMSE
VTIAYGMTET SPVSFQSATT DSLEKRTTTV GRIQPHLEAK IVDATGAIVP VGETGELCTR
GYSVMLGYWD DEARTREAVV DGWMRTGDLA TLDEEGFCNI VGRLKDMLIR GGENVYPREI
EEFLFRHPKI QSVQVFGVPD SKYGEEVCAW IVLRAGETMT DDELREFCSG QIAHYKVPRY
VRFVDELPMT VTGKVQKFVM RERMIDELGL SVQQTA