Gene BURPS1106A_4014 details


Gene Information

Locus tag: BURPS1106A_4014
Symbol:
ID: 4900948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3918170
End bp: 3919348
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 640137240
Product: putative acyltransferase
Protein accession: YP_001068233
Protein GI: 126452192
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3594] Fucose 4-O-acetylase and related acetyltransferases
TIGRFAM ID:
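
The coordinate and length fields in this record agree with one another, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function and variable names are illustrative and are not part of the database record.

```python
# Values copied from the record above.
START_BP = 3918170
END_BP = 3919348


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

Under these assumptions the computed values match the reported Gene Length (1179 bp) and Protein Length (392 aa).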


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGACT CCTATCTCGA TACCGCTCGG GGCGCGGGCA TCATCCTCGT CGTCTACGGC 
CATGTGCTGC GCGGGCTGTT CTCGGCCGGC CTCGTGCCGG CCGGCTGGCC AAGCGCGTTG
CTCGCCGCGA CCGACTACAC GATCTACACG TTCCACATGC CGCTCTTCTT CCTGCTGTCG
GGGCTGCACG TGCCGAAATC GCTGCGGCGC GCGGGCGATG TCTTCCTGCT CACGAAGCTG
CGCACGATCG TCTACCCCTA TTTCGTATGG TCGCTGCTGC AAGGCGCGGT GCAGATCGCG
CTGTCGGCAC GCGGCACGAA TCACGCATTC ACGCCGAACG ATCTGCTCGC GATCGGCTGG
CGGCCGTTCG GGCAATTCTG GTTCCTGTAC GCGCTGTTCA TCTGCATGCT GATTGCATGG
GCCGTGTCGG CGATCACGCT GCGCGCGCGA ACGCACGGTG GTGCGAGCGG TGCAAACGGC
GAGAACGGCG GGACCGAGGT GAGCGGCGAG AACGGCGAGC CATCGGGCGC GCCGTGCGCC
GTTCCCGCGA TGCCCATCGT GCTCGTCTTG CTCGCGATCG GCGGGCTCGC CGCCGTCGCG
TTCGTCGCCG GTTCCGCGAC GCGGTGGGGC ATCGTGTCGA TGACGCTCGC GTACTTCCCG
TTCTTCGTGC TCGGGATGCT GATCGGCGAA CGGTTGCCCG CGTTCCTCGA ACGCGTGTCG
AGCGGGCCCG CGCTCGTCGC CGTCGCGGCA ACGTTCGCCG CTTCCGTTGC GTTCGCGCAC
CGCTTCGGCG AATCGGACAG CATCTGGGCG CTCCCCGCCG CGCTGTCGGG CAGCGCGCTC
GTGCTGCTCG TCGCGCACCG CGCGGCACGG CGTGGCGACG CGGCGCGGCA CGCGCCTCGC
GCGTCGTGGC TCGAATACCT CGGTTTCGCG TCGATGCCGA TCTATCTCGC GCACATTCTC
GCGACGGCCG CGACGCGCAT CGCGCTCGTC ACGCTCGGCA TCGTCGATGT CGGCGCACAG
CTCGCGCTCG GCACGTTCGC GGGCGTTCTC GGGCCAACCC TGCTGTACGC GCTCGCGCTG
CGCGCCGGAA CCGCGCGGCT CGCCGGTTTC CCGCCGCTGC CGGCCGGCTA CGCGATGACG
CCCGACAAAG GCCGGATCGG CCGAGCCGAT GCGGCCTGA
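
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before a figure such as the GC content can be recomputed. The sketch below shows one way to do that in Python; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is included to keep the example short.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First row of the gene sequence above; paste all rows to evaluate the whole gene.
FIRST_ROW = "ATGCGGGACT CCTATCTCGA TACCGCTCGG GGCGCGGGCA TCATCCTCGT CGTCTACGGC"

row = clean_sequence(FIRST_ROW)
print(len(row), round(gc_content(row), 1))
```

Applied to the full 1179 bp sequence, the same function gives a percentage that can be compared with the reported GC content.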
 
Protein sequence
MRDSYLDTAR GAGIILVVYG HVLRGLFSAG LVPAGWPSAL LAATDYTIYT FHMPLFFLLS 
GLHVPKSLRR AGDVFLLTKL RTIVYPYFVW SLLQGAVQIA LSARGTNHAF TPNDLLAIGW
RPFGQFWFLY ALFICMLIAW AVSAITLRAR THGGASGANG ENGGTEVSGE NGEPSGAPCA
VPAMPIVLVL LAIGGLAAVA FVAGSATRWG IVSMTLAYFP FFVLGMLIGE RLPAFLERVS
SGPALVAVAA TFAASVAFAH RFGESDSIWA LPAALSGSAL VLLVAHRAAR RGDAARHAPR
ASWLEYLGFA SMPIYLAHIL ATAATRIALV TLGIVDVGAQ LALGTFAGVL GPTLLYALAL
RAGTARLAGF PPLPAGYAMT PDKGRIGRAD AA
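
The protein sequence follows from the gene sequence via translation table 11 (the bacterial, archaeal and plant plastid code). As a hedged illustration, the Python sketch below builds a codon table in the usual TCAG order and translates a blocked DNA string. Table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows, and this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces and newlines allowed) up to the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGCGGGACT CCTATCTCGA TACCGCTCGG GGCGCGGGCA TCATCCTCGT CGTCTACGGC"
print(translate(first_row))  # MRDSYLDTARGAGIILVVYG
```

The printed peptide corresponds to the first 20 residues of the protein sequence above.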