Gene BURPS1106A_A1821 details


Gene Information

Locus tag: BURPS1106A_A1821
Symbol:
ID: 4905007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1788072
End bp: 1789061
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 69%
IMG OID: 640144927
Product: putative serine O-acetyltransferase
Protein accession: YP_001075855
Protein GI: 126457462
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1045] Serine acetyltransferase
TIGRFAM ID: [TIGR01172] serine O-acetyltransferase
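
The coordinate and length fields in this record agree with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1788072
end_bp = 1789061

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values match the listed Gene Length (990 bp) and Protein Length (329 aa).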


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGGTAT TCGACATCGA CGACATCGTT CAATCGCTTC AAACCGTGCG CCAGCGTTGG 
CGCGAAGTGC AGCGCCGCTC GCTCGAGCCG GGCGGGCGCG AATTGCCGGC GCGCGAGGCG
CTTGCCGGCA TCGTCGAGAC GTTCAAGGGC GTGCTGTTTC CGATGCGCCT CGGGCCGCCC
GATCTTCGGC AGGAGAGTGA AAATTTCTAT GTGAGCCACG CGCTCGACGA CGCATTGCAT
GCGCTTCTCG CGCAGGCTCG GCTCGAATTG CGCTACAAGG GCCGACACGA TGCCGCCGCG
CCCGCCGAAG CCGCGATCGA CGCGAAAGCC GATGCGGCGG TGCGCGCGTT CGCCGCGCGC
CTGCCCGATA TCCGCGCGCT GCTCGACAGC GACGTGCTGG CCGCGTTTCA CGGCGATCCG
GCCGCGGGCA GCGTCGACGA GGTGCTGCTT TGCTACCCCG GCGTGCTGGC GATGATCCAT
CACCGGCTCG CGCACGCGCT GTATCGCCTC GAATTGCCGC TGCTCGCGCG CATCGTCGCC
GAGCATGCGC ATGCGCAGAC GGGGATCGAC ATTCATCCCG GCGCGCAGAT CGGCGGCGGA
TTCTTCATCG ATCACGGCAC GGGCGTCGTG ATCGGCGAGA CCGCGGTGAT CGGCGAGCGC
GTGCGCGTCT ATCAGGCGGT CACGCTCGGC GCGAAGCGCT TTCCGAGGGA CGCGTCCGGG
CATCTCGAAA AGGGACTCGC GCGCCACCCG ATCGTCGAGG ACGATGTCGT CGTCTATGCG
GGCGCGACGA TTCTCGGCCG CGTGACGATC GGCAAGGGCG CGGTGATCGG CGGCAACGTG
TGGATCACGC AGGACATCCC GCCCGGCAGC CATGTCACGC AAGCCGTCAC GCGCAGCGAT
CCGGCGCGGC CGGCCGACGC GGCCGCCTCG TCGCCGCGGC CGGCCGGCGC GCACGACGCG
ACGCTTTCTG CCGCGCAGGC GCTGCGATGA
 
Protein sequence
MAVFDIDDIV QSLQTVRQRW REVQRRSLEP GGRELPAREA LAGIVETFKG VLFPMRLGPP 
DLRQESENFY VSHALDDALH ALLAQARLEL RYKGRHDAAA PAEAAIDAKA DAAVRAFAAR
LPDIRALLDS DVLAAFHGDP AAGSVDEVLL CYPGVLAMIH HRLAHALYRL ELPLLARIVA
EHAHAQTGID IHPGAQIGGG FFIDHGTGVV IGETAVIGER VRVYQAVTLG AKRFPRDASG
HLEKGLARHP IVEDDVVVYA GATILGRVTI GKGAVIGGNV WITQDIPPGS HVTQAVTRSD
PARPADAAAS SPRPAGAHDA TLSAAQALR
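
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch that translates the first and last 30 nucleotides of the gene. It assumes translation table 11 uses the standard codon-to-amino-acid assignments and that an alternative start codon such as GTG is read as methionine when it initiates the protein. The function name `translate_cds` and its `initiator` parameter are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str, initiator: bool = True) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon.

    If `initiator` is True, the first codon is treated as the start codon
    and reported as 'M' (assumption: bacterial alternative starts such as
    GTG are decoded as methionine at the initiation position).
    """
    clean = "".join(dna.split()).upper()  # drop the spacing used in the record
    codons = [clean[i:i + 3] for i in range(0, len(clean) - 2, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if initiator and residues:
        residues[0] = "M"
    return "".join(residues)


# First 30 nt of the gene sequence (start of the first sequence line).
head = translate_cds("GTGGCGGTAT TCGACATCGA CGACATCGTT")
# Last 30 nt of the gene sequence (the final sequence line), not an initiator.
tail = translate_cds("ACGCTTTCTG CCGCGCAGGC GCTGCGATGA", initiator=False)

print(head)
print(tail)
```

The two fragments correspond to the beginning (`MAVFDIDDIV`) and end (`TLSAAQALR`, followed by the stop codon) of the listed protein sequence.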