Gene Bcep18194_B1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B1993 
Symbol 
ID3753758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp2283590 
End bp2285407 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content66% 
IMG OID637766841 
Productsulfoacetaldehyde acetyltransferase 
Protein accessionYP_372750 
Protein GI78062842 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR03457] sulfoacetaldehyde acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.15706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC AATCCACTTC CCTGCGTGCA TCGGCGAACG GTCCGCAGGA CATGACGCCG 
TCCGAAGCCT TCGTCGAGAC CCTCGCGGCC AACGGCGTGA CCGACATGTT CGGCATCATG
GGCTCCGCGT TCATGGATGC GATGGACATC TTCGCGCCGG CCGGCATCCG CCTGATCCCG
GTCGTGCACG AACAGGGCGC GGGCCACATG GCCGACGGCT ATGCGCGCGT ATCGGGCCGC
CACGGCGTCG TGATCGGCCA GAACGGCCCC GGCATCAGCA ACTGCGTGAC GGCAATCGCG
GCCGCGTACT GGGCGCACAG CCCGGTCGTG ATCGTCACGC CGGAAGCCGG CACGATGGGC
ATCGGCCTCG GCGGTTTCCA GGAAGCGAAC CAGCTGCCGA TGTTCCAGGA ATTCACGAAA
TACCAGGGCC ATGTCACGCA CCCGGCGCGG ATGGCCGAAT TCACCGCGCG CTGCTTCGAC
CGCGCGCAGG CCGAGATGGG CCCGACGCAG CTGAACATCC CGCGCGACTA CTTCTACGGC
AAGGTCAAGG TCGAGATTCC GCAACCGCGC CGGCTCGATC GCGGCGCCGG CGGCGAACAG
AGCCTGGACG ATGCGGCCGC GCTGATCGCG CAGGCGAAGT TCCCGGTGAT CATCTCGGGC
GGCGGCGTCG TGATGGCCGA TGCGATCGAG GAATGCAAGG CGCTCGCCGA ACGGCTCGGC
GCGCCGGTCG TCAACAGCTA CCTGCACAAC GACTCGTTCC CGGCGAACCA TCCGCTGTGG
TGCGGCCCGC TCGGCTACCA GGGCTCGAAG GCGGCGATGA AGCTGCTGTC GCGCGCGGAC
GTCGTGATCG CGCTCGGCTC GCGCCTCGGG CCGTTCGGCA CGCTGCCGCA GCACGGGATG
GACTACTGGC CGAAGGACGC GAAGATCATC CAGATCGACG CGGATCACAA GATGCTCGGC
CTCGTGAAGA AGATCTCGGT CGGCATCTGC GGCGACGCGA AGGCCGCGGC GGTCGCGCTC
ACGCAACGCC TCGAAGGCCG CACGCTCGCG TGCGACGGCT CGCGCGGCGA TCGCGCCGAC
CAGATCGCGA CCGAGAAGGC CGCGTGGGAA AAGGAACTCG ACGACTGGAC GCACGAGCGC
GACGCGTACA GCCTCGACAT GATCGAGGAG CAGAAGCACG AGAAGCCGTT CAGCGGCGGC
CAGTACCTGC ATCCGCGCCA GGTGCTGCGC GAACTCGAGA AGGCGATGCC CGAGGACGTG
ATGGTGTCGA CCGACATCGG CAACATCAAC TCGGTCGCGA ACAGCTACCT GCGCTTCAAC
AAGCCGCGCA GCTTCTTCGC GGCGATGAGC TGGGGCAACT GCGGCTATGC GTTCCCGACG
ATCATCGGCG CGAAGGTCGC GGCACCGCAC CGCCCGGCCG TGTCGTATGC GGGCGATGGC
GCGTGGGGCA TGAGCCTGAT GGAAACGATG ACCTGCGTGC GCCACAACAT CCCGGTCACG
GCCGTCGTGT TCCACAACCG TCAATGGGGC GCGGAGAAGA AGAACCAGGT CGACTTCTAC
AACCGCCGCT TCGTCGCCGG CGAACTCGAC AACCAGAGCT TCGCGGAAAT CGCGCGTGCA
ATGGGTGCCG AAGGGATCAC GGTCGACCGC CTCGAAGATG TGGGCCCGGC GCTCAAGCGT
GCGATCGACA TGCAGATGAA CGAAGGCAAG ACGACGATCA TCGAGATCAT GTGCACGCGC
GAACTCGGCG ATCCGTTCCG CCGCGATGCG CTGTCGAAGC CTGTGCGCAC GCTCGACAAG
TACAAGGACT ACGTGTGA
 
Protein sequence
MSEQSTSLRA SANGPQDMTP SEAFVETLAA NGVTDMFGIM GSAFMDAMDI FAPAGIRLIP 
VVHEQGAGHM ADGYARVSGR HGVVIGQNGP GISNCVTAIA AAYWAHSPVV IVTPEAGTMG
IGLGGFQEAN QLPMFQEFTK YQGHVTHPAR MAEFTARCFD RAQAEMGPTQ LNIPRDYFYG
KVKVEIPQPR RLDRGAGGEQ SLDDAAALIA QAKFPVIISG GGVVMADAIE ECKALAERLG
APVVNSYLHN DSFPANHPLW CGPLGYQGSK AAMKLLSRAD VVIALGSRLG PFGTLPQHGM
DYWPKDAKII QIDADHKMLG LVKKISVGIC GDAKAAAVAL TQRLEGRTLA CDGSRGDRAD
QIATEKAAWE KELDDWTHER DAYSLDMIEE QKHEKPFSGG QYLHPRQVLR ELEKAMPEDV
MVSTDIGNIN SVANSYLRFN KPRSFFAAMS WGNCGYAFPT IIGAKVAAPH RPAVSYAGDG
AWGMSLMETM TCVRHNIPVT AVVFHNRQWG AEKKNQVDFY NRRFVAGELD NQSFAEIARA
MGAEGITVDR LEDVGPALKR AIDMQMNEGK TTIIEIMCTR ELGDPFRRDA LSKPVRTLDK
YKDYV