Gene Bcen_4052 details


Gene Information

Locus tag: Bcen_4052
Symbol:
ID: 4096066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia AU 1054
Kingdom: Bacteria
Replicon accession: NC_008061
Strand:
Start bp: 1222414
End bp: 1223985
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 67%
IMG OID: 638017346
Product: methylmalonate-semialdehyde dehydrogenase
Protein accession: YP_623914
Protein GI: 107026403
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
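
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1222414, 1223985)
print(length)                  # 1572
print(protein_length(length))  # 523
```

Both values agree with the Gene Length and Protein Length fields of the record.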


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCGATC GCTCGTCCTC ACGCGACAAC CGAATTCAGG AAAGAGCCCC AGACATGAAA 
CACGACAGCA ATGTGACCTC CACCGTCGGC CACCTGATCG ACGGCAAGCG CGTCGACGGC
GGCAGCCGCG TCCAGCCGGT GTTCGACCCG GCGACGGGCG AATCGCGCAA GAGCGTGGCG
CTCGCCGACA AGCTGACCGT CGAAGCCGCG ATCGCATTCG CGCAAGCCGC GTTTCCGGCA
TGGCGCAACA CGCCGCCGCT GAAGCGCGCC CGCGTGATGA GCCGCTTCAA GACGCTGCTG
GAAGAGCACG CCGACGAGCT GTGCGCGCTG ATCACCGCCG AGCACGGCAA GGTGCTCGCC
GATGCGATGG GCGAACTGCA GCGCGGGATC GAGAACGTCG AATACGCGAC CTACGTGCCC
GAGTTGCTGA AAGGCGAGCA CAGCAAGAAC GTCGGCCCTG CGATCGATTC GTGGAGCGAG
TTCCAGGCGC TCGGCGTCGC GGCCGGCATC ACGCCGTTCA ACTTCCCGGT GATGGTGCCG
CTGTGGATGT GGCCGATGGC CGTCGCGTGC GGCAACACGT TCGTGCTGAA GCCATCCGAG
CGCACGCCGT CGTCGACGCT GCGCATGGCC GAACTCGCGC TCGAGGCCGG CCTGCCGCCG
GGCGTGCTGA ACGTCGTGAA CGGCGACAAG GAAGCGGTCG ACACGATCCT CACCGATCCG
CGCGTGAAGG CCGTGAGCTT CGTCGGCTCG ACGCCGATCG CCGAATACAT CTACTCGACC
GGCTGCGCGC ACGGCAAGCG CGTGCAGGCG CTCGGCGGCG CGAAGAACTT CGCGGTCGTG
ATGCCGGATG CCGATATCCC CAACGCGGTG AACGCGCTGA TGGGCGCCGC ATACGGCTCC
TGCGGCGAGC GCTGCATGGC GATTCCGCTG GTCGTCGCGA TCGGTGACGA AACGGGCGAC
CAGGTCGTCG CGGGGCTGAA GGCCGAGATC GAGAAGATGA AGGTCGGCCC GGGCAACGGC
GCGGGCGTCG ACATGGGCCC GCTCGTCACG CAGCAGCATT TCGAGAAGGT GACGGGTTTC
GTCGAAGCGG GCGTGGCAGC GGGCGCGACG CTCGTCGTCG ACGGTCGCGG CGTGAAGGTC
GACGGCCACG ACGGCGGTTA CTACCTCGGG CCGTGCCTGT TCGACAACGT GAAGCCCGGC
ATGCCGATCT ACCAGCACGA GATCTTCGGG CCGGTGCTCG GCGTGATTCG CCTGAAATCG
CTCGACGAGG CGATGGCGCT GATCGATGCG CACGAATACG GCAACGGCAC CTGCCTGTTC
ACGCGCGACG GCGAGGCCGC GCGCTACTTC AGCGACAACA TCCAGATCGG CATGGTCGGC
ATCAACGTGC CGCTGCCGGT GCCGGTCGCG TACCACTCGT TCGGCGGCTG GAAGCGTTCG
CTGTTCGGCG ACCTGCACGC ATACGGCCCG GACGCCGTGC GGTTCTACAC GAAGCGCAAG
ACGATCACGC AGCGCTGGCC GTCGGCCGGT GTGCGCGAAG GGACGGTGTT CAGCTTCCCG
TCGAACCGCT GA
 
Protein sequence
MSDRSSSRDN RIQERAPDMK HDSNVTSTVG HLIDGKRVDG GSRVQPVFDP ATGESRKSVA 
LADKLTVEAA IAFAQAAFPA WRNTPPLKRA RVMSRFKTLL EEHADELCAL ITAEHGKVLA
DAMGELQRGI ENVEYATYVP ELLKGEHSKN VGPAIDSWSE FQALGVAAGI TPFNFPVMVP
LWMWPMAVAC GNTFVLKPSE RTPSSTLRMA ELALEAGLPP GVLNVVNGDK EAVDTILTDP
RVKAVSFVGS TPIAEYIYST GCAHGKRVQA LGGAKNFAVV MPDADIPNAV NALMGAAYGS
CGERCMAIPL VVAIGDETGD QVVAGLKAEI EKMKVGPGNG AGVDMGPLVT QQHFEKVTGF
VEAGVAAGAT LVVDGRGVKV DGHDGGYYLG PCLFDNVKPG MPIYQHEIFG PVLGVIRLKS
LDEAMALIDA HEYGNGTCLF TRDGEAARYF SDNIQIGMVG INVPLPVPVA YHSFGGWKRS
LFGDLHAYGP DAVRFYTKRK TITQRWPSAG VREGTVFSFP SNR
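
The protein sequence follows from the gene sequence under translation table 11, in which the TTG at the start of this gene acts as an initiator codon and is read as methionine. The following is a minimal sketch of that translation using only the standard library. It assumes the input is a complete, in-frame CDS containing only A, C, G and T, and `translate_cds` is a hypothetical helper name rather than anything used by the source database.

```python
# Amino acids of NCBI translation table 11, with codons enumerated in
# T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiator codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAGCGATC GCTCGTCCTC ACGCGACAAC"))  # MSDRSSSRDN
```

The printed residues match the first block of the protein sequence in the record.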