Gene Bcep18194_A5054 details

Gene Information

Locus tag: Bcep18194_A5054
Symbol:
ID: 3750262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 2091369
End bp: 2092805
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 68%
IMG OID: 637763350
Product: aldehyde dehydrogenase
Protein accession: YP_369292
Protein GI: 78066523
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
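
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable and function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 2091369
END_BP = 2092805
GENE_LENGTH_BP = 1437
PROTEIN_LENGTH_AA = 478


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the values listed in the record.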


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCT ACGACCAGTT CTATATCGAC GGCGCGTGGC GCAAACCGGC CGGCACCGGC 
ACGATCGACG TGATCGACTC GGGCACCGAA GCCGTGATCG GGCGGATTCC GGAAGGCGTC
GCATCCGACG CGCAGGACGC GATCCGCGCG GCGCGCGCAG CCTTCGACGC CTGGGCGGCC
ACGCCGCCCG CGACGCGCGC GGGCTACCTG CGCAAGATCG TCGAGCATCT GCTGGCGCGC
AGCGAGGAAC TCGCGCAGTC GATCACCGGC GAAGTCGGGA TGCCGATCAA GCTGTCGCGC
GCGATCCAGG TCGGCGGCCC GATCTACAAC TGGAAGGCGT ACGCGAAGCT CGCCGAGTCG
TTCGAGTTCG AGGCACAGGT CGGCAACTCG CTCGTCGTGC GCGAGCCGGT CGGTGTCGTC
GCGGCAATCA CGCCGTGGAA CTACCCGCTC AACCAGGTCA CGCTGAAGGT CGCACCGGCA
CTGGCGGCCG GCTGCACGGT CGTCCTGAAG CCGTCCGAAG TCGCGCCGCT GAACGCGTTC
ATGCTCGCTG AAGCGATTCA CGAAGCCGGG CTGCCGGCCG GCGTGTTCAA CCTCGTGTGC
GGCTACGGCC CGGTGGTCGG CGAGGTGCTG GCCACCGATC CGGACGTCGA CATGGTGTCG
TTTACGGGCT CGACGCGCGC CGGCAAGCGC GTGGCCGAGC TGGCCGCCGC GGGCGTCAAG
CGCGTCGCGC TCGAACTGGG CGGCAAGTCG GCGTCGGTGA TTCTCGACGA TGCCGATTTC
GCGACGGCGG TGAAGGGCAC GGTCAACGCG TGCTACCTGA ACGCGGGGCA GACCTGCTCG
GCACACACGC GCATGCTGGT GCCGGAAGCG CGCTACGACG AGGCGCGCGC GATCGCGAAG
GCGGCGGCCG AAACCTACGT CGCCGGCGAT CCGCGGCAGG ATGCGACGCG CCTCGGCGCG
CTGGCATCGG CCGTCCAGCA GCAGCGTGTG CAGGACTACA TCCAGCGCGG GATCGACGAA
GGCGCGGAAC TCGTGACGGG TGGCACGGGC CTGCCGGAAG GGCTGGATAA AGGCTTCTTC
GTGAAGCCGA CCGTGTTCGG CCGCGTCGAT CCGAAATCGA CGATCGCGCA AGAGGAAATC
TTCGGGCCGG TGCTGTCGAT CATCACGTAT CGCGATGAAG ACGAGGCTGT GCGGATCGCG
AACGATTCGC CGTACGGGCT CGGCGGCGCG GTGTGGGCCG GCAGCGACGA ACGCGCGATG
GGCATCGCGC GCCGCATCCG CACCGGACAG GTCGACATCA ACGGCGGCGC GTGGAACATG
GCCGCGCCGT TCGGCGGCTA CAAGCAATCG GGTCACGGCC GCGAGAACGG CGTGTACGGG
CTCGAAGAAT ATCTCGAGTA CAAGTCGATG CAGCTCAAGC CCGCGAAGCC GGCCTGA
 
Protein sequence
MKIYDQFYID GAWRKPAGTG TIDVIDSGTE AVIGRIPEGV ASDAQDAIRA ARAAFDAWAA 
TPPATRAGYL RKIVEHLLAR SEELAQSITG EVGMPIKLSR AIQVGGPIYN WKAYAKLAES
FEFEAQVGNS LVVREPVGVV AAITPWNYPL NQVTLKVAPA LAAGCTVVLK PSEVAPLNAF
MLAEAIHEAG LPAGVFNLVC GYGPVVGEVL ATDPDVDMVS FTGSTRAGKR VAELAAAGVK
RVALELGGKS ASVILDDADF ATAVKGTVNA CYLNAGQTCS AHTRMLVPEA RYDEARAIAK
AAAETYVAGD PRQDATRLGA LASAVQQQRV QDYIQRGIDE GAELVTGGTG LPEGLDKGFF
VKPTVFGRVD PKSTIAQEEI FGPVLSIITY RDEDEAVRIA NDSPYGLGGA VWAGSDERAM
GIARRIRTGQ VDINGGAWNM AAPFGGYKQS GHGRENGVYG LEEYLEYKSM QLKPAKPA