Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_C6645 |
Symbol | |
ID | 3733967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007509 |
Strand | + |
Start bp | 152048 |
End bp | 153553 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637760352 |
Product | aldehyde dehydrogenase (acceptor) |
Protein accession | YP_366339 |
Protein GI | 78059764 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.217332 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0894525 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCAA CTTCAGCAGA AAATCATCAG CACGCGGACT GGAGACGTCT TGCGGCGCAT GTGGTCCCGA GAACGCTCGC ACATATCGGC GGCTGCAGCG TGCCGGCCCG AAGCGGCCGG ACGTTCGCCG CGATCAACCC GGCGACGGAG GCCGTCATCG CGGAAGTCGC GTCGTGCGAT GCGCCGGACG TCGACGACGC GGTACGTGCC GCGCGTCACG CGTTCGAATC CGGCGCCTGG TCGCGCTGCG CGCCGGCAGA GCGCAAGCGT GTGCTTTGCC GGCTTGGCGA ACTCATCGCA TCGCACGGTG CGGAACTCGC GCTGCTCGAC TCGCTCAACA TGGGCAAGCG TGTGGCGGAT GCGTTCAGTA TCGACGTGCC AGCCGCAGGC GGTCTGTTCA GCTGGTACGG CGAAGCGGTA GACAAACTGC ATGGCGAAGT CGCGTCGACC GACCCCGGCA ACCTCGCGGT CGTCACGCGC GAACCGCTGG GCGTCGTCGG CGCCGTGGTG CCGTGGAATT TCCCGCTCGA CATGGTCGCG TGGAAGGTGG CGCCGGCGCT GGCGGCCGGC AACAGCGTCG TGCTGAAGCC GGCGGAACAA TCGCCGCTGT CCGCATTGCG TCTCGCGGAA CTGGCGCTCG AGGCCGGCCT GCCGCCGGGC GTGCTGAACG TCGTGCCGGG CTATGGCGAG ACCGCGGGGC GCGCGCTCGG GCTGCATCCC GACGTGGACG TGCTCGCCTT CACGGGATCG ACGGCCGTCG GCAAGAAATT CCTCGAGTAC GCCGCGCAGT CGAACATGAA GCAGGTGTGG CTTGAGTGCG GCGGCAAGAG CCCGAACCTG GTCTTCGACG ATACGGACGA TCTCGACCTG GCCGCACGCA AGGCGTGCTT CGGCATTTTC TTCAACCAGG GCGAGGTGTG TTCCGCCAAC TCGAGGCTGC TGGTTCAGCG TTCGATTCAC GATGCGTTCG TCGACCGGCT GATTGCGCAT GCCGCTGCGT TCATGCCCGG TGATCCGCTC GATCCGTCGA GCGGGATGGG CGCCATCGTC GACGAGCAGC AGCATCGACG CGTGCGCGAG TGGATTGCAC GCGGCCGCGA TAGCGCAACG CTCGCGATCG GCGGAGGCGC GCCGCGCGTC GACGGCAAGG GTTACTTCAT CGAGCCGACG ATCTTCATCG ACGTGAAGCA CGACGACGCC ATCGCACGCG AGGAGATCTT CGGGCCGGTG CTGTCGGTGA TGGCGTTCGA CACCGAGGAC GAGGCCGTCC GGCTCGCGAA CGACTCGATC TATGGCCTTG CCGCGTCGCT CTGGACCGGC AGCCTGTCGC GCGCGCACCG TGTGTCGGGC CGGTTGCGCG CCGGCACGGT GTCCGTCAAC ACGGTCGATG CGCTGAGCGC ACAGACGCCG TTCGGCGGGT TCCGCCAGTC GGGCTTCGGC CGCGATCTTT CGCTGCATGC GATCGACAAG TACACGGGCC TCAAGACGAC CTGGATCAGT TACTGA
|
Protein sequence | MNPTSAENHQ HADWRRLAAH VVPRTLAHIG GCSVPARSGR TFAAINPATE AVIAEVASCD APDVDDAVRA ARHAFESGAW SRCAPAERKR VLCRLGELIA SHGAELALLD SLNMGKRVAD AFSIDVPAAG GLFSWYGEAV DKLHGEVAST DPGNLAVVTR EPLGVVGAVV PWNFPLDMVA WKVAPALAAG NSVVLKPAEQ SPLSALRLAE LALEAGLPPG VLNVVPGYGE TAGRALGLHP DVDVLAFTGS TAVGKKFLEY AAQSNMKQVW LECGGKSPNL VFDDTDDLDL AARKACFGIF FNQGEVCSAN SRLLVQRSIH DAFVDRLIAH AAAFMPGDPL DPSSGMGAIV DEQQHRRVRE WIARGRDSAT LAIGGGAPRV DGKGYFIEPT IFIDVKHDDA IAREEIFGPV LSVMAFDTED EAVRLANDSI YGLAASLWTG SLSRAHRVSG RLRAGTVSVN TVDALSAQTP FGGFRQSGFG RDLSLHAIDK YTGLKTTWIS Y
|
| |