Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3249 |
Symbol | |
ID | 3911050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3712826 |
End bp | 3714256 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885151 |
Product | aldehyde dehydrogenase |
Protein accession | YP_486856 |
Protein GI | 86750360 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAACC GCATGCAATT CTACATCGAC GGCGCCTGGG TCGATCCCGT CGCTCCCAAG TCGACGCCGG TGGTCAATCC GGCGACCGAG GACGCGATGT ACGAAGTTGC ACTCGGCTCC AAGGCCGACG TCGACAAGGC GGTCGTCGCC GCCAAGCGCG CGTTCGAAAC CTTCTCCCAG ACCAGCCGCG AGGAGCGCAT CGCGCTGCTC GAAAAAATCA TCGCGATCTA CAAGGGCCGC ATGAAGGAGA TCGGCGCCGC CGTCTCCGAT GAGATGGGCG CGCCGCTGCC GATGGCGGAG AAGATGCAGG CCGGCGCCGG CCTCGGCCAC ATCATGTCGA CGCTCGACGT GCTGAAGAAT TATCAGTTCG AGGAGCCGAT GGGCTCGGCG GTGATCGTGC GCGAGCCGAT CGGCGTCATC GGCATGATCA CGCCGTGGAA TTGGCCGCTG AACCAGATCG CCTGCAAGGT CGCGCCCGCG CTCGCCGCTG GCTGCACCAT GATCCTGAAG CCCTCCGAAT TCACACCGAC CTCGGCGCTG ATCTTCGCCG AGATCCTGCA TGAAGCCGGC GTGCCGAAGG GCGTGTTCAA TCTCGTCAAT GGCCTCGGCC CGGAGGTCGG CGCGGCGATG AGCGAACATC CGGACATCGA CATGATTTCG TTCACCGGCT CGACCCGCGC CGGCATCGAC GTCGCGCAGC GCGCGGCGCC GACCGTGAAG CGCGTCAGCC AGGAGCTCGG CGGCAAGTCG CCGAACGTCA TCCTCGACGA CGCCGACCTC ACCAAGGCGG TGACCGGCGG CGTGATGCAC ATGTTCAACA ACTCCGGCCA GTCCTGCAAC GCGCCGAGCC GGATGATCGT GCCGCTGTCG AAGATGAAGG AGGTCGCGGC GATCGCCAAG GGCGTCGCCG AAAAGACCAA GGCGGGCGAT CCGCGCGGCG AGGGCACCAC GATCGGCCCG GTGGTCAATC GCGGCCAGTG GGACAAGATC CAGACGCTGA TCAACAAGGG CATCGAGGAA GGCGCGACGC TGGTCGCCGG CGGACCCGGT CTGCCCGAAG GCGTCAACAA GGGCTTCTAC GTCCGCCCGA CGGTGTTCGC CGACGTCACC GACAACATGA CCATCGCCCG CGAGGAGATC TTCGGGCCGG TGCTGGTGAT CATGGGCGCC AAGGACGAGG ACGAGGCGGT CAAGCTCGCC AACGACACGC CCTATGGTCT CGCCGGCTAC GTCTCGGCCG GTTCGGTCGA GCGTGCGCGC AAGGTCGGCC GCCAGATCCG CGCCGGCAAC GTCAATCTGC AGGGCGTGCC GAACGAACGC ACCGCGCCGT TCGGCGGCTA CAAGCAGTCC GGCAACGGCC GCGAGTGGGG CAAGTTCGGC CTCGAGGAAT ATCTCGAGGT CAAGGCGATC GCCGGCTTCA ACGCCGCGTA A
|
Protein sequence | MVNRMQFYID GAWVDPVAPK STPVVNPATE DAMYEVALGS KADVDKAVVA AKRAFETFSQ TSREERIALL EKIIAIYKGR MKEIGAAVSD EMGAPLPMAE KMQAGAGLGH IMSTLDVLKN YQFEEPMGSA VIVREPIGVI GMITPWNWPL NQIACKVAPA LAAGCTMILK PSEFTPTSAL IFAEILHEAG VPKGVFNLVN GLGPEVGAAM SEHPDIDMIS FTGSTRAGID VAQRAAPTVK RVSQELGGKS PNVILDDADL TKAVTGGVMH MFNNSGQSCN APSRMIVPLS KMKEVAAIAK GVAEKTKAGD PRGEGTTIGP VVNRGQWDKI QTLINKGIEE GATLVAGGPG LPEGVNKGFY VRPTVFADVT DNMTIAREEI FGPVLVIMGA KDEDEAVKLA NDTPYGLAGY VSAGSVERAR KVGRQIRAGN VNLQGVPNER TAPFGGYKQS GNGREWGKFG LEEYLEVKAI AGFNAA
|
| |