Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3846 |
Symbol | |
ID | 3911650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4397463 |
End bp | 4398896 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885747 |
Product | aldehyde dehydrogenase |
Protein accession | YP_487450 |
Protein GI | 86750954 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCAGC CGTTGGGCGG CTTCGGGGGC AATGCCCTGC ACGACAGCTT TCACCGCATG ATCGAGCGCT CCCGCGCCGA GCCCCCGGCG TCGCTGGAGC AGCGGCTCGA CCGGCTGGCG CGGCTGCGCG GTTTGCTCAA AGACAATGAG ACGCGATTCG AGCAGGCGAT CTCGGCCGAT TTCGGCCATC GCTGTTCGGT CGAAACCATG ATCGCAGAGA CGTTGAGCCT GCTCGGCGAC ATCAAGCACA CCAGCAAGCA CGTCAAAGGC TGGATGGCGC CGCGCAAGGT GGCGACCCAG CCGCAATTCT GGCCGGGCAA GAACCGGCTG ATCCCGCAGC CGCTCGGCGT GGTCGGCATC ATCGCGCCGT GGAACTATCC GTTGCAGCTC ACGATCGCGC CGGCGATCGG CGCGCTGGCG GCCGGCAATC GGGTGATGAT CAAGCCCAGC GAATTGTCGC CCGCGTTCTC CGCCCTGCTG CAGGAGACGG TGGCGGCAAA GTTCGATCCC ACCGAGATGA TCGTGACCGG GATCGACGAC GGCGTCGCCG AGGCGTTCGC GAAGCTGCCG TTCGATCACC TGATGTTCAC CGGCTCGACC CGGGTCGGCC GCATCGTCGC GGCGGAAGCG GGCAAGAACC TCACCCCGGT CACGCTCGAA CTCGGCGGCA AGTCGCCGAC CATCATCGAC CGCTCCGCCG ATCTCGACGA GGTGGCGCCG CGGATCGCCT ATGCCAAGCT GATGAATGCC GGGCAGACCT GCATCGCGCC GGACTACGTG CTGGCGCCGC GCGACAAGGT CGAGGCGCTG GCGGGCAAGA TCCGCGACGC GATGCAGCGG ATGTTCGGCG CCGATCCCGC GAATACGGAC TACACCTCGA TCGTCGCCGA CCGGCACTAC GCGCGGCTGA AGGGCCTCGT CGACGACGCC GCCGCGCGCG GCGCGAGGCT GCTGCAACCG GCCCCGGCCG ACGATGCGGC GTGGCAGAGC CGGCGAAAAT TCCCGCCGAC CGTGGTGCTC GGCGCCACGC CCGAGATGAA GATCATGCAG GAGGAAATCT TCGGGCCGCT GCTGCCGATC CTCGGCTACG ACGATCCCGC CGACCCGATC GCCTTCATCA ACGGCCGCGA CCGGCCGCTG GCGCTGTACT GGTTCGGCAC CGACGAGGCG GCGCGCGACG AGGTGCTGCA ACGCACCGTG TCCGGCGGTG TGACGATCAA CGACTGCCTA GTGCATTTCG CGCAGGTGAA CCAGCCGATG GGCGGCGTCG GCGCCTCGGG CACCGGCGCG TATCACGGCG AATGGGGCTT CAACACCTTC ACGCAGCTCA AGCCGGTGTT CTATCGCTCG CCCTACAACC GGTTCGCCGA TCTGTATCCG CCCTATGGCG GCAAGATCGC GCGGCTGGCG AAAGTGCTGC GCTGGATGTC CTGA
|
Protein sequence | MDQPLGGFGG NALHDSFHRM IERSRAEPPA SLEQRLDRLA RLRGLLKDNE TRFEQAISAD FGHRCSVETM IAETLSLLGD IKHTSKHVKG WMAPRKVATQ PQFWPGKNRL IPQPLGVVGI IAPWNYPLQL TIAPAIGALA AGNRVMIKPS ELSPAFSALL QETVAAKFDP TEMIVTGIDD GVAEAFAKLP FDHLMFTGST RVGRIVAAEA GKNLTPVTLE LGGKSPTIID RSADLDEVAP RIAYAKLMNA GQTCIAPDYV LAPRDKVEAL AGKIRDAMQR MFGADPANTD YTSIVADRHY ARLKGLVDDA AARGARLLQP APADDAAWQS RRKFPPTVVL GATPEMKIMQ EEIFGPLLPI LGYDDPADPI AFINGRDRPL ALYWFGTDEA ARDEVLQRTV SGGVTINDCL VHFAQVNQPM GGVGASGTGA YHGEWGFNTF TQLKPVFYRS PYNRFADLYP PYGGKIARLA KVLRWMS
|
| |