Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3791 |
Symbol | |
ID | 3969479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4214984 |
End bp | 4216417 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637926901 |
Product | aldehyde dehydrogenase |
Protein accession | YP_533644 |
Protein GI | 90425274 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCAGG CGCTGCACAG TGCCGGGCGC AGAGCGCTCG ACGACGACTT TCATCGGATG TGGGCGACGG CGCGGCAAAC CCCGCCGCCG CCTCTGGAGA CCCGCCTCGA CCGGCTGGCG CGCCTGCGGG CGCTGATTCG CGACCATGAG GCGGGGTTCA GCGCGGCGAT CTCAGCGGAT TTCGGCCATC GCTGCCCGGT GGAAACCCAG ATCGCCGAGA CGTTGTCGGT GCTGGGCGAG ATCAAGCACA CCGCGAGGCA TCTGAAAGCC TGGATGGCGC CGCGGCGCAT CGCCACCCAG TTGCAGTTCC TACCCGGCCG CAACCGGCTG ATCCCGCAGC CGCTCGGGGT GGTCGGCATC ATCGCGCCGT GGAATTACCC GTTGCAGCTG ACCTTGGCCC CCGCGGTGGC GGCGCTGGCC GCCGGCAATG CGGCGATGAT CAAACCCAGC GAATTGACGC CGCGGTTTGC CGCCCTGCTG CAGGAAACCG TGGCGGCCAA GTTCGCGCCC GACGAGATGG TGGTGACCGG CATCGAGGAC GACATCGCCG AAGCCTTCGC CGCGCTGCCG TTCGACCATC TGATGTTCAC CGGCTCGACC AGGGTCGGCC GCATCGTCGC CGCCGCCGCC GGCCGCAATC TCACCCCGGT GACGCTGGAG CTCGGCGGCA AGTCGCCGGT GATCATCGAC GCCTCCGCCG ACCTCGACCA GGCGGCGGCG CGGATCGCCT ATGCCAAATT GCTCAACGCC GGCCAGACCT GCATCGCACC GGATTACGTG CTGGTGCCGA ACGTCTCGCT GCAGGCCTTC GCCGACAAGC TGCGCGACGC GATGCGCCGC ATGTTCGGCG CCGACCCCGG CAACCAGGAC TACAGTTCGA TCATCGCCGA GCGGCATTAT GCCCGGCTCG AGGGCCTGCT CGCCGATGCG CGGGCGCTCG GCGCCAGCGT CGTGCAGAGC GCCTCGCCCG ACGACGCCGC GTGGAAAGCG CTGCGCAAAT TCCCGCCGAC GGTGCTGACC GGCGTCAGCT CCGAGATGAA GATCATGCAG GAGGAGATCT TCGGGCCGCT GCTGCCGATC CTCGGCTACG ACGACGCCAG CGAGCCGATC GCTTTCATCA ACGCCCGCGA CCGGCCGCTG GCGCTGTACT GGTTCGGCAC CGACGACGCC GCGCGCGACG AGGTTTTGGC GCGCACCGTC TCCGGCGGCG TCACCGTCAA CGACTGCCTG GTGCATTTCG CGCAAGTCAA CCAGCCGATG GGCGGCGTCG GCGCCTCGGG AAGCGGCGCC TATCACGGCG AATGGGGCTT CAACACCTTC AGCAAGCTGA AGCCGGTGTT CTATCGCTCG CCCTACAACC GCTTCGCCGA TCTCTATCCG CCCTATGGCG GCACGGTCGC GCGGCTCGCC AAACTGCTGC GCTGGCTGTC CTAG
|
Protein sequence | MDQALHSAGR RALDDDFHRM WATARQTPPP PLETRLDRLA RLRALIRDHE AGFSAAISAD FGHRCPVETQ IAETLSVLGE IKHTARHLKA WMAPRRIATQ LQFLPGRNRL IPQPLGVVGI IAPWNYPLQL TLAPAVAALA AGNAAMIKPS ELTPRFAALL QETVAAKFAP DEMVVTGIED DIAEAFAALP FDHLMFTGST RVGRIVAAAA GRNLTPVTLE LGGKSPVIID ASADLDQAAA RIAYAKLLNA GQTCIAPDYV LVPNVSLQAF ADKLRDAMRR MFGADPGNQD YSSIIAERHY ARLEGLLADA RALGASVVQS ASPDDAAWKA LRKFPPTVLT GVSSEMKIMQ EEIFGPLLPI LGYDDASEPI AFINARDRPL ALYWFGTDDA ARDEVLARTV SGGVTVNDCL VHFAQVNQPM GGVGASGSGA YHGEWGFNTF SKLKPVFYRS PYNRFADLYP PYGGTVARLA KLLRWLS
|
| |