Gene Information |
Locus tag | RPB_3107 |
Symbol | |
ID | 3910908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3540073 |
End bp | 3541464 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885011 |
Product | aldehyde dehydrogenase |
Protein accession | YP_486716 |
Protein GI | 86750220 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
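
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length) and that the final codon is a stop codon contributing no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3540073
end_bp = 3541464
gene_length_bp = 1392
protein_length_aa = 463

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no amino acid.
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.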
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCTG ACAAGGACGG TCGTTTCGTC GTTTACAATC CCTGGACGGC CCAGGAAGCC TTTACGACGC CGGGCACCAG CGAGCAGCGT GTCGATGAGA TCGCGCGCGA AGCGAAGGCC GCGTTCTATC AGCATCTCAA GACGCCGGGA CACGTTCGGG CGGAATGGAT CCATGGCGCC GCCGCCGCGC TGGAACGGGC GCGGAGCGAG ATCGTCGAGG CGATGATCGC GCATATCGGC AAGCCGCGCA AAACTGCCGA GATGGAGGTC AGGCGCAGCG TCGCTTTCAT CGCGGCCTGC GCACAGCATC TGCATGCGCT GGGCGGACAT TTGCTGCCTC TGGACATGGT GCCGGCGGGT GTCGGCTCTG TCGGATTTGC GCGGCGGATG CCGTATGGTG TCGTCGCTGC GGTCACCCCG TTCAATGCAC CGTCCAACCT GCTGGTGCAG AAATTGGCGC CGGCGCTCGC CACCGGCAAC GCCGTCGTGA TCAAGCCGTC TCTCGAGGGC ACGCGGATCG CAGAGATGAT TGCACGGGCC TTCGTCACGG GCGGCGTTCC CGAGGGGCTG GTGTGGGTGG TTCCTGGCGA TCGCGCAGAG GCGCTGGGCT TGGCTGGGCA CCGCGACGTC GATCTTGTAA CCCTAACCGG AGGCACTGCG GCCGGCGACG CACTGGCGCG GGCTGCCGGC GCAAAGCGAT TTCTCGGCGA ACTCGGCGGC AATTCGCCGA ACATCGTTGC GGCGGACGCA GACATCGAGG ACGCGGTGAA ACGCATCGTG CCGTCGTCGT TCGAAGCGAG TGGCCAGCAA TGCATTTCAA CGCAAAGGAT CATTGTTGAA GCGCCGGTGT TCGATCGGTT CCTTGCTCTC TTCGTCGAGG AAACGAAGCG TCTCAAGGTC GGCGATCCGG CCGCGGCGGA TACCGACCTC GGTCCCGTGG TGTCGCGAGT CTCTGCCGAG CGGATTGCGG CGATGATCGA AGACGCGCGC GCCCTCGGCG CCCGCGTGAT CAGCTGCGGC GAAATCCGGG ATTGCGTGAT CCCGCCGACC ATCGTGGTCG AGCCGCCGGC GGCGGCGCGG CTCATCCGCG AGGAGGTGTT CGGACCTGTC GTCGTGGTGC TCCGCGCCGC CGATGTCGAC GATGCCATCC GCATCGCCAA TGATTGCGAA TTCGGCTTGC AAGGCTCGTG TTTCACCGCG AGCCTGTCGA CCGCGTTGCG GGTCGCGGAC GAGGTGCGTG TCGGATCGCT GTGGATCAAC GAGGCCAGCC GGTTCCGGCT CGACAACTAT CCGTTCGGTG GCATGGGCCG CTCGGGCGTC GGCCGCGAAG GGTTACCTTA TGCGCTGGAG GAGTACACCC AGCTCAAGTT CACCGGAATG CGCGGGGTAT AG
|
Protein sequence | MQADKDGRFV VYNPWTAQEA FTTPGTSEQR VDEIAREAKA AFYQHLKTPG HVRAEWIHGA AAALERARSE IVEAMIAHIG KPRKTAEMEV RRSVAFIAAC AQHLHALGGH LLPLDMVPAG VGSVGFARRM PYGVVAAVTP FNAPSNLLVQ KLAPALATGN AVVIKPSLEG TRIAEMIARA FVTGGVPEGL VWVVPGDRAE ALGLAGHRDV DLVTLTGGTA AGDALARAAG AKRFLGELGG NSPNIVAADA DIEDAVKRIV PSSFEASGQQ CISTQRIIVE APVFDRFLAL FVEETKRLKV GDPAAADTDL GPVVSRVSAE RIAAMIEDAR ALGARVISCG EIRDCVIPPT IVVEPPAAAR LIREEVFGPV VVVLRAADVD DAIRIANDCE FGLQGSCFTA SLSTALRVAD EVRVGSLWIN EASRFRLDNY PFGGMGRSGV GREGLPYALE EYTQLKFTGM RGV
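
As a rough consistency check between the two sequences above, the gene sequence can be translated with the codon assignments of translation table 11, which uses the same amino-acid assignments as the standard code and differs only in permitted start codons. The sketch below is a minimal pure-Python illustration, not the database's own pipeline; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It assumes the displayed gene sequence is already in coding orientation, which its leading ATG suggests even though the gene lies on the minus strand.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for codons ordered TTT, TTC, TTA, TTG, TCT, ...
# (shared by the standard code and bacterial table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the blanks that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks and last two blocks of the gene sequence above.
first_blocks = "ATGCAGGCTG ACAAGGACGG TCGTTTCGTC"
last_blocks = "CGCGGGGTAT AG"

print(translate(first_blocks))  # MQADKDGRFV
print(translate(last_blocks))   # RGV
```

The two fragments translate to the first ten and last three residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.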