Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2215 |
Symbol | |
ID | 4022700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2480027 |
End bp | 2481457 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637962410 |
Product | aldehyde dehydrogenase |
Protein accession | YP_569351 |
Protein GI | 91976692 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.21786 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.328694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAACC GCATGCAATT CTACATCGAC GGCGCCTGGG TCGATCCCGT CGTCAAGAAG TCGACGCCGG TCGTCAATCC AGCGACCGAG GAGGCGATGT ATGAGATTGC ACTCGGCTCC AAGGCCGATG TCGACAAGGC GGTCGCAGCG GCCAAGCGTG CGTTCGAAAC CTTCTCCCAG ACCAGCCGCG AGGAGCGCAT CGCGCTGCTC GAGAAGATTA TCGGGATCTA CAAGGGCCGG ATGAAGGAAA TCGGCGCCGC GGTGTCCGAC GAAATGGGCG CGCCGCTGCC GATGGCCGAG AAGATGCAGG CCGGCGCCGG CCTGGGCCAC ATCATGTCGA CGCTCGACGT GCTGAAGAAT TATGCCTTCG AGGAGCCGTT GGGCTCGGCG ATGATCGTGC GTGAACCGAT CGGCGTCGTC GGCATGATCA CGCCGTGGAA CTGGCCGCTG AACCAGATCG CCTGCAAGGT CGCGCCGGCG CTCGCCGCGG GCTGCACCAT GATTCTCAAG CCGTCGGAAT TCACCCCGAC CTCGGCGCTG ATTTTCGCCG AGATCCTGCA TGAAGCCGGC GTGCCGAAGG GCGTGTTCAA CCTGGTCAAC GGCCTCGGCC CGGAGGTCGG CGCCGCGATG AGCGAGCATC CCGACATCGA CATGATCTCG TTCACCGGCT CGACCCGCGC GGGCATCGAC GTCGCGCAGC GCTCCGCGCC GACGGTGAAG CGCGTCAGCC AGGAACTCGG CGGCAAATCG CCGAACGTGA TCCTCGACGA CGCCGATCTC ACCAAGGCGG TGACCGGCGG CGTGATGCAC ATGTTCAACA ATTCCGGCCA GTCGTGTAAT GCGCCGAGCC GGATGATCGT GCCGCTGTCG AAGATGAAGG AGGTCGCCGC GATCGCCAAG GGCGTCGCGG AGAAGACCAA GGCGGGCGAT CCGCGCGGCG AGGGCACGAC GATCGGTCCG GTGGTCAATC GCGGCCAGTG GGACAAGATC CAGTCGTTGA TCAACAAGGG CATCGAGGAA GGCGCGACGC TGGTCGCCGG CGGCCCTGGC CTGCCGGAGG GCGTCAACAA GGGCTTCTAT GTCCGCCCGA CCGTATTCGC CGACGTCACC GACAACATGA CCATCGCACG CGAGGAAATC TTCGGGCCGG TGCTGGTGAT CATGGGTGCG AAGGACGAGG ACGAGGCGGT CAAGCTCGCC AACGACACGC CCTATGGCCT CGCGGGCTAT GTCTCGGCCG GTTCGGTCGA GCGGGCGCGC AAGGTCGGTC GCCAGATCCG CGCCGGCAAC GTCAATTTGC AGGGCGTTCC GAACGAACGC ACTGCGCCGT TCGGCGGCTA CAAGCAGTCC GGTAACGGCC GCGAATGGGG CAAGTTCGGC CTCGAGGAAT ATCTCGAGGT CAAGGCGATC GCCGGCTTCA ACGCCGCGTA A
|
Protein sequence | MVNRMQFYID GAWVDPVVKK STPVVNPATE EAMYEIALGS KADVDKAVAA AKRAFETFSQ TSREERIALL EKIIGIYKGR MKEIGAAVSD EMGAPLPMAE KMQAGAGLGH IMSTLDVLKN YAFEEPLGSA MIVREPIGVV GMITPWNWPL NQIACKVAPA LAAGCTMILK PSEFTPTSAL IFAEILHEAG VPKGVFNLVN GLGPEVGAAM SEHPDIDMIS FTGSTRAGID VAQRSAPTVK RVSQELGGKS PNVILDDADL TKAVTGGVMH MFNNSGQSCN APSRMIVPLS KMKEVAAIAK GVAEKTKAGD PRGEGTTIGP VVNRGQWDKI QSLINKGIEE GATLVAGGPG LPEGVNKGFY VRPTVFADVT DNMTIAREEI FGPVLVIMGA KDEDEAVKLA NDTPYGLAGY VSAGSVERAR KVGRQIRAGN VNLQGVPNER TAPFGGYKQS GNGREWGKFG LEEYLEVKAI AGFNAA
|
| |