Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1032 |
Symbol | |
ID | 4895539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1065193 |
End bp | 1066611 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640111619 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001042915 |
Protein GI | 126461801 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.619093 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAACA TCGACCGTTT CTTCATTGGC GGCGCCTGGA CGGCCCCCCT CGGCGCCGAC CGCCACCGGC TGGTGAACCC GGCGACGGAG GAAGAGATCG CCGCGATCCC GATGGCCTCG GCCGAGGATG TCGACCGCGC CGTCGCCGCG GCCCGCGCCG CCTTCGAGGG CTGGCAGGCC AGCTCGAAGG AGGAGCGCCT CGCCCTCCTT CGGCGTCTCC TCGATCTCTA CAACGAAGCC TACGACGAGC TGGCCGAGCT CATGACCCGC GAGATGGGCA CGGTTGCCCG CTTCAGCCGC GAGGCGCAGG CCTGGGTCGG GCGCGCCCAT CTGGAGGCCG CCATCGAGGC TCTCGAGGCC GAGAGCTTCG AGGAGATGCG CGGCTCGACC CTGATCTCGA AGGAGCCCAT CGGCGTCTGC GCCCTCATCA CGCCCTGGAA CTGGCCGATG AACCAGCTGG TCGTGAAGGT GGCCCCCGCC CTTGCCGCAG GCTGCACGGT GGTGGCGAAA CCCTCGGAAT TCTCGCCGCT CTCCTCGATC CGCTTCGCCG AACTGGTCGA GGCGGCGGGC TTTCCGCCGG GCGTCTACAA TCACATCACC GGCGCGGGCC CCGTCGCGGG CGAGGCGCTG GCGCGGCACC CGGACGTGGA CATGATCTCG ATCACCGGCT CGACCCGGGC AGGGATCGCG GTGGCGCGCG CGGCCGCCGA CACGGTCAAG CGCGTGACGC AGGAGCTCGG CGGCAAGTCG GCCAACATCA TCCTGCCCGA TGCGGATCTC GCCACCGCCG TCCGGCAGGG GGTGCTCGAC TGTTTCGGCA ATGCGGGGCA GGCCTGCAAG GCGCCCGCGC GGATGCTGGT GCCGGCCGAG CGGATGGAGG AGGCCGCGGC CCTCGCGGCC GCGGCGGCCG AGGCCCTGAC GGTGGGGGCG CCCGAGGGCG AGGTCGATCT CGGCCCCGTC GTGAACGAGA GCCAGTGGCG CCGCATCCAG TCGCTGATCG AGGCGGGCAT CGCCGAGGGC GCGCGCCTCG TCACCGGGGG TCCGGGCCGG CCCGACCATC TGCCGCGCGG CTGGTATGTG CGCCCGACCG TCTTTGCCGA CGTGGCCCAT GGCTCGACCA TCGCCACCGA GGAGATCTTC GGCCCCGTCG TGGCCCTCAT TCCCTACCGG GACGAGGAAG AGGCGATCCG CATCGCCAAC GACTCGATCT ACGGGCTTGC GGGCTATATC CAGACCGGCG ACCCCGAGAC GGCGCGGCGC ATCGCGCGGA AGCTGCGCGT GGGCATGGTC TATATCAACG GCGCGGGCTG GGACGCGCGC GCCCCCTTCG GCGGCTACAA GCAATCGGGC AACGGGCGCG AACACGGCGC CTGGGGCCTC GCCGACTATC TCGAGACGAA AGCCACGGCC GGATTGTGA
|
Protein sequence | MRNIDRFFIG GAWTAPLGAD RHRLVNPATE EEIAAIPMAS AEDVDRAVAA ARAAFEGWQA SSKEERLALL RRLLDLYNEA YDELAELMTR EMGTVARFSR EAQAWVGRAH LEAAIEALEA ESFEEMRGST LISKEPIGVC ALITPWNWPM NQLVVKVAPA LAAGCTVVAK PSEFSPLSSI RFAELVEAAG FPPGVYNHIT GAGPVAGEAL ARHPDVDMIS ITGSTRAGIA VARAAADTVK RVTQELGGKS ANIILPDADL ATAVRQGVLD CFGNAGQACK APARMLVPAE RMEEAAALAA AAAEALTVGA PEGEVDLGPV VNESQWRRIQ SLIEAGIAEG ARLVTGGPGR PDHLPRGWYV RPTVFADVAH GSTIATEEIF GPVVALIPYR DEEEAIRIAN DSIYGLAGYI QTGDPETARR IARKLRVGMV YINGAGWDAR APFGGYKQSG NGREHGAWGL ADYLETKATA GL
|
| |