Gene Information |
Locus tag | Rsph17029_3472 |
Symbol | |
ID | 4898854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 548738 |
End bp | 551113 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640114069 |
Product | betaine-aldehyde dehydrogenase |
Protein accession | YP_001045337 |
Protein GI | 126464224 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
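
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 548738, End bp 551113.
length_bp = gene_length(548738, 551113)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2376 791
```

Both results match the Gene Length (2376 bp) and Protein Length (791 aa) fields in the record.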
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.27044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCA AGGACATCAT GGAAAGCATG GATTACGGCC CTGCGCCCGA GGCCGCGACG GATGCGAAGG CGTGGCTGGC GGCGCGCGGC CATGCTCTCG GACATTACAT CGACGGCGCC TTCACAGGCG CCGAGACGGC CGCCATCGAG GTCGAGAACC CGGCCACGGG CGAGATCCTC GCCCGGATCC CCGCGGCGGG CGAGGCCGAG ATCGAGGCCG CCGTGGCCGC GGCCCGCGCG GCCTTCTCCG GCTGGTCGCA GCTTCCGGGC TTCGAGCGGG CGCGGTACCT CTACGCCATC GCCCGCGGCC TGCAGAAGCG CGAGCGGTTC TTCTCGGTCC TCGAGACGCT CGACAACGGC AAGGCCATCC GCGAGACCCG CACCGCCGAC GTGCCGCTCG CCATCCGCCA CTTCTATCAT CATGCGGGCT GGGCCGCGGT GCTGGGCGAG GAGTTTCCGG GCCACGAGCC GCTCGGCGTC TGCGGTCAGG TGATCCCCTG GAACTTCCCG ATGCTGATGC TGGCGTGGAA GATCGCGCCG GCGCTGGCGG CGGGCAACAC GGTGGTGCTG AAGCCCGCCG ACCTCACGCC GCTGACCGCG GTGGCCTTCG CCGAGATGCT GGACGAGATC GGCCTGCCCA GGGGCGTGGT CAATATCGTC CATGGCGGGG CCGAGACCGG CGCGCTTCTC GTCCGCCATC CGGGCGTGGC GAAGGTGGCC TTCACCGGGT CGACCGCCGT CGGCCGCGAG ATCCGGCGCG CGACAGCCGG CAGCGGCAAG AGCCTGACGC TCGAGCTCGG CGGCAAGTCG CCCTTCGTGG TCTGCGCCGA TGCCGATCTC GATGCCGCCG TCGAAGGGGT GGTGGAGGGC GTCTGGTTCA ACCAGGGCGA GGTCTGCTGC GCGGGCTCGC GGCTTCTGCT GCAGGAGGGG ATCGCCGAGC GGTTCCTCGC CAAGCTCCGG GCGCGGATGG AGAAGATCCG CGTGGGCGAT CCGCTCGACA AATCCACCGA CATGGGCGCC ATCGTCTCGG CGCGCCAGAA GGCCCGGATC GAGGAGCTGA TCGCGGGCGC GGCGCGCGAG GGCTACCGGC TTGAGCAGGC GGCCTGCCCG CTGCCCGCGG CCGGTCATTT CGTGGCGCCC GGCTTCTTCG CCGACACCGA GCCCGCGGCC ACCGTCGCGC AGGTCGAGAT CTTCGGCCCG ATCGCGGTCA CGACCACCTT CCGCACCGTC GACGAGGCGG TGGCGCTGGC CAACAACACG CCCTACGGGC TCGCGGCCTC GGTCTGGTCC GAGAACATCA ATGCCGCGAC CGAGCTTGCC GCGCGGATCC GCGCGGGCGT GGTCTGGATC AACGCCTCGA ACCTCTTCGA TGCGGGCGCG TCCTTCGGCG GCATGAAGGA GAGCGGCTTC GGGCGCGAGG GCGCGCGGGA AGGCCTCGGC GCCTATCTGC GCCCGCGCAC GCCGCGCGGC CCCGAGGCGC TGGTGGCGCC CGTGGATTTC ACCGCCCACA CCGGCATGGG CGGCAACCTG TCCGGCCTCA TCGACCGCAC GATGAAGAAC TACATCGGCG GCGCGCAGGT CCGGCCCGAC GGCGGCGCGA GCTATGTGGT GCGCGGCCCG AAGGGCGAGG CCCTGGGCCT CGCGCCCGTC TCGGGGCGCA AGGACATCCG CAACGCCGTC GAGGCTGCGC TGAAGGCGAA GGGCTGGGCC GCCAACGCCC ATGGCCGGGC GCAGGTGCTG TTCTTCCTCG CCGAGAACAT CGCCGCCCGC GCCGAGGATC TTGCGGCGGC CCTCGTGCAG 
GGCGGCGCCG GCAGGTCCGA GGCCGCAGCG GAGGTGCGCA GCCTCATCGA GCGGGTGTTC TTCTATGCCG GCATGGCCGA CAAGGACGAT GGCCGCATCC ATGCGACGAA GCCGCGCCAC CTGACGCTCT CGGTGAAGGA GCCGCTGGGG GTGGTGGGGG TTCTCGCGCC CGACGAGGCG CCGCTCCTGT CGCTCATGTC GCTGATCCTG CCGCTGATCG CTGCGGGCAA CCGCGTGGTG GCGGTGCCCT CGCCCGCGCA GGCGCTGCTG GCGCAGCCGC TCACCCAGAT CTTCGACACG TCGGATCTGC CCGGGGGCGT GGTGAACCTC GTCACCGGCG ACCGCAATCT CCTCGCCCGG ACGCTCGCCG AGCACGATGC GGTGGACGGG ATCTGGTATC ACGGATCGGC CAAGGGCGCG GCGGAGGTCG AGGCGCTGTC GGCGGGCAAC CTGAAGCAGG TCTGGACGAA CGGCGGGCGC GCGCTCGACT GGAACGCGGA TGCCGTGGCC TGTGGGCGGA GCTGGCTCGA TCGGGCGACC CAGATCAAGA CGATCTGGGT GCCCTACGGC GCCTGA
|
Protein sequence | MSIKDIMESM DYGPAPEAAT DAKAWLAARG HALGHYIDGA FTGAETAAIE VENPATGEIL ARIPAAGEAE IEAAVAAARA AFSGWSQLPG FERARYLYAI ARGLQKRERF FSVLETLDNG KAIRETRTAD VPLAIRHFYH HAGWAAVLGE EFPGHEPLGV CGQVIPWNFP MLMLAWKIAP ALAAGNTVVL KPADLTPLTA VAFAEMLDEI GLPRGVVNIV HGGAETGALL VRHPGVAKVA FTGSTAVGRE IRRATAGSGK SLTLELGGKS PFVVCADADL DAAVEGVVEG VWFNQGEVCC AGSRLLLQEG IAERFLAKLR ARMEKIRVGD PLDKSTDMGA IVSARQKARI EELIAGAARE GYRLEQAACP LPAAGHFVAP GFFADTEPAA TVAQVEIFGP IAVTTTFRTV DEAVALANNT PYGLAASVWS ENINAATELA ARIRAGVVWI NASNLFDAGA SFGGMKESGF GREGAREGLG AYLRPRTPRG PEALVAPVDF TAHTGMGGNL SGLIDRTMKN YIGGAQVRPD GGASYVVRGP KGEALGLAPV SGRKDIRNAV EAALKAKGWA ANAHGRAQVL FFLAENIAAR AEDLAAALVQ GGAGRSEAAA EVRSLIERVF FYAGMADKDD GRIHATKPRH LTLSVKEPLG VVGVLAPDEA PLLSLMSLIL PLIAAGNRVV AVPSPAQALL AQPLTQIFDT SDLPGGVVNL VTGDRNLLAR TLAEHDAVDG IWYHGSAKGA AEVEALSAGN LKQVWTNGGR ALDWNADAVA CGRSWLDRAT QIKTIWVPYG A
|