Gene Information |
Locus tag | RSP_3740 |
Symbol | |
ID | 3721500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | + |
Start bp | 865349 |
End bp | 867724 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640073411 |
Product | aldehyde dehydrogenase |
Protein accession | YP_355248 |
Protein GI | 77465745 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
| |
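The coordinate and length fields in the record above can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is a single unspliced reading frame whose last codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 865349
end_bp = 867724
gene_length_bp = 2376
protein_length_aa = 791

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one unspliced reading frame whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span_bp, 3)
predicted_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", remainder == 0)
print("predicted length matches Protein Length:", predicted_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: the span is 2376 bp, which is 792 codons, giving 791 residues after the stop codon is excluded.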
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.977934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCA AGGACATCAT GGAAAGCATG GATTACGGCC CTGCGCCCGA GGCCGCGACG GATGCGAAGG CCTGGCTGGC GGCGCGCGGC CATGCTCTCG GACATTACAT CGACGGCGCC TTCACAGGCG CCGAGACGGC CGCCATCGAG GTCGAGAACC CGGCCACGGG CGAGATCCTC GCCCGGATCC CCGCGGCGGG TGAGGCCGAG ATCGAGGCCG CCGTGGCGGC GGCCCGCGCG GCCTTCTCCG GCTGGTCGCA GCTTCCGGGC TTCGAGCGGG CGCGCTACCT CTATGCCATC GCCCGCGGCC TGCAGAAGCG CGAGCGGTTC TTCTCGGTCA TCGAGACGCT CGACAACGGC AAGGCCATCC GCGAGACCCG CACCGCCGAC GTGCCGCTCG CCATCCGCCA CTTCTACCAC CATGCGGGCT GGGCCGCGGT GCTGGGCGAG GAGTTTCCGG GCCACGAGCC GCTCGGCGTC TGCGGTCAGG TGATCCCCTG GAACTTCCCG ATGCTGATGC TGGCGTGGAA GATCGCGCCG GCGCTGGCGG CGGGCAACAC GGTGGTGCTG AAGCCCGCCG ATCTCACGCC GCTGACCGCG GTGGCCTTCG CCGAGATGCT GGAGGAGATC GGCCTGCCCA GGGGCGTGGT CAACATCGTC CATGGCGGGG CCGAGACCGG CGCGCTTCTC GTCCGCCATC CGGGCGTGGC GAAGGTGGCC TTCACCGGCT CGACCGCCGT CGGCCGCGAG ATCCGGCGCG CGACGGCCGG CAGCGGCAAG AGCCTGACGC TCGAACTCGG CGGCAAGTCG CCCTTCGTGG TCTGCGCCGA TGCCGATCTC GATGCGGCCG TCGAAGGGGT GGTGGAGGGC GTCTGGTTCA ACCAGGGCGA GGTCTGCTGC GCGGGCTCGC GGCTTCTGCT GCAGGAGGGG ATCGCCGAGC GATTCCTCGC CAAGCTCCGG GCGCGGATGG AGAAGATCCG CGTGGGCGAC CCGCTCGACA AATCCACCGA CATGGGCGCC ATCGTCTCGG CGCGCCAGAA GGCCCGGATC GAGGAGCTGA TCGCGGGCGC GGCGCGCGAG GGCTACCGGC TCGAGCAGGC GGCCTGCCCG CTGCCCGCGG CGGGCCATTT CGTGGCGCCC GGCTTCTTCG CCGACACCGA GCCCGCGGCC ACCGTCGCGC AGGTCGAGAT CTTCGGCCCG ATCGCGGTCA CGACCACCTT CCGCACCGTC GACGAGGCGG TGGCGCTGGC CAACAACACG CCCTACGGGC TTGCGGCCTC GGTCTGGTCC GAGAATATCA ATGCCGCGAC CGAGCTTGCC GCGCGGATCC GCGCGGGCGT GGTCTGGATC AACGCCTCGA ACCTCTTCGA TGCGGGCGCG CCCTTCGGCG GCATGAAGGA GAGCGGCTTC GGGCGCGAGG GCGCGCGCGA GGGTCTCGGC GCCTATCTGC GCCCGCGCAC GCCGCGCGGC CCCGAGGCGC TGGTGGCGCC CGTGGATTTC ACCGCCCATA CCGGCACGGG CGGCAGCCTG ACCGGACTGA TCGACCGCAC GATGAAGAAC TACATCGGCG GCGCGCAGGT CCGGCCCGAC GGCGGCGCGA GCTATGTGGT GCGCGGGCCG AAGGGCGAGG CCCTGGGCCT CGCGCCCGTG TCGGGGCGCA AGGACATCCG CAACGCCGTC GAGGCCGCGC TGAAGGCCAA GGGCTGGGCC GCCAGCGCCC ATGGCCGGGC GCAGGTGCTG TTCTTCCTTG CCGAGAACAT CGCCGCCCGC GCCGAGGATC TTGCGGCGGC CCTCGTGCAG 
GGCGGCGCCG GCAGATCCGA GGCCGCAGCG GAGGTGCGCA GCCTCATCGA GCGGGTGTTC TTCTATGCCG GCATGGCCGA CAAGGACGAT GGCCGCATCC ATGCGACGAA GCCGCGCCAC CTGACGCTCT CGGTGAAGGA GCCGCTGGGG GTGGTGGGGG TGCTTGCGCC CGACGAGGCG CCGCTCCTGT CGCTCATGTC GCTGATCCTG CCGCTGATCG CTGCGGGCAA CCGCGTGGTG GCGGTGCCCT CGCCCGCGCA GGCGCTGCTG GCGCAGCCGC TCACCCAGAT CTTCGACACG TCGGATCTGC CCGGGGGCGT GGTGAACCTC GTCACCGGCG ACCGCGACCT CCTCGCCCGG ACGCTCGCCG AGCACGATGC GGTGGACGGG CTCTGGTATC ACGGATCGGC CAAGGGAGCG GCGGAGGTCG AGGCGCTGTC GGCGGGCAAC CTGAAGCAGG TCTGGACGAA CGGCGGGCGC GCGCTCGACT GGAACGCGGA TGCCGTGGCC TGCGGGCGGA GCTGGCTCGA CCGGGCGACC CAGATCAAGA CGATCTGGGT GCCCTACGGC GCCTGA
|
Protein sequence | MSIKDIMESM DYGPAPEAAT DAKAWLAARG HALGHYIDGA FTGAETAAIE VENPATGEIL ARIPAAGEAE IEAAVAAARA AFSGWSQLPG FERARYLYAI ARGLQKRERF FSVIETLDNG KAIRETRTAD VPLAIRHFYH HAGWAAVLGE EFPGHEPLGV CGQVIPWNFP MLMLAWKIAP ALAAGNTVVL KPADLTPLTA VAFAEMLEEI GLPRGVVNIV HGGAETGALL VRHPGVAKVA FTGSTAVGRE IRRATAGSGK SLTLELGGKS PFVVCADADL DAAVEGVVEG VWFNQGEVCC AGSRLLLQEG IAERFLAKLR ARMEKIRVGD PLDKSTDMGA IVSARQKARI EELIAGAARE GYRLEQAACP LPAAGHFVAP GFFADTEPAA TVAQVEIFGP IAVTTTFRTV DEAVALANNT PYGLAASVWS ENINAATELA ARIRAGVVWI NASNLFDAGA PFGGMKESGF GREGAREGLG AYLRPRTPRG PEALVAPVDF TAHTGTGGSL TGLIDRTMKN YIGGAQVRPD GGASYVVRGP KGEALGLAPV SGRKDIRNAV EAALKAKGWA ASAHGRAQVL FFLAENIAAR AEDLAAALVQ GGAGRSEAAA EVRSLIERVF FYAGMADKDD GRIHATKPRH LTLSVKEPLG VVGVLAPDEA PLLSLMSLIL PLIAAGNRVV AVPSPAQALL AQPLTQIFDT SDLPGGVVNL VTGDRDLLAR TLAEHDAVDG LWYHGSAKGA AEVEALSAGN LKQVWTNGGR ALDWNADAVA CGRSWLDRAT QIKTIWVPYG A
|