Gene Rsph17029_3472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3472 
Symbol 
ID4898854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp548738 
End bp551113 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content72% 
IMG OID640114069 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_001045337 
Protein GI126464224 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.27044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA AGGACATCAT GGAAAGCATG GATTACGGCC CTGCGCCCGA GGCCGCGACG 
GATGCGAAGG CGTGGCTGGC GGCGCGCGGC CATGCTCTCG GACATTACAT CGACGGCGCC
TTCACAGGCG CCGAGACGGC CGCCATCGAG GTCGAGAACC CGGCCACGGG CGAGATCCTC
GCCCGGATCC CCGCGGCGGG CGAGGCCGAG ATCGAGGCCG CCGTGGCCGC GGCCCGCGCG
GCCTTCTCCG GCTGGTCGCA GCTTCCGGGC TTCGAGCGGG CGCGGTACCT CTACGCCATC
GCCCGCGGCC TGCAGAAGCG CGAGCGGTTC TTCTCGGTCC TCGAGACGCT CGACAACGGC
AAGGCCATCC GCGAGACCCG CACCGCCGAC GTGCCGCTCG CCATCCGCCA CTTCTATCAT
CATGCGGGCT GGGCCGCGGT GCTGGGCGAG GAGTTTCCGG GCCACGAGCC GCTCGGCGTC
TGCGGTCAGG TGATCCCCTG GAACTTCCCG ATGCTGATGC TGGCGTGGAA GATCGCGCCG
GCGCTGGCGG CGGGCAACAC GGTGGTGCTG AAGCCCGCCG ACCTCACGCC GCTGACCGCG
GTGGCCTTCG CCGAGATGCT GGACGAGATC GGCCTGCCCA GGGGCGTGGT CAATATCGTC
CATGGCGGGG CCGAGACCGG CGCGCTTCTC GTCCGCCATC CGGGCGTGGC GAAGGTGGCC
TTCACCGGGT CGACCGCCGT CGGCCGCGAG ATCCGGCGCG CGACAGCCGG CAGCGGCAAG
AGCCTGACGC TCGAGCTCGG CGGCAAGTCG CCCTTCGTGG TCTGCGCCGA TGCCGATCTC
GATGCCGCCG TCGAAGGGGT GGTGGAGGGC GTCTGGTTCA ACCAGGGCGA GGTCTGCTGC
GCGGGCTCGC GGCTTCTGCT GCAGGAGGGG ATCGCCGAGC GGTTCCTCGC CAAGCTCCGG
GCGCGGATGG AGAAGATCCG CGTGGGCGAT CCGCTCGACA AATCCACCGA CATGGGCGCC
ATCGTCTCGG CGCGCCAGAA GGCCCGGATC GAGGAGCTGA TCGCGGGCGC GGCGCGCGAG
GGCTACCGGC TTGAGCAGGC GGCCTGCCCG CTGCCCGCGG CCGGTCATTT CGTGGCGCCC
GGCTTCTTCG CCGACACCGA GCCCGCGGCC ACCGTCGCGC AGGTCGAGAT CTTCGGCCCG
ATCGCGGTCA CGACCACCTT CCGCACCGTC GACGAGGCGG TGGCGCTGGC CAACAACACG
CCCTACGGGC TCGCGGCCTC GGTCTGGTCC GAGAACATCA ATGCCGCGAC CGAGCTTGCC
GCGCGGATCC GCGCGGGCGT GGTCTGGATC AACGCCTCGA ACCTCTTCGA TGCGGGCGCG
TCCTTCGGCG GCATGAAGGA GAGCGGCTTC GGGCGCGAGG GCGCGCGGGA AGGCCTCGGC
GCCTATCTGC GCCCGCGCAC GCCGCGCGGC CCCGAGGCGC TGGTGGCGCC CGTGGATTTC
ACCGCCCACA CCGGCATGGG CGGCAACCTG TCCGGCCTCA TCGACCGCAC GATGAAGAAC
TACATCGGCG GCGCGCAGGT CCGGCCCGAC GGCGGCGCGA GCTATGTGGT GCGCGGCCCG
AAGGGCGAGG CCCTGGGCCT CGCGCCCGTC TCGGGGCGCA AGGACATCCG CAACGCCGTC
GAGGCTGCGC TGAAGGCGAA GGGCTGGGCC GCCAACGCCC ATGGCCGGGC GCAGGTGCTG
TTCTTCCTCG CCGAGAACAT CGCCGCCCGC GCCGAGGATC TTGCGGCGGC CCTCGTGCAG
GGCGGCGCCG GCAGGTCCGA GGCCGCAGCG GAGGTGCGCA GCCTCATCGA GCGGGTGTTC
TTCTATGCCG GCATGGCCGA CAAGGACGAT GGCCGCATCC ATGCGACGAA GCCGCGCCAC
CTGACGCTCT CGGTGAAGGA GCCGCTGGGG GTGGTGGGGG TTCTCGCGCC CGACGAGGCG
CCGCTCCTGT CGCTCATGTC GCTGATCCTG CCGCTGATCG CTGCGGGCAA CCGCGTGGTG
GCGGTGCCCT CGCCCGCGCA GGCGCTGCTG GCGCAGCCGC TCACCCAGAT CTTCGACACG
TCGGATCTGC CCGGGGGCGT GGTGAACCTC GTCACCGGCG ACCGCAATCT CCTCGCCCGG
ACGCTCGCCG AGCACGATGC GGTGGACGGG ATCTGGTATC ACGGATCGGC CAAGGGCGCG
GCGGAGGTCG AGGCGCTGTC GGCGGGCAAC CTGAAGCAGG TCTGGACGAA CGGCGGGCGC
GCGCTCGACT GGAACGCGGA TGCCGTGGCC TGTGGGCGGA GCTGGCTCGA TCGGGCGACC
CAGATCAAGA CGATCTGGGT GCCCTACGGC GCCTGA
 
Protein sequence
MSIKDIMESM DYGPAPEAAT DAKAWLAARG HALGHYIDGA FTGAETAAIE VENPATGEIL 
ARIPAAGEAE IEAAVAAARA AFSGWSQLPG FERARYLYAI ARGLQKRERF FSVLETLDNG
KAIRETRTAD VPLAIRHFYH HAGWAAVLGE EFPGHEPLGV CGQVIPWNFP MLMLAWKIAP
ALAAGNTVVL KPADLTPLTA VAFAEMLDEI GLPRGVVNIV HGGAETGALL VRHPGVAKVA
FTGSTAVGRE IRRATAGSGK SLTLELGGKS PFVVCADADL DAAVEGVVEG VWFNQGEVCC
AGSRLLLQEG IAERFLAKLR ARMEKIRVGD PLDKSTDMGA IVSARQKARI EELIAGAARE
GYRLEQAACP LPAAGHFVAP GFFADTEPAA TVAQVEIFGP IAVTTTFRTV DEAVALANNT
PYGLAASVWS ENINAATELA ARIRAGVVWI NASNLFDAGA SFGGMKESGF GREGAREGLG
AYLRPRTPRG PEALVAPVDF TAHTGMGGNL SGLIDRTMKN YIGGAQVRPD GGASYVVRGP
KGEALGLAPV SGRKDIRNAV EAALKAKGWA ANAHGRAQVL FFLAENIAAR AEDLAAALVQ
GGAGRSEAAA EVRSLIERVF FYAGMADKDD GRIHATKPRH LTLSVKEPLG VVGVLAPDEA
PLLSLMSLIL PLIAAGNRVV AVPSPAQALL AQPLTQIFDT SDLPGGVVNL VTGDRNLLAR
TLAEHDAVDG IWYHGSAKGA AEVEALSAGN LKQVWTNGGR ALDWNADAVA CGRSWLDRAT
QIKTIWVPYG A