Gene Rsph17029_3394 details


Gene Information

Locus tag: Rsph17029_3394
Symbol:
ID: 4898740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 448152
End bp: 449678
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 72%
IMG OID: 640113991
Product: aldehyde dehydrogenase
Protein accession: YP_001045259
Protein GI: 126464146
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
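
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 448152
END_BP = 449678


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, ASSUMING the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))  # 1527
print(protein_length(1527))           # 508
```

Under those assumptions the results agree with the reported Gene Length (1527 bp) and Protein Length (508 aa).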


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACA CATCCGTCTT CACGCCGCAC GGCAAGCACC TGATCGCGGG CGAATGGGTG 
GCTGGCGAGA CCAGCTTCAC CTCCGATCCC GCCCACGGCC CGAGCCACGC CTTCTCGGTC
GGCACGCCTG ACCTCGTCGA CCGCGCCTGC CGGGCCGCCG AGGAGGCCTT CTGGACCTAC
GGCCAGACGA CCCGCGAAAG CCGCGCCGCC TTCCTGCGGG CCATCGCCGA CGAGATCGAG
GCCCGCGCCG AGGCGATCAC CGAGATCGGC CGTCGGGAGA CCGGCCTGCC CGAGGCGCGG
CTTCAGGGCG AGCGTGGCCG CACCATCGGG CAGTTGCGGC TCTTCGCCGA GCACATCCTC
GAGGGGGGGC ACCTCGACCG CCGTCACGAC GTGGCGCTGC CCGAGCGCCA GCCGCTGCCG
CGTCCCGACA TCCGGCTCGT GATGCGTCCC ATAGGCCCGA TCGCGGTCTT CGGCGCCTCG
AACTTCCCGC TCGCCTTCTC CACCGCCGGC GGCGACACAG CCGCCGCGCT GGCCGCGGGC
TGTCCCGTGG TGGTCAAGGG CCATGGCGCG CACCCCGGCA CCGCCGAGAT CGTCGCCGAG
GCGATCCTCG CCGCGATCAG GAAGACGGGG ATGCCGGACG GCGTCTTCTC TCTGATTCAG
GGCGGTCGGC GCGACGTGGG TCAGGCGCTG GTGCAGCACC CGCTGATCGC CGCCGTGGGC
TTCACCGGCT CGCTCAAGGG TGGGCGGGCG CTGTTCGACC TCTGCGCGCA GCGCGAGGTG
CCGATCCCCT TCTTCGGCGA ACTGGGCTCG GTCAACCCGA TGTTCCTGCT GCCCGAAGCG
GTGAAGGCGC GCGGCGCCGC GATCGGCGCC GGCTGGGCGG CTTCGCTGAC CATGGGCGCC
GGCCAGTTCT GCACCAACCC GGGCATTGCC GTGGTTGAGA TCGGCCCCGA GGGCGACGCT
TTCGTCGCCG CCGCGGCCGA GGCGCTGCGG GCTGTGCCCG CCCAGTGCAT GCTCACCCCG
GACATCGCAC AGGCCTACAG GAAGGGCAGG TCCCGCTTCG ACGGGCGCCC GGACGTGCGG
CCGGTGCTGA CCACCGACAG CGACGGCCGC AACGCCCTTC CCAACCTCTT CGAGACCGAT
GCGGCGAGCT ACCTGCGGGA CCCGGCGCTC GGTGAAGAGG TCTTCGGCCC GCTCGGCCTC
GTGGTGCGGG TCTCGGGCGC GGACGAGGTG GACGCGCTCG CCCGCGGCCT CGAGGGCCAG
CTCACCGCCA CCATTCACAT GGACGAGGGC GACACGGCGC TCGCACAGCG CCTGATGCCG
GTGCTCGAGC GCAAGGCCGG CCGGCTGCTC GTCAACGGCT TTCCGACGGG GGTCGAGGTC
TCTCACGCCA TGGTGCATGG CGGGCCTTAC CCCGCCTCGA CCAACTTTGG CGCGACCTCG
GTCGGCACGC TCTCGATCCG CCGCTTCCTG CGCCCGGTGA GCTACCAGAA CCTGCCCCCT
GCGCTGCTGC CGCGCGATCT GGCCTGA
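
The gene sequence is laid out in blocks of ten bases, so the spacing has to be removed before the sequence can be measured. The following is a minimal sketch of how the GC content reported in the record (72%) could be recomputed from the text. The helper names are hypothetical, and only the first row of the sequence is used here as sample input, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGCTCGACA CATCCGTCTT CACGCCGCAC GGCAAGCACC TGATCGCGGG CGAATGGGTG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` should give a string of 1527 bases if the listing is complete.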
 
Protein sequence
MLDTSVFTPH GKHLIAGEWV AGETSFTSDP AHGPSHAFSV GTPDLVDRAC RAAEEAFWTY 
GQTTRESRAA FLRAIADEIE ARAEAITEIG RRETGLPEAR LQGERGRTIG QLRLFAEHIL
EGGHLDRRHD VALPERQPLP RPDIRLVMRP IGPIAVFGAS NFPLAFSTAG GDTAAALAAG
CPVVVKGHGA HPGTAEIVAE AILAAIRKTG MPDGVFSLIQ GGRRDVGQAL VQHPLIAAVG
FTGSLKGGRA LFDLCAQREV PIPFFGELGS VNPMFLLPEA VKARGAAIGA GWAASLTMGA
GQFCTNPGIA VVEIGPEGDA FVAAAAEALR AVPAQCMLTP DIAQAYRKGR SRFDGRPDVR
PVLTTDSDGR NALPNLFETD AASYLRDPAL GEEVFGPLGL VVRVSGADEV DALARGLEGQ
LTATIHMDEG DTALAQRLMP VLERKAGRLL VNGFPTGVEV SHAMVHGGPY PASTNFGATS
VGTLSIRRFL RPVSYQNLPP ALLPRDLA
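
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship on the first and last few codons of the gene. It builds the codon map from the standard NCBI base ordering (T, C, A, G), and it ignores the alternative start codons that table 11 allows, which is a simplification that is harmless here only because this gene begins with ATG. The names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11, in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


print(translate("ATGCTCGACA CATCC"))  # MLDTS (first five codons of the gene)
print(translate("CTGGCCTGA"))         # LA*   (last three codons, ending in stop)
```

Both outputs match the ends of the protein sequence above, which starts with MLDTS and finishes with LA.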