Gene Rsph17029_3176 details


Gene Information

Locus tag: Rsph17029_3176
Symbol:
ID: 4899158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 203759
End bp: 205246
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 67%
IMG OID: 640113778
Product: betaine-aldehyde dehydrogenase
Protein accession: YP_001045048
Protein GI: 126463935
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
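
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(203759, 205246)
print(length)                  # 1488
print(protein_length(length))  # 495
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of this record.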


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.762089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCAAG AGATCGACTG GCACGCCCGC GCCAAAGCCG TGGCCCTTCC GAACCGTCCC 
TTCATCGACG GCCAGTATGT CGACAGCCTG TCGGGCGAGA CCTTCGCCTG CATCTATCCC
GGCGATGGGC GGCCGCTGAC CGATGTGGCC TCGTGCAATG CGGCGGATGT GGATGTCGCG
GTGCGGTCGG CCCGCCGGGC CTTCGAGGCG GGGGTCTGGT CGCGGATGGC CCCGGCCGAC
CGGCGCCGCA TCCTGCTGCG CTTCTCCGAG CTGATCCTCG CGAACCGCGA GGAGCTGGCG
CTCCTCGAGA CGCTGAACGT GGGCAAGCCC ATCGCCAATG CCTACAACGG CGACGTGGTG
AGCGCTGCCG CCTGCATCGC CTGGTATGCC GAGGCGATCG ACAAGGTCTA TGGCGAGGTG
GCGGCCACGG CGCACGACAT GACGACGCTC GTCATGCGCG AGCCCTTGGG GATCGTGGCG
GCCGTGGTGC CGTGGAACTA TCCGATGTCG ATGGCCGCCT GGAAGCTCGG GCCGGCGCTC
GCCACCGGCA ACTCGGTGAT CCTGAAGCCC GCCGAACAAT CGCCCTTCAC CGCGCTGAAG
TTCGGCGAAC TCGCCATCGA GGCGGGGATG CCTCCGGGGG TGCTGAACGT GGTACCGGGC
CTCGGCCATA TCGCGGGCAA GGCGCTCGGG CTGCATATGG ATGTCGATTG TGTGGGCTTC
ACCGGCTCGA CCGAAGTCGG CAAGTACTTC ATGCAATATT CGGGCCAGTC GAACATCAAG
CGCATCGGGC TCGAACTCGG CGGCAAGTCG CCCCAGGTCG TGCTCGCGGA TTGCGACGAT
CTCGATGCGG CGGCGGCCGG GATTGCGGCC GGCATCTTCG CCAACACCGG GCAGGTCTGC
AACGCGGGCT CGCGCCTGAT CGTGGACGAG AAGATCCACG ACCAGCTCCT CGAGAAGATC
GCGGCTCAGG CCAAGGTCTT CGCGCCCGGC GATCCGCTCG ACACCGCCAC CCGCATGGGC
TCGATGGTGA GCGAAGAGCA GATGGACCGC GTGCTGGGCT ACATCGACGC CGGCCGCGCG
GACGGTGCAC GGCCGGTCAT CGGCGGCGGC CGGGTGAAGA CCGAAACCGG CGGCTTCTAC
ATCGAACCCA CGATCTTCGA GGGCGTGCGC AACGACATGA AGATCGCGCA GGAAGAGATC
TTCGGGCCGG TGCTGTCGGC CATCCCGGTG AAGGGCTTCG ACGAGGCGAT GGCGGTGGCC
AATGACACGG TCTACGGGCT TGCGGGCGCG GTCTGGACCG GGTCGGTGAA GAACGCGCAC
CGGGCGGCCA AGTCGCTCCG CGCGGGCGTG GTCTGGGTCA ACTGCTTCGA CCGCGGCTCG
CTCGCCGTGC CCTTCGGCGG CTTCAAGCAA TCGGGCTTCG GCCGGGACAA GTCGCTGCAT
GCGATGGACA AATACACCGA CCTCAAGGCC GTCTGGTTCG CTCACTGA
 
Protein sequence
MLQEIDWHAR AKAVALPNRP FIDGQYVDSL SGETFACIYP GDGRPLTDVA SCNAADVDVA 
VRSARRAFEA GVWSRMAPAD RRRILLRFSE LILANREELA LLETLNVGKP IANAYNGDVV
SAAACIAWYA EAIDKVYGEV AATAHDMTTL VMREPLGIVA AVVPWNYPMS MAAWKLGPAL
ATGNSVILKP AEQSPFTALK FGELAIEAGM PPGVLNVVPG LGHIAGKALG LHMDVDCVGF
TGSTEVGKYF MQYSGQSNIK RIGLELGGKS PQVVLADCDD LDAAAAGIAA GIFANTGQVC
NAGSRLIVDE KIHDQLLEKI AAQAKVFAPG DPLDTATRMG SMVSEEQMDR VLGYIDAGRA
DGARPVIGGG RVKTETGGFY IEPTIFEGVR NDMKIAQEEI FGPVLSAIPV KGFDEAMAVA
NDTVYGLAGA VWTGSVKNAH RAAKSLRAGV VWVNCFDRGS LAVPFGGFKQ SGFGRDKSLH
AMDKYTDLKA VWFAH
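
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool used to build this record: it assumes the sequences are pasted in with the 10-base spacing shown above, uses the codon-to-amino-acid assignments of translation table 11 (which match the standard code), and does not handle alternative start codons or ambiguous bases. The names `translate` and `gc_percent` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGCTGCAAG AGATCGACTG GCACGCCCGC GCCAAAGCCG TGGCCCTTCC GAACCGTCCC"
print(translate(first_line))  # MLQEIDWHARAKAVALPNRP
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` allows the complete protein sequence and the GC content field to be checked the same way.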