Gene RPB_3249 details

Gene Information

Locus tag: RPB_3249
Symbol:
ID: 3911050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3712826
End bp: 3714256
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 66%
IMG OID: 637885151
Product: aldehyde dehydrogenase
Protein accession: YP_486856
Protein GI: 86750360
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
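
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA).

```python
# Sketch: cross-check the record's coordinates against its reported lengths.
# Assumptions (not stated by the record): coordinates are 1-based and
# inclusive, and the last codon of the CDS is a stop codon.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3712826, 3714256)
    print(length, protein_length(length))  # prints "1431 476"
```

Under those assumptions the computed values agree with the Gene Length (1431 bp) and Protein Length (476 aa) fields listed above.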


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAACC GCATGCAATT CTACATCGAC GGCGCCTGGG TCGATCCCGT CGCTCCCAAG 
TCGACGCCGG TGGTCAATCC GGCGACCGAG GACGCGATGT ACGAAGTTGC ACTCGGCTCC
AAGGCCGACG TCGACAAGGC GGTCGTCGCC GCCAAGCGCG CGTTCGAAAC CTTCTCCCAG
ACCAGCCGCG AGGAGCGCAT CGCGCTGCTC GAAAAAATCA TCGCGATCTA CAAGGGCCGC
ATGAAGGAGA TCGGCGCCGC CGTCTCCGAT GAGATGGGCG CGCCGCTGCC GATGGCGGAG
AAGATGCAGG CCGGCGCCGG CCTCGGCCAC ATCATGTCGA CGCTCGACGT GCTGAAGAAT
TATCAGTTCG AGGAGCCGAT GGGCTCGGCG GTGATCGTGC GCGAGCCGAT CGGCGTCATC
GGCATGATCA CGCCGTGGAA TTGGCCGCTG AACCAGATCG CCTGCAAGGT CGCGCCCGCG
CTCGCCGCTG GCTGCACCAT GATCCTGAAG CCCTCCGAAT TCACACCGAC CTCGGCGCTG
ATCTTCGCCG AGATCCTGCA TGAAGCCGGC GTGCCGAAGG GCGTGTTCAA TCTCGTCAAT
GGCCTCGGCC CGGAGGTCGG CGCGGCGATG AGCGAACATC CGGACATCGA CATGATTTCG
TTCACCGGCT CGACCCGCGC CGGCATCGAC GTCGCGCAGC GCGCGGCGCC GACCGTGAAG
CGCGTCAGCC AGGAGCTCGG CGGCAAGTCG CCGAACGTCA TCCTCGACGA CGCCGACCTC
ACCAAGGCGG TGACCGGCGG CGTGATGCAC ATGTTCAACA ACTCCGGCCA GTCCTGCAAC
GCGCCGAGCC GGATGATCGT GCCGCTGTCG AAGATGAAGG AGGTCGCGGC GATCGCCAAG
GGCGTCGCCG AAAAGACCAA GGCGGGCGAT CCGCGCGGCG AGGGCACCAC GATCGGCCCG
GTGGTCAATC GCGGCCAGTG GGACAAGATC CAGACGCTGA TCAACAAGGG CATCGAGGAA
GGCGCGACGC TGGTCGCCGG CGGACCCGGT CTGCCCGAAG GCGTCAACAA GGGCTTCTAC
GTCCGCCCGA CGGTGTTCGC CGACGTCACC GACAACATGA CCATCGCCCG CGAGGAGATC
TTCGGGCCGG TGCTGGTGAT CATGGGCGCC AAGGACGAGG ACGAGGCGGT CAAGCTCGCC
AACGACACGC CCTATGGTCT CGCCGGCTAC GTCTCGGCCG GTTCGGTCGA GCGTGCGCGC
AAGGTCGGCC GCCAGATCCG CGCCGGCAAC GTCAATCTGC AGGGCGTGCC GAACGAACGC
ACCGCGCCGT TCGGCGGCTA CAAGCAGTCC GGCAACGGCC GCGAGTGGGG CAAGTTCGGC
CTCGAGGAAT ATCTCGAGGT CAAGGCGATC GCCGGCTTCA ACGCCGCGTA A
 
Protein sequence
MVNRMQFYID GAWVDPVAPK STPVVNPATE DAMYEVALGS KADVDKAVVA AKRAFETFSQ 
TSREERIALL EKIIAIYKGR MKEIGAAVSD EMGAPLPMAE KMQAGAGLGH IMSTLDVLKN
YQFEEPMGSA VIVREPIGVI GMITPWNWPL NQIACKVAPA LAAGCTMILK PSEFTPTSAL
IFAEILHEAG VPKGVFNLVN GLGPEVGAAM SEHPDIDMIS FTGSTRAGID VAQRAAPTVK
RVSQELGGKS PNVILDDADL TKAVTGGVMH MFNNSGQSCN APSRMIVPLS KMKEVAAIAK
GVAEKTKAGD PRGEGTTIGP VVNRGQWDKI QTLINKGIEE GATLVAGGPG LPEGVNKGFY
VRPTVFADVT DNMTIAREEI FGPVLVIMGA KDEDEAVKLA NDTPYGLAGY VSAGSVERAR
KVGRQIRAGN VNLQGVPNER TAPFGGYKQS GNGREWGKFG LEEYLEVKAI AGFNAA