Gene RPB_3846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3846 
Symbol 
ID3911650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4397463 
End bp4398896 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content68% 
IMG OID637885747 
Productaldehyde dehydrogenase 
Protein accessionYP_487450 
Protein GI86750954 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGC CGTTGGGCGG CTTCGGGGGC AATGCCCTGC ACGACAGCTT TCACCGCATG 
ATCGAGCGCT CCCGCGCCGA GCCCCCGGCG TCGCTGGAGC AGCGGCTCGA CCGGCTGGCG
CGGCTGCGCG GTTTGCTCAA AGACAATGAG ACGCGATTCG AGCAGGCGAT CTCGGCCGAT
TTCGGCCATC GCTGTTCGGT CGAAACCATG ATCGCAGAGA CGTTGAGCCT GCTCGGCGAC
ATCAAGCACA CCAGCAAGCA CGTCAAAGGC TGGATGGCGC CGCGCAAGGT GGCGACCCAG
CCGCAATTCT GGCCGGGCAA GAACCGGCTG ATCCCGCAGC CGCTCGGCGT GGTCGGCATC
ATCGCGCCGT GGAACTATCC GTTGCAGCTC ACGATCGCGC CGGCGATCGG CGCGCTGGCG
GCCGGCAATC GGGTGATGAT CAAGCCCAGC GAATTGTCGC CCGCGTTCTC CGCCCTGCTG
CAGGAGACGG TGGCGGCAAA GTTCGATCCC ACCGAGATGA TCGTGACCGG GATCGACGAC
GGCGTCGCCG AGGCGTTCGC GAAGCTGCCG TTCGATCACC TGATGTTCAC CGGCTCGACC
CGGGTCGGCC GCATCGTCGC GGCGGAAGCG GGCAAGAACC TCACCCCGGT CACGCTCGAA
CTCGGCGGCA AGTCGCCGAC CATCATCGAC CGCTCCGCCG ATCTCGACGA GGTGGCGCCG
CGGATCGCCT ATGCCAAGCT GATGAATGCC GGGCAGACCT GCATCGCGCC GGACTACGTG
CTGGCGCCGC GCGACAAGGT CGAGGCGCTG GCGGGCAAGA TCCGCGACGC GATGCAGCGG
ATGTTCGGCG CCGATCCCGC GAATACGGAC TACACCTCGA TCGTCGCCGA CCGGCACTAC
GCGCGGCTGA AGGGCCTCGT CGACGACGCC GCCGCGCGCG GCGCGAGGCT GCTGCAACCG
GCCCCGGCCG ACGATGCGGC GTGGCAGAGC CGGCGAAAAT TCCCGCCGAC CGTGGTGCTC
GGCGCCACGC CCGAGATGAA GATCATGCAG GAGGAAATCT TCGGGCCGCT GCTGCCGATC
CTCGGCTACG ACGATCCCGC CGACCCGATC GCCTTCATCA ACGGCCGCGA CCGGCCGCTG
GCGCTGTACT GGTTCGGCAC CGACGAGGCG GCGCGCGACG AGGTGCTGCA ACGCACCGTG
TCCGGCGGTG TGACGATCAA CGACTGCCTA GTGCATTTCG CGCAGGTGAA CCAGCCGATG
GGCGGCGTCG GCGCCTCGGG CACCGGCGCG TATCACGGCG AATGGGGCTT CAACACCTTC
ACGCAGCTCA AGCCGGTGTT CTATCGCTCG CCCTACAACC GGTTCGCCGA TCTGTATCCG
CCCTATGGCG GCAAGATCGC GCGGCTGGCG AAAGTGCTGC GCTGGATGTC CTGA
 
Protein sequence
MDQPLGGFGG NALHDSFHRM IERSRAEPPA SLEQRLDRLA RLRGLLKDNE TRFEQAISAD 
FGHRCSVETM IAETLSLLGD IKHTSKHVKG WMAPRKVATQ PQFWPGKNRL IPQPLGVVGI
IAPWNYPLQL TIAPAIGALA AGNRVMIKPS ELSPAFSALL QETVAAKFDP TEMIVTGIDD
GVAEAFAKLP FDHLMFTGST RVGRIVAAEA GKNLTPVTLE LGGKSPTIID RSADLDEVAP
RIAYAKLMNA GQTCIAPDYV LAPRDKVEAL AGKIRDAMQR MFGADPANTD YTSIVADRHY
ARLKGLVDDA AARGARLLQP APADDAAWQS RRKFPPTVVL GATPEMKIMQ EEIFGPLLPI
LGYDDPADPI AFINGRDRPL ALYWFGTDEA ARDEVLQRTV SGGVTINDCL VHFAQVNQPM
GGVGASGTGA YHGEWGFNTF TQLKPVFYRS PYNRFADLYP PYGGKIARLA KVLRWMS