Gene RPC_3791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3791 
Symbol 
ID3969479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4214984 
End bp4216417 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content69% 
IMG OID637926901 
Productaldehyde dehydrogenase 
Protein accessionYP_533644 
Protein GI90425274 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGG CGCTGCACAG TGCCGGGCGC AGAGCGCTCG ACGACGACTT TCATCGGATG 
TGGGCGACGG CGCGGCAAAC CCCGCCGCCG CCTCTGGAGA CCCGCCTCGA CCGGCTGGCG
CGCCTGCGGG CGCTGATTCG CGACCATGAG GCGGGGTTCA GCGCGGCGAT CTCAGCGGAT
TTCGGCCATC GCTGCCCGGT GGAAACCCAG ATCGCCGAGA CGTTGTCGGT GCTGGGCGAG
ATCAAGCACA CCGCGAGGCA TCTGAAAGCC TGGATGGCGC CGCGGCGCAT CGCCACCCAG
TTGCAGTTCC TACCCGGCCG CAACCGGCTG ATCCCGCAGC CGCTCGGGGT GGTCGGCATC
ATCGCGCCGT GGAATTACCC GTTGCAGCTG ACCTTGGCCC CCGCGGTGGC GGCGCTGGCC
GCCGGCAATG CGGCGATGAT CAAACCCAGC GAATTGACGC CGCGGTTTGC CGCCCTGCTG
CAGGAAACCG TGGCGGCCAA GTTCGCGCCC GACGAGATGG TGGTGACCGG CATCGAGGAC
GACATCGCCG AAGCCTTCGC CGCGCTGCCG TTCGACCATC TGATGTTCAC CGGCTCGACC
AGGGTCGGCC GCATCGTCGC CGCCGCCGCC GGCCGCAATC TCACCCCGGT GACGCTGGAG
CTCGGCGGCA AGTCGCCGGT GATCATCGAC GCCTCCGCCG ACCTCGACCA GGCGGCGGCG
CGGATCGCCT ATGCCAAATT GCTCAACGCC GGCCAGACCT GCATCGCACC GGATTACGTG
CTGGTGCCGA ACGTCTCGCT GCAGGCCTTC GCCGACAAGC TGCGCGACGC GATGCGCCGC
ATGTTCGGCG CCGACCCCGG CAACCAGGAC TACAGTTCGA TCATCGCCGA GCGGCATTAT
GCCCGGCTCG AGGGCCTGCT CGCCGATGCG CGGGCGCTCG GCGCCAGCGT CGTGCAGAGC
GCCTCGCCCG ACGACGCCGC GTGGAAAGCG CTGCGCAAAT TCCCGCCGAC GGTGCTGACC
GGCGTCAGCT CCGAGATGAA GATCATGCAG GAGGAGATCT TCGGGCCGCT GCTGCCGATC
CTCGGCTACG ACGACGCCAG CGAGCCGATC GCTTTCATCA ACGCCCGCGA CCGGCCGCTG
GCGCTGTACT GGTTCGGCAC CGACGACGCC GCGCGCGACG AGGTTTTGGC GCGCACCGTC
TCCGGCGGCG TCACCGTCAA CGACTGCCTG GTGCATTTCG CGCAAGTCAA CCAGCCGATG
GGCGGCGTCG GCGCCTCGGG AAGCGGCGCC TATCACGGCG AATGGGGCTT CAACACCTTC
AGCAAGCTGA AGCCGGTGTT CTATCGCTCG CCCTACAACC GCTTCGCCGA TCTCTATCCG
CCCTATGGCG GCACGGTCGC GCGGCTCGCC AAACTGCTGC GCTGGCTGTC CTAG
 
Protein sequence
MDQALHSAGR RALDDDFHRM WATARQTPPP PLETRLDRLA RLRALIRDHE AGFSAAISAD 
FGHRCPVETQ IAETLSVLGE IKHTARHLKA WMAPRRIATQ LQFLPGRNRL IPQPLGVVGI
IAPWNYPLQL TLAPAVAALA AGNAAMIKPS ELTPRFAALL QETVAAKFAP DEMVVTGIED
DIAEAFAALP FDHLMFTGST RVGRIVAAAA GRNLTPVTLE LGGKSPVIID ASADLDQAAA
RIAYAKLLNA GQTCIAPDYV LVPNVSLQAF ADKLRDAMRR MFGADPGNQD YSSIIAERHY
ARLEGLLADA RALGASVVQS ASPDDAAWKA LRKFPPTVLT GVSSEMKIMQ EEIFGPLLPI
LGYDDASEPI AFINARDRPL ALYWFGTDDA ARDEVLARTV SGGVTVNDCL VHFAQVNQPM
GGVGASGSGA YHGEWGFNTF SKLKPVFYRS PYNRFADLYP PYGGTVARLA KLLRWLS