Gene RPB_3107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3107 
Symbol 
ID3910908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3540073 
End bp3541464 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content65% 
IMG OID637885011 
Productaldehyde dehydrogenase 
Protein accessionYP_486716 
Protein GI86750220 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGCTG ACAAGGACGG TCGTTTCGTC GTTTACAATC CCTGGACGGC CCAGGAAGCC 
TTTACGACGC CGGGCACCAG CGAGCAGCGT GTCGATGAGA TCGCGCGCGA AGCGAAGGCC
GCGTTCTATC AGCATCTCAA GACGCCGGGA CACGTTCGGG CGGAATGGAT CCATGGCGCC
GCCGCCGCGC TGGAACGGGC GCGGAGCGAG ATCGTCGAGG CGATGATCGC GCATATCGGC
AAGCCGCGCA AAACTGCCGA GATGGAGGTC AGGCGCAGCG TCGCTTTCAT CGCGGCCTGC
GCACAGCATC TGCATGCGCT GGGCGGACAT TTGCTGCCTC TGGACATGGT GCCGGCGGGT
GTCGGCTCTG TCGGATTTGC GCGGCGGATG CCGTATGGTG TCGTCGCTGC GGTCACCCCG
TTCAATGCAC CGTCCAACCT GCTGGTGCAG AAATTGGCGC CGGCGCTCGC CACCGGCAAC
GCCGTCGTGA TCAAGCCGTC TCTCGAGGGC ACGCGGATCG CAGAGATGAT TGCACGGGCC
TTCGTCACGG GCGGCGTTCC CGAGGGGCTG GTGTGGGTGG TTCCTGGCGA TCGCGCAGAG
GCGCTGGGCT TGGCTGGGCA CCGCGACGTC GATCTTGTAA CCCTAACCGG AGGCACTGCG
GCCGGCGACG CACTGGCGCG GGCTGCCGGC GCAAAGCGAT TTCTCGGCGA ACTCGGCGGC
AATTCGCCGA ACATCGTTGC GGCGGACGCA GACATCGAGG ACGCGGTGAA ACGCATCGTG
CCGTCGTCGT TCGAAGCGAG TGGCCAGCAA TGCATTTCAA CGCAAAGGAT CATTGTTGAA
GCGCCGGTGT TCGATCGGTT CCTTGCTCTC TTCGTCGAGG AAACGAAGCG TCTCAAGGTC
GGCGATCCGG CCGCGGCGGA TACCGACCTC GGTCCCGTGG TGTCGCGAGT CTCTGCCGAG
CGGATTGCGG CGATGATCGA AGACGCGCGC GCCCTCGGCG CCCGCGTGAT CAGCTGCGGC
GAAATCCGGG ATTGCGTGAT CCCGCCGACC ATCGTGGTCG AGCCGCCGGC GGCGGCGCGG
CTCATCCGCG AGGAGGTGTT CGGACCTGTC GTCGTGGTGC TCCGCGCCGC CGATGTCGAC
GATGCCATCC GCATCGCCAA TGATTGCGAA TTCGGCTTGC AAGGCTCGTG TTTCACCGCG
AGCCTGTCGA CCGCGTTGCG GGTCGCGGAC GAGGTGCGTG TCGGATCGCT GTGGATCAAC
GAGGCCAGCC GGTTCCGGCT CGACAACTAT CCGTTCGGTG GCATGGGCCG CTCGGGCGTC
GGCCGCGAAG GGTTACCTTA TGCGCTGGAG GAGTACACCC AGCTCAAGTT CACCGGAATG
CGCGGGGTAT AG
 
Protein sequence
MQADKDGRFV VYNPWTAQEA FTTPGTSEQR VDEIAREAKA AFYQHLKTPG HVRAEWIHGA 
AAALERARSE IVEAMIAHIG KPRKTAEMEV RRSVAFIAAC AQHLHALGGH LLPLDMVPAG
VGSVGFARRM PYGVVAAVTP FNAPSNLLVQ KLAPALATGN AVVIKPSLEG TRIAEMIARA
FVTGGVPEGL VWVVPGDRAE ALGLAGHRDV DLVTLTGGTA AGDALARAAG AKRFLGELGG
NSPNIVAADA DIEDAVKRIV PSSFEASGQQ CISTQRIIVE APVFDRFLAL FVEETKRLKV
GDPAAADTDL GPVVSRVSAE RIAAMIEDAR ALGARVISCG EIRDCVIPPT IVVEPPAAAR
LIREEVFGPV VVVLRAADVD DAIRIANDCE FGLQGSCFTA SLSTALRVAD EVRVGSLWIN
EASRFRLDNY PFGGMGRSGV GREGLPYALE EYTQLKFTGM RGV