Gene RPB_3806 details


Gene Information

Locus tag: RPB_3806
Symbol:
ID: 3911609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4343970
End bp: 4345025
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 69%
IMG OID: 637885707
Product: alcohol dehydrogenase
Protein accession: YP_487411
Protein GI: 86750915
COG category: [C] Energy production and conversion
COG ID: [COG1062] Zn-dependent alcohol dehydrogenases, class III
TIGRFAM ID:
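
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4343970
END_BP = 4345025


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1056 351"
```

Under those assumptions the computed values match the Gene Length (1056 bp) and Protein Length (351 aa) reported in the record.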


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.925036
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGTGA CCTTCCGCGC CGCCGTTCTT CGTGAGCTGA ATCGGCCGCT CGCCATCGAG 
ACCGTCGAGG CGCCGCCGCT CGCGGCCGGG CAGGTGCTGG TCAAGCTGGC GTATTCCGGG
GTGTGCCACA GCCAGGTGAT GGAAGCGCGG GGCGGCCGCG GCGTCGATCG CTATCTGCCG
CACATGCTCG GCCACGAGGG CTCGGGCGTC GTGGTCGAGA CCGGCGCCGG CGTGACCAAG
GTGAAGACGG GCGATCGCGT CATCCTCGGC TGGATCAAGG GCAAAGGCGC CGACGCGCAG
GGCATCCGCT ACAAGAGCGG CGACGGCTTC ATCAATGCCG GCGCGGTGAC GACGTTCAAC
GAATACGCCG TGGTCGCGGA GAACCGCGTG ACCCTGCTGC CGCAGGGCCT GCCGATGGAC
GTCGCGGTGC TGTTCGGCTG CGCGCTGCCG GTCGGCGCCG GCATCGTCAT CAATATCGCC
AAGCCCGCGC CCGGCAGCAC GCTCGCGGTG TTCGGGCTCG GCGGCATCGG GTTGTCGGCG
CTGATGGCGT GCAAGCTGTT CGACTGCAGG CAACTGATCG CCGTCGATGT CGAGCCCGCG
AAGCTGGCGA TGGCGCGCGA ACTCGGCGCC ACCGCGACGA TCGATGCGTC GCAGCAGGAT
CCGGTCGCTG CGATCCGGGA GCTGACCGGC GGGCTCGGTG TCGACTACGC CATTGAATCC
GCCGGCCTGG TGCGCGTCAT CGAGCAGGCG TTCGACGCCA CGCGGCGGTT CGGCGGGCTG
TGCGTGTTCG CCTCGCATCC GCGTTCCGGC GAAAAGATCG CGCTCGACCC GTTCGAACTG
ATCTGCGGCA AGCGCATCCT CGGCACCTGG GGTGGCGACG CCAATCCGGA CCGCGACGTC
GACCTGCTCG CCGGCCTGTT CCGCGCCGGC AAGCTGCCGC TGGCCTCGAT GTTCAGCCGC
CGCTACGCGC TCGACGAGAT CAACATCGCC CTCGACGATC TCGAACAGCG CCGCAGCGTG
CGGCCGCTGA TCGAAATCGA TGCGACATTG GGCTGA
 
Protein sequence
MPVTFRAAVL RELNRPLAIE TVEAPPLAAG QVLVKLAYSG VCHSQVMEAR GGRGVDRYLP 
HMLGHEGSGV VVETGAGVTK VKTGDRVILG WIKGKGADAQ GIRYKSGDGF INAGAVTTFN
EYAVVAENRV TLLPQGLPMD VAVLFGCALP VGAGIVINIA KPAPGSTLAV FGLGGIGLSA
LMACKLFDCR QLIAVDVEPA KLAMARELGA TATIDASQQD PVAAIRELTG GLGVDYAIES
AGLVRVIEQA FDATRRFGGL CVFASHPRSG EKIALDPFEL ICGKRILGTW GGDANPDRDV
DLLAGLFRAG KLPLASMFSR RYALDEINIA LDDLEQRRSV RPLIEIDATL G
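
The gene and protein sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch, not the method used by the source database, of how the gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. The helper names are hypothetical. The codon table is the standard set of codon assignments, which translation table 11 shares for internal codons; the alternative start codons that table 11 permits are not handled here, which does not matter for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
gene_start = "ATGCCCGTGA CCTTCCGCGC CGCCGTTCTT"
print(translate(gene_start))  # prints "MPVTFRAAVL"
```

The ten residues produced from the first thirty bases match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.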