Gene RPC_4893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4893 
Symbol 
ID3973715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5462941 
End bp5463996 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content65% 
IMG OID637928005 
Productalcohol dehydrogenase GroES-like protein 
Protein accessionYP_534734 
Protein GI90426364 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCATG TCAAAGCCTA TGCCGCGCAG TCCGCGGCTT CGCCGATCGC GCCGTTCAGC 
CTGGAGCGGC GCGAACCCGG TCCGCACGAC GTGCAGATCG ATATTCTGTA TTGCGGCGTC
TGCCATTCCG ATCTGCACCA GGCCCGCAAC GATTGGAGCA ACTCGCTGTA TCCGATGGTG
CCCGGCCACG AAATCGTCGG CCGCGTGGTC GCCACCGGCG CTCATGTGAA GAACCTCAAG
GTCGGCGATT TCGCCGGCGT CGGCTGCATG GTGGATTCCT GCCGGCATTG CGCGCCGTGC
GAGGCCGGGC TCGAGCAATA TTGCATCGAG GGCGCGACCT GGACCTACAA CGCGCACGAA
CGCGGCTCGC AGCAGCTGAC CTTCGGCGGC TATTCCGAAG CGATCGTCGC CGACGAACGC
TTCGTGGTGA AGATCCCCGC CCACATGGAT CTGAAGGCGG TGGCGCCGCT GCTCTGCGCC
GGCATCACCA CCTGGTCGCC GCTGCGGCAC TGGAAGGTCG GCAAAGGCCA GAAGGTCGGC
GTCGTTGGGC TCGGCGGCCT CGGCCATATG GGCGTGAAGT TCGCCAAGGC GCTCGGCGCC
CATGTGGTGA TGGTCACCAC CTCGCCGGAG AAGGGCAAGG ACGCGATCCG GCTCGGCGCC
GACGAGGTGC TGGTGTCGAA GGACGCGAAC GCTATGGCCA AGGCCAAGGG CTCGTTCGAC
TTCCTGCTCA ACACCATCCC GGTCGGCCAC GACGCCAACC CGTATTTGCA GTTGCTCAAG
CTCGACGGCG CGATGGTGAT GGTCGGCGCG CTGACGCCGC TGGATCCGAT CGTCGGCGGC
AATCTGATCC ACGGCCGCCG CAGCATCGCC GGCTCGGGGA TCGGCGGCAT GCCGGAGACC
CAGGAGATGA TCGATTTCTG CGCCGAACAC GGCATCGTCT CCGACGTCGA AATGATCCGC
ATCCAGGACA TCAACAAAGC CTATGAGCGG CTGTTGAAGA ACGACGTGCG CTATCGCTTC
GTCATCGACA TGGCGTCGCT GAAGAACGCG GGTTGA
 
Protein sequence
MIHVKAYAAQ SAASPIAPFS LERREPGPHD VQIDILYCGV CHSDLHQARN DWSNSLYPMV 
PGHEIVGRVV ATGAHVKNLK VGDFAGVGCM VDSCRHCAPC EAGLEQYCIE GATWTYNAHE
RGSQQLTFGG YSEAIVADER FVVKIPAHMD LKAVAPLLCA GITTWSPLRH WKVGKGQKVG
VVGLGGLGHM GVKFAKALGA HVVMVTTSPE KGKDAIRLGA DEVLVSKDAN AMAKAKGSFD
FLLNTIPVGH DANPYLQLLK LDGAMVMVGA LTPLDPIVGG NLIHGRRSIA GSGIGGMPET
QEMIDFCAEH GIVSDVEMIR IQDINKAYER LLKNDVRYRF VIDMASLKNA G