Gene RPC_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0020 
Symbol 
ID3971445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp23282 
End bp24178 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content64% 
IMG OID637923134 
Productshort chain dehydrogenase 
Protein accessionYP_529918 
Protein GI90421548 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAC TGAAGGGCAA GACGCTGTTC ATTTCCGGCG GCAGCCGCGG CATTGGGCTG 
GCGATCGCGC TACGTGCGGC GCGCGACGGC GCCAATGTGG CGATCGCCGC CAAGACCGCC
GAGCCACATC CCAAACTCCA GGGCACGATC TATACCGCGG CGGACGAGGT GCGCGCCGCC
GGCGGCAACG CGCTGCCGAT CCTGTGCGAC ATCCGCGACG AGGCCCAGGT GATCGCCGCG
ATCGACAAGA CGGTCGGCGA GTTCGGCGGC CTCGATATCT GCGTCAACAA TGCCTCTGCG
ATCAGTCTCA CCAACTCACA AAATACCGAC ATGAAGCGGT TCGACCTGAT GATGGGGATC
AACACCCGCG GCACCTTCAT GGTGTCGAAA TATTGCATTC CGCATCTGAA GAAGGCCGAG
AACCCGCACA TCCTGATGCT GTCGCCGCCG CTCGACATGA AGCCAAAATG GTTCGAGCAC
TCCACCGCCT ACACCTTGGC CAAGTTCGGC ATGAGCATGT GCGTGCTGGG ATTGTCCGGC
GAACAAAAGC GCGCCGGCAT CGCCGTCAAC GCGCTGTGGC CGCGCACCAC CATCGCCACC
GCGGCGGTCG GCAATCTCTT GGGCGGCGAC GCCATGATGC GCGCCAGCCG GACGCCGGAG
ATCATGGGCG ACGCGGCCTA TGAGATCTTT CTCAAGCCGT CGCGCGAGTT CACCGGGCAG
TTCTGCATCG ACGACAAAGT GCTGTATGAA GCGGGCGTCA CCGATTTCGA GCGCTACCGC
GTCGATCCCT CGGTGCCCCT GATGTCGGAT TTCTTCGTGC CCGACGACGA TGTGCCGCCG
CCCGGCGTCA GCGTGAGGAC GCTGCCCTCG GTCGATGCGG CGAAGGCGAA GGGGTAG
 
Protein sequence
MASLKGKTLF ISGGSRGIGL AIALRAARDG ANVAIAAKTA EPHPKLQGTI YTAADEVRAA 
GGNALPILCD IRDEAQVIAA IDKTVGEFGG LDICVNNASA ISLTNSQNTD MKRFDLMMGI
NTRGTFMVSK YCIPHLKKAE NPHILMLSPP LDMKPKWFEH STAYTLAKFG MSMCVLGLSG
EQKRAGIAVN ALWPRTTIAT AAVGNLLGGD AMMRASRTPE IMGDAAYEIF LKPSREFTGQ
FCIDDKVLYE AGVTDFERYR VDPSVPLMSD FFVPDDDVPP PGVSVRTLPS VDAAKAKG