Gene Rpal_1721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1721 
Symbol 
ID6409378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1844489 
End bp1845694 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content68% 
IMG OID642711609 
Productgeranylgeranyl reductase 
Protein accessionYP_001990724 
Protein GI192290119 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR02023] geranylgeranyl reductase
[TIGR02032] geranylgeranyl reductase family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.889725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATA GCTCCGCAAT TTATGATGTC GTTGTCGTCG GCGGCGGCCC TGCCGGCGCT 
ACTGCGGCGT GCGATCTGGC GCGCGCCGGC AAGCGGGTGG TGCTGCTCGA CCGCGCCGGC
CGCATCAAGC CGTGCGGCGG CGCGATCCCG CCGCGCGCGA TCCGCGATTT CGCGATTCCC
GACAGCATGC TGGTCGCCAA GATCAACGCC GCGCGGATGG TGTCGCCGAG CAATGTCGAA
GTCGACATGC CGATCGACGG CGGCTTCGTC GGCATGGTCG ACCGCGAGCA TTTCGACGAG
TGGCTGCGCC AGCGCGCCGC GACCGTCGGC GCCGAACGGC GCACCGGGCT GTTCAAGCGG
TTCAGCCGCG ACGAGTCCGG CGTCAACACC GTGCACTACG AAGGGCGCGG CCCCGACGGG
TCGATGATCG ACCAGACGGT GCGCTGCCGT GCGATCATCG GCGCCGATGG CGCGGTGTCC
GGCGTGGCGC GGCAGTTCCT GAAGGATGCC GACCGCGTGC CGTTCGTGTT CGCCTATCAC
GAGATCATCA AGGCGCCGAC GGCGGAGCAG AAGGCCGCCT ATGAAAGCCG CCGCTGCGAT
GTGTACTACC AGGGCCACGT CTCGCCGGAC TTCTACGGCT GGGTGTTCCC GCACGGCAAT
ACGGTCAGCG TCGGCACCGG CTCGATGCAC AAGGGCTTCT CGCTGCGCGA TTCGGTCGCG
GAGCTGCGCA AGCAGACCGG CCTCGACGAG GTCGAGACCA TCCGCAAGGA AGGCGCGCCG
ATCCCGCTGC ATCCGCTGCC GCGCTGGGAC GACGGCCACA GCGTGCTGCT CGCCGGCGAT
GCCGCCGGCG TGGTCGCGCC GGCGTCGGGC GAGGGCATCT ACTACGCGCT GCTCGGCGGC
CGGCTCGCCG CCGAAGCGGT CGAGGAATTC CTGCAGACCG CCGACGCCAA GGCGCTGAAG
CTGGCCCGCA AGCGCTTCAT GCGCGGCAAC GGTTCGGTGT TCCGGATCCT CGGCCTGATG
CAGTGGTATT GGTACGCCAA CGACAAGCGC CGCGAACAAT TCGTCAGCAT CTGCCGCGAT
CGCGACGTGC AGAAGTTGAC GTGGGACGCC TACATGAACA AAAAGCTGGT CCGGGCGAAA
CCGATCGCCC ACGTCCGGAT CTTCTTCAAG AACCTGGCCC ACATGACGGG GCTGGCTTCG
GTCTGA
 
Protein sequence
MSDSSAIYDV VVVGGGPAGA TAACDLARAG KRVVLLDRAG RIKPCGGAIP PRAIRDFAIP 
DSMLVAKINA ARMVSPSNVE VDMPIDGGFV GMVDREHFDE WLRQRAATVG AERRTGLFKR
FSRDESGVNT VHYEGRGPDG SMIDQTVRCR AIIGADGAVS GVARQFLKDA DRVPFVFAYH
EIIKAPTAEQ KAAYESRRCD VYYQGHVSPD FYGWVFPHGN TVSVGTGSMH KGFSLRDSVA
ELRKQTGLDE VETIRKEGAP IPLHPLPRWD DGHSVLLAGD AAGVVAPASG EGIYYALLGG
RLAAEAVEEF LQTADAKALK LARKRFMRGN GSVFRILGLM QWYWYANDKR REQFVSICRD
RDVQKLTWDA YMNKKLVRAK PIAHVRIFFK NLAHMTGLAS V