Gene RPC_3824 details


Gene Information

Locus tag: RPC_3824
Symbol:
ID: 3969283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4254452
End bp: 4255471
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 66%
IMG OID: 637926934
Product: aldo/keto reductase
Protein accession: YP_533677
Protein GI: 90425307
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
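
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4254452
END_BP = 4255471


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1020
print(protein_length(length_bp))  # 339
```

Under these assumptions the computed values agree with the reported gene length (1020 bp) and protein length (339 aa).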


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTATC GCCAACTCGG CCGGAGCGGC CTAAAAATTT CCCCGTTGTG TCTCGGCACC 
ATGATGTTCG GTGGCGCCAC CGACGAGGCG ACCGCGGTGC GGATCATCGA CAAGGCGCGC
GGCGCCGGTA TCAATTTCAT CGACACCGCC GACGCCTATT CCAGCGGCGC CGCGGAGGCC
ATCGTCGGCC GCGCTATCGC CAAGCATCGG CAGCATTGGG TGCTGGCGTC CAAACTCGCC
AACCCGATGG GCGAAGGCCC CAACCGCGCC GGGCTGTCGC GCCGCTGGGT GATGCAGGCC
GCCGAAGACA GCCTGAAGCG GCTCGGCACC GACCATCTCG ACATCTACTA CCTGCACAAG
GAAGATCACG CCACGCCGCT GCACGAGACG GTGCGGGCGA TCGGCGATCT GATCCGCGAC
GGCAAGATCC GTTACTTCGG CGTATCGAAT TATCGCGCCT GGCGGATCGC GGAAATCTGC
AACATCTGCG ACCGGCTCGG CATCGACCGC CCGGTGGTCA GCCAGCCCTA TTACAACGCG
ATGAACCGGA TGCCCGAGGT CGAGCAGATG CCGGCCTGCG ACTTCTACGG TCTCGGTGTG
GTGCCCTACA GCCCGCTGGC CCGCGGCGTG CTCACCGGCA AGTATCTGCC CGATGCCACG
CCGGACAAGG ACAGCCGCGC CGGCCGCAAC GACATCCGCA TGATGCAGAC CGAATGGCGC
CGGGAATCCC TCGAACTGGC GCAGACAATC CGCCGCCACG CCGAAGCCCG CGGCACCACC
GCCGGCCAGT TCGCGGTGGC CTGGGTGCTG AACAGCGGCT TCGTCAGTTC GGTGATCGCA
GGACCCCGGA CCGAGCCGCA ATGGGACGAT TACCTCAAGG CGTTGGACTA TCGCTTCACC
GCCGAGGACG AAGCCCTGAT CGACAGCCTG GTGGTCAGCG GCCATCCTTC GACGCCGGGC
TACAACGATC CGGCCTACCC GATCGAAGGC CGACGCGCCC GCACAACTGG TAGTATTTAA
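
The record reports a GC content of 66% for this gene. A minimal sketch of how such a figure could be recomputed from the sequence as printed above is given here; the helper names are hypothetical, and the database's exact rounding convention is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group the bases in tens."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above (six groups of ten bases).
first_line = "ATGGACTATC GCCAACTCGG CCGGAGCGGC CTAAAAATTT CCCCGTTGTG TCTCGGCACC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full 1020 bp sequence to `gc_percent` gives the value to compare against the reported 66%.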
 
Protein sequence
MDYRQLGRSG LKISPLCLGT MMFGGATDEA TAVRIIDKAR GAGINFIDTA DAYSSGAAEA 
IVGRAIAKHR QHWVLASKLA NPMGEGPNRA GLSRRWVMQA AEDSLKRLGT DHLDIYYLHK
EDHATPLHET VRAIGDLIRD GKIRYFGVSN YRAWRIAEIC NICDRLGIDR PVVSQPYYNA
MNRMPEVEQM PACDFYGLGV VPYSPLARGV LTGKYLPDAT PDKDSRAGRN DIRMMQTEWR
RESLELAQTI RRHAEARGTT AGQFAVAWVL NSGFVSSVIA GPRTEPQWDD YLKALDYRFT
AEDEALIDSL VVSGHPSTPG YNDPAYPIEG RRARTTGSI
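
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline: it builds the codon table from the NCBI base ordering and relies on translation table 11 assigning the same amino acids as the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The function and constant names are assumptions.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated TTT, TTC, TTA, TTG, TCT, ...
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGACTATC GCCAACTCGG CCGGAGCGGC"))  # MDYRQLGRSG
```

The first ten residues match the start of the listed protein sequence, and the final twelve bases (GGTAGTATTTAA) translate to GSI followed by a stop, matching its end.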