Gene RPC_3068 details


Gene Information

Locus tag: RPC_3068
Symbol:
ID: 3972889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3400275
End bp: 3401309
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 66%
IMG OID: 637926177
Product: zinc-binding alcohol dehydrogenase
Protein accession: YP_532930
Protein GI: 90424560
COG category: [R] General function prediction only
COG ID: [COG2130] Putative NADP-dependent oxidoreductases
TIGRFAM ID:
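
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is an uncounted stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3400275, 3401309)
print(length_bp)                  # 1035
print(protein_length(length_bp))  # 344
```

Under these assumptions, 1035 bp corresponds to 345 codons, of which 344 encode amino acids and one is the stop codon.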


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.281772
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0896091
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCAAT CCGCTTCCCG CATCGTCCTG GCCTCCCGTC CGCAGGGCGA GCCGACACCG 
GCGAATTTCC GCCTTGAGGG ATTTCCGGTG CCGACGCCCG GAGACGGCCA GCTGCTGCTG
CGCACCATCT ATCTGTCGCT CGATCCCTAT ATGCGCGGCC GGATGAGCGA CGGGCCGTCC
TATGCGGCGC CGGTGCCGGT CGATGGCGTG ATGGACGGCG GCACCGTGGC CGAGGTGATC
GCCTCGCACC ATCCTGGCTT CGCCAAGGGC GATTTCGTGC TGGCGCATTC CGGCTGGCAG
AGCCACGCGC TGTCCGACGG CAAGGGGCTG CGCAAGGTCG ATCCGAAACT GGCGCCGATC
TCCACCGCGG TCGGCGTGCT CGGCATGCCC GGCATGACCG CCTACACCGG GCTTCTCGAA
ATCGGCAAGC CCAAGGCCGG CGAGACCGTG GTGGTCGCAG CCGCCTCCGG CGCGGTCGGC
TCCGCGGTGG GGCAGATCGC CAGGATCAAG GGCGCGCACG CGGTCGGCAT CGCCGGCGGC
CGTGAGAAAT GCGACTACGT GGTCAACGAA CTCGGCTTCG ACGCCTGCCT CGACCATCGC
GAACCCGATC TCGCCGCCAG GCTCAAAGAG GCCTGCCCGA AAGGCATCGA CGTGTATTTC
GAGAATGTCG GCGGCGCGGT GTTCGAGGCG GTGTTTCCGC GGCTCAATCC GTTCGCGCGG
ATCCCGGTGT GCGGGCTGAT CGCCGACTAC AACACGATCT ACGATGGCGA CACGCCGACG
CCGAAATGGG CCAATTCGAT CATGCGGGCG ATCCTGGTGA AGCGGCTGAA TTTCCGCGGC
TTCATCGTGT CGGATTTTGC CGCGTTGCAC GGCGATTTCC TGCGCGACAT GTCGCATTGG
CTGCGCGACG GCAAGATCAA GCACCGCGAA TTCGTCACCG AAGGACTGGC GAGCGCGCCC
GAGGCCTTCA TCGGGCTGTT GAAGGGCGCC AATTTTGGCA AGCAGTTGGT GCGGGTCGGG
CCGGACAACA GCTGA
 
Protein sequence
MSQSASRIVL ASRPQGEPTP ANFRLEGFPV PTPGDGQLLL RTIYLSLDPY MRGRMSDGPS 
YAAPVPVDGV MDGGTVAEVI ASHHPGFAKG DFVLAHSGWQ SHALSDGKGL RKVDPKLAPI
STAVGVLGMP GMTAYTGLLE IGKPKAGETV VVAAASGAVG SAVGQIARIK GAHAVGIAGG
REKCDYVVNE LGFDACLDHR EPDLAARLKE ACPKGIDVYF ENVGGAVFEA VFPRLNPFAR
IPVCGLIADY NTIYDGDTPT PKWANSIMRA ILVKRLNFRG FIVSDFAALH GDFLRDMSHW
LRDGKIKHRE FVTEGLASAP EAFIGLLKGA NFGKQLVRVG PDNS
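
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. This is a minimal sketch, not the database's own method: it assumes the spaces in the listing are display grouping only, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11
# uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()  # drop display spacing and newlines
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cleaned), 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGTCCCAAT CCGCTTCCCG CATCGTCCTG"))  # MSQSASRIVL
print(translate("CCGGACAACA GCTGA"))                   # PDNS
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above.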