Gene RPC_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1231 
Symbol 
ID3969108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1347264 
End bp1348121 
Gene Length858 bp 
Protein Length285 aa 
Translation table11 
GC content65% 
IMG OID637924342 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_531113 
Protein GI90422743 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000245222 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGCATT ATCCGAAGCC GCCGTTCGCG TCGCAAGCGC AGCCGATGCC GGGTTCGACC 
GACGCGATGA CGCCGCGGCC CGATCACGGC GAGGACAGCT ACAAGGGATC CGGGCGACTG
GCTGGCATGA AGGCGGTGAT TACCGGCGGC GACAGCGGCA TCGGCCGCGC CGTGGCGATC
GCCTATGCCC GCGAAGGCGC CGACATCCTG ATCTCCTATC TCGACGAGAA CGAGGAAGCG
GCCGAGGTCA AGGAGTTGAT CGAACGGGAA GGCCGCAAGG CGGTTCTGGT GCCGGGCGAC
CTGCAGTGGC CGGAGCATTG CCGTCGGGTG GTCCAGCGCG CCGCCGAGGA GCTGGGCGGC
ATCGATATCC TGGTCAACAA CGCCGCGCAC CAGGCGACCT TCAAGGACAT CGGCGACATA
ACGGACGAGG AATGGGAGCT GACCTTCAAG GTCAACATCC ACGCGATGTT CCATCTGACC
AAGGCGGCGG TGCCGCACAT GAAGCCGGGC AGCGCCATCA TCAACACCGC CTCGGTGAAT
TCGGACATGC CGAATCCAAT GCTGCTGGCC TATGCCACCA CCAAGGGCGC GATCCAGAAC
TTCACCGGCG GGCTGGCGCA GATGCTGGCG CCCAAGGGGA TTCGCGCCAA CGCCGTGGCG
CCGGGGCCGA TCTGGACCCC GTTGATTCCA TCGACCATGC CGGAGGAGGC GGTGAAGAAT
TTCGGCAAGC AAACGCCGAT GCAGCGCCCA GGCCAGCCGG CCGAACTCGC CACAGCTTAC
GTGATGCTGG CCGATCCGTT GTCGAGCTAT GTCTCCGGCA CCACCATTGC GGTCACCGGC
GGCAAGCCGT TCATCTGA
 
Protein sequence
MLHYPKPPFA SQAQPMPGST DAMTPRPDHG EDSYKGSGRL AGMKAVITGG DSGIGRAVAI 
AYAREGADIL ISYLDENEEA AEVKELIERE GRKAVLVPGD LQWPEHCRRV VQRAAEELGG
IDILVNNAAH QATFKDIGDI TDEEWELTFK VNIHAMFHLT KAAVPHMKPG SAIINTASVN
SDMPNPMLLA YATTKGAIQN FTGGLAQMLA PKGIRANAVA PGPIWTPLIP STMPEEAVKN
FGKQTPMQRP GQPAELATAY VMLADPLSSY VSGTTIAVTG GKPFI