Gene RPB_3599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3599 
Symbol 
ID3911401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4129228 
End bp4130100 
Gene Length873 bp 
Protein Length290 aa 
Translation table11 
GC content66% 
IMG OID637885501 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_487205 
Protein GI86750709 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.55778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.875827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTCCG ATCAGCTTCT CGCAGGCCGT CGTATCCTCG TCACCGGTGG CGGCACCGGG 
CTCGGCAAAT CGATGGCCGC GCGCTTCCTG CAGCTCGGCG CCGAAGTCCA TATCTGCGGC
CGCCGCAAGG GCGTCTGCGA CGAGACCGCG ACCGAACTGA TGGATCAGTA CGGCGGCAAG
GTGATGACCT ACGGCGTCGA CATCCGCGAC TCGGCCGCGG TCGACCACAT GGTCGAGACC
ATCTTCGCCG ACGGCCCGCT CACCGATCTG ATCAACAACG CCGCCGGAAA TTTCATCTCG
CGGACGGAAG AGCTGTCGCC GCGCGGCTTT GACGCCGTCG CCAACATCGT GATGCACGGC
ACCTTCTACG TGACGCATGC GGTCGGCCGG CGCTGGATCG CCGGCGGCCA CCGCGGCAAT
GTGGTGTCGA TCACCACCAC CTGGGTCCGC AACGGCAGCC CCTATGTGGT GCCCTCGGCG
ATGAGCAAAT CGGCGATCCA CGCCATGACG ATGTCGCTCG CCACCGAATG GGGCCGCTAC
GGCATCCGCC TCAACACCAT TGCGCCCGGC GAAATTCCCA CCGAAGGCAT GAGCAAGCGG
ATCAAGCCCG GCGACGAGGC CGGCGCCCGC ACCGTGAAGG TGAATCCGAT GGGCCGCGTC
GGCACCATGG AGGAACTGCA GAACGTCGCG GTGTTCCTGA TCTCCGGCGG CTGCGACTGG
ATCAACGGCG AAACCATCGC GATGGACGGC GCCCAGGGCC TGGCGATGGG CGGCAATTTC
TATCAGCTGC GCGACTGGAG CAACGCCGAC TGGGACCAGG CCAAGGCCTC GATCAAGGCG
CAGAACGAAA AAGACCGCGC ACAGCGGGGG TGA
 
Protein sequence
MFSDQLLAGR RILVTGGGTG LGKSMAARFL QLGAEVHICG RRKGVCDETA TELMDQYGGK 
VMTYGVDIRD SAAVDHMVET IFADGPLTDL INNAAGNFIS RTEELSPRGF DAVANIVMHG
TFYVTHAVGR RWIAGGHRGN VVSITTTWVR NGSPYVVPSA MSKSAIHAMT MSLATEWGRY
GIRLNTIAPG EIPTEGMSKR IKPGDEAGAR TVKVNPMGRV GTMEELQNVA VFLISGGCDW
INGETIAMDG AQGLAMGGNF YQLRDWSNAD WDQAKASIKA QNEKDRAQRG