Gene Rpal_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1965 
Symbol 
ID6409625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2121497 
End bp2122369 
Gene Length873 bp 
Protein Length290 aa 
Translation table11 
GC content66% 
IMG OID642711851 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001990963 
Protein GI192290358 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.215015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCACAG ATCAGCTTCT CGCCGGCCGT CGCATCCTCG TCACCGGTGG CGGCACCGGC 
CTCGGCAAGG CGATGGCGGC GCGCTTCCTG CAGCTCGGCG CCGAAGTGCA CATCTGCGGC
CGCCGCAAGG GCGTGTGCGA CGAGACCGCG ACCGAACTGA TGGATCAGTA CGGCGGCAAG
GTGATGACCT ACGGCGTCGA CATCCGCGAC GCCGCGGCAG TCGACCACAT GGTCGAGACC
ATCTTCGAGA GCGGGCCGCT CACAGACCTG ATCAACAACG CGGCGGGAAA TTTCATCTCG
CGGAGTGAGG ATCTCAGCCC GCGCGGCTTC GACGCCGTCG CCAACATCGT GATGCACGGC
ACGTTCTACG TCACCCACGC GGTCGGCAAG CGCTGGATCG CCGGCGGCCA CCGCGGCAAC
GTGGTGTCGA TCACCACCAC CTGGGTGCGC AACGGCAGCC CCTATGTGGT GCCGTCGGCG
ATGAGCAAGT CGGCGATCCA CGCCATGACG ATGTCGCTCG CCACCGAATG GGGTAAATAC
GGCATCCGTC TCAACACCAT TGCGCCGGGC GAAATTCCGA CCGAGGGCAT GAGCAAGCGG
ATCAAGCCGG GCGACGAAGC CGGCGCACGC ACCATCAAGA TGAACCCGAT GGGCCGCGTC
GGCACCATGG AGGAGTTGCA GAACCTCGCA ACCTTCCTGA TCTCCGGCGG CTGCGACTGG
ATCAGCGGCG AGACCATCGC CATGGACGGC GCCCAGGGCC TCGCGATGGG CGGCAATTTC
TACCAGCTCC GCGACTGGAG CAACGCCGAC TGGGATCAGG CCAAGGCCTC GATCAAAGCC
CAGAACGAAA AGGACCGCGC GCAGCGGGGG TAA
 
Protein sequence
MFTDQLLAGR RILVTGGGTG LGKAMAARFL QLGAEVHICG RRKGVCDETA TELMDQYGGK 
VMTYGVDIRD AAAVDHMVET IFESGPLTDL INNAAGNFIS RSEDLSPRGF DAVANIVMHG
TFYVTHAVGK RWIAGGHRGN VVSITTTWVR NGSPYVVPSA MSKSAIHAMT MSLATEWGKY
GIRLNTIAPG EIPTEGMSKR IKPGDEAGAR TIKMNPMGRV GTMEELQNLA TFLISGGCDW
ISGETIAMDG AQGLAMGGNF YQLRDWSNAD WDQAKASIKA QNEKDRAQRG