Gene RPD_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1049 
Symbol 
ID4021525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1201968 
End bp1203212 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content70% 
IMG OID637961241 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_568188 
Protein GI91975529 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTCCC GACCGCGCAA GGCAATCGTC GGCGCGTGGC CTGTTGTCGA GCGCCTGATG 
ATCCCCGCTT TTTCAATCGC CCGTCTGCCC CGCATTGAAT TCGGCGCGGG CGCCGTGGCC
CGGCTGCCGC AACTCGCCGC GCGCTACGGC CGCCGCGTGC TGCTGGTCAC CGGCGCGCAT
TCATTCGACC GCGCTCCCTA TGCCGCAGCG CTGCTCGCGG GTCTTCGCGA CCATGGCCTC
AACTGGGACA GGGTGACGAT CGGCGGCGAA CCATCGCCGG AGGCGGTCGA CGCCGCCGTG
CACGACTGGC ATAGCTGCGA CATCGACGCC GTGATCGGGA TCGGCGGCGG CAGCGCGCTC
GATGCCGCCA AGGCGATCGC CGGCCTGCTG CGGCCCGGTA ATTCGGTGAT GGATCATCTC
GAAGGCGTCG GGCCGGAATT GCCGTATCGC GGCCCGGCGA CGCCGTTCAT CGCGGTGCCG
ACCACCGCCG GCACCGGCTC CGAGGCGACA AAGAACGCGG TGTTGTCGCG GCATGGCGCC
CATGGCTTCA AGAAGTCATT CCGCGACGAG GCGCTGGTGG CCGAGATCGC GCTGGTCGAT
CCCGATCTGC TGGCCGGGTG CCCGCCGGAG CTGATCGCGG CCAACGGCAT GGACGCACTG
ACCCAGCTGC TCGAATCCTA CGTCTCGACC CGCGCCAATC CGTTCACCGA CGCATTGGCG
CTGTCCGGCC TGCGCGCCGT CCGCGACGGC CTGCTCGCCT GGTATGAAGG CGGCGACGCG
GCGCGGGCCG CCCAGGCGCA GATGGCCTAT GCGTCGCTGC AGTCCGGCAT TTGTCTCGCG
CAGACCGGGC TCGGCTCGGT CCACGGGCTG GCGTCGCCGC TCGGCGCGTT CTTTCCGATC
GGCCACGGCG TCGTCTGCGG CACGCTGGTG GCGGCCGCGA CGCGCGTCAA CATCGACGCG
ATGGATGCGC GTGCGCCGCA CCATCCTGCG CTGGAGAAAT ACGCCGAGAT CGGCCGTCTG
CTGTCCGGGC GTAGCGGCGC CGGCGTCGCG GAGGATCGCG ACAATCTCGT GCGCACGCTG
GACGACTGGA CGCGGCGGCT GTCGCTGCCG AAGCTATCCG CGCTCGGCGT CGCCACTGGC
GATTTCGACC GGATCGTCGC GGCCAGCCGT GGCTCCAGTA TGAAGACCAA TCCGGTGGTG
CTGACCGATG ATGAGATCAG GCGCGTGTTG AGCTCCCGCT TCTGA
 
Protein sequence
MLSRPRKAIV GAWPVVERLM IPAFSIARLP RIEFGAGAVA RLPQLAARYG RRVLLVTGAH 
SFDRAPYAAA LLAGLRDHGL NWDRVTIGGE PSPEAVDAAV HDWHSCDIDA VIGIGGGSAL
DAAKAIAGLL RPGNSVMDHL EGVGPELPYR GPATPFIAVP TTAGTGSEAT KNAVLSRHGA
HGFKKSFRDE ALVAEIALVD PDLLAGCPPE LIAANGMDAL TQLLESYVST RANPFTDALA
LSGLRAVRDG LLAWYEGGDA ARAAQAQMAY ASLQSGICLA QTGLGSVHGL ASPLGAFFPI
GHGVVCGTLV AAATRVNIDA MDARAPHHPA LEKYAEIGRL LSGRSGAGVA EDRDNLVRTL
DDWTRRLSLP KLSALGVATG DFDRIVAASR GSSMKTNPVV LTDDEIRRVL SSRF