Gene RPB_3187 details


Gene Information

Locus tag: RPB_3187
Symbol:
ID: 3910988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3643842
End bp: 3645011
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 64%
IMG OID: 637885089
Product: alcohol dehydrogenase
Protein accession: YP_486794
Protein GI: 86750298
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
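
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions are consistent with the listed values but are not stated by the record. The function names are hypothetical.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 3643842
END_BP = 3645011


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```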


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCGC TGACCTGGCA CGGCAAGAGC GATATCCGCT GCGAAAGCGT GCCCGATCCA 
AAAATCGAGC ATGGCCGTGA TGCGATCATC CGGGTCACCG CCTGTGCGAT CTGCGGTTCG
GACCTGCATC TGTTCGACGG CGTGATGCCG TCGATGAAGA GCGGCGACGT GCTGGGACAC
GAGACGATGG GCGAGGTCGT CGAGGTCGGC GCCGACAACA AGGCGCTGAA GGTCGGCGAC
CGCGTGGTGG TGCCGTTCAC GATCTCCTGC GGCGAGTGTT TCTTCTGCAA GCGCGGCTTC
TACAGCGGCT GCGAGCGCTC CAATCCGAAC GCCGAGCAGG CCGCGAAACT CTGGGGCCAT
TCGCCCGCCG GTCTGTTCGG CTATTCGCAT CTGCTCGGCG GCTTTGCCGG CGGGCAGGCG
GAATATCTGC GCGTACCTTA CGCCGATGTC GGTCCGATCA AGGTGCCGGA CGGGATGACC
GACGAGCAGG CGCTGTTCCT GTCCGATATC TTTCCGACGG GATTCATGGC GGCGGATTTC
TGTAACCTGA AGGGTGGCGA GACGGTCGCG ATCTGGGGCT GCGGTCCGGT GGGGCAATTC
GCGATCAAGA GCGCGTTTCT GCTCGGCGCC GAGCGCGTGA TCGCGATCGA TACGGTGCCG
GAGCGGTTGG AGATGGCCCG CGCCTCGGGG GCGATCACCA TCGATTTCAG GAACGAGGAC
GTCTACGACC GCATCCAGGA TCTGACCCAT GGTCGCGGCG CCGATGCCTG CATCGACGCG
GTCGGCACCG AGCCGGACAC GGCGTCCGGA TTCGACGCGA TCGTCGACCG GATCAAGGTG
GCGACCTTCA TGGGCACCGA CCGGCCGCAT GTGCTGCGCC AGGCGATCCA TTGCTGCCGC
AACTTCGGAA CGGTGTCGAT CGTCGGCGTC TATGGCGGCC TGCTCGACAA CATCCCGATG
GGCTCGGCGA TCAATCGGGG GCTGACGTTC CGAATGGCGC AGACCCCGGT CCAGCATTAT
CTGCCGCAAC TGCTGGCGCG GATCGAGAAA GGCGAGATCG ATCCCACCTT CGTCATCACC
CATCGCGCCA CGCTGGAGGA TGGCCCGGAG CTGTACAAGA CGTTCCGCGA CAAGCAGGAC
GGCTGCATCA AGGTGGTGAT GAAGCCCTGA
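
The GC content field can be recomputed from a sequence laid out in space-separated blocks of ten bases, as it is here. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_percent` is a hypothetical helper, and the demo uses only the first line of the gene sequence rather than the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace and case are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; paste all lines to check the full gene.
first_line = "ATGAAGGCGC TGACCTGGCA CGGCAAGAGC GATATCCGCT GCGAAAGCGT GCCCGATCCA"
print(round(gc_percent(first_line), 1))
```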
 
Protein sequence
MKALTWHGKS DIRCESVPDP KIEHGRDAII RVTACAICGS DLHLFDGVMP SMKSGDVLGH 
ETMGEVVEVG ADNKALKVGD RVVVPFTISC GECFFCKRGF YSGCERSNPN AEQAAKLWGH
SPAGLFGYSH LLGGFAGGQA EYLRVPYADV GPIKVPDGMT DEQALFLSDI FPTGFMAADF
CNLKGGETVA IWGCGPVGQF AIKSAFLLGA ERVIAIDTVP ERLEMARASG AITIDFRNED
VYDRIQDLTH GRGADACIDA VGTEPDTASG FDAIVDRIKV ATFMGTDRPH VLRQAIHCCR
NFGTVSIVGV YGGLLDNIPM GSAINRGLTF RMAQTPVQHY LPQLLARIEK GEIDPTFVIT
HRATLEDGPE LYKTFRDKQD GCIKVVMKP
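
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch in plain Python, assuming translation table 11 assigns the same amino acids as the standard code (the two differ in permitted start codons, which this sketch does not model) and that the gene is given 5' to 3' on its coding strand. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from the record.
print(translate("ATGAAGGCGC TGACCTGGCA CGGCAAGAGC GATATCCGCT GCGAAAGCGT GCCCGATCCA"))
# MKALTWHGKSDIRCESVPDP
print(translate("GGCTGCATCA AGGTGGTGAT GAAGCCCTGA"))
# GCIKVVMKP
```

Both outputs match the corresponding ends of the protein sequence listed above.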