Gene RPC_1223 details

Gene Information

Locus tag: RPC_1223
Symbol:
ID: 3969100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1338510
End bp: 1339682
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 63%
IMG OID: 637924334
Product: zinc-binding alcohol dehydrogenase
Protein accession: YP_531105
Protein GI: 90422735
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
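
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1338510
end_bp = 1339682

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1173
print(protein_length_aa)  # 390
```

Under these assumptions the result agrees with the listed Gene Length (1173 bp) and Protein Length (390 aa).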


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000417243
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAGCGC TGACCTGGCA CGGCAAGAAT GACATTCGCT GCGAGAGCGT GCCGGATCCG 
ACGATACAAG ACGGTCGCGA CGCGATTATC AAGGTCACCG CCTGCGCGAT CTGCGGCTCC
GACTTGCATC TGTTCGACGG GGTGATGCCG ACCATGGAAA ACGGCGACGT GCTCGGCCAT
GAGACGATGG GCGAGGTGGT CGAGGTCGGC AACGACAACA AAGCCCTGAA GGTCGGTGAC
CGCGTCGTGG TGCCGTTCAC GATTTCCTGC GGGCAATGCT TCTTCTGCAA GCGCGGCTTC
TTTTCCGGCT GCGAGCGCTC CAACCCCAAT GCGAAGATGG CCGAAAGTGT CTGGGGCCAT
TCGCCGGCCG GTCTGTTCGG CTATTCGCAC ATGCTGGGCG GCTTCGCCGG CGGCCAGGCG
GAGTACCTGC GGGTGCCCTA CGCCGATGTC GGGCCGATCA AGGTGCCGGA CGGGTTGAGC
GACGAGCAAG TGCTGTTTCT GTCGGACATC TTTCCGACCG GCTTCATGGC GGCGGATTTC
TGCGATCTGA AGGGCGGCGA AACCGTCGCG ATCTGGGGCT GCGGACCGGT CGGCCAATTC
GCGATCAAGA GCGCCTTCCT GCTCGGCGCC GAGCGGGTGA TCGCGATCGA CACCGTGCCG
GAGCGGCTGG CGATGGCGCA AGCCTCCGGC GCGATCACCC TCGATTTCAT GAAGGAGGAT
ATTTTCGACC GCATCCAGGA CTTGACCGAG GGCCGCGGCG CCGACGCCTG CATCGACGCG
GTCGGCACCG AGCCGGAGAC CGGATCCGGG GTGGACGCGG TGATCGACCG CATCAAGGTC
GCGACTTTCA TAGGCACCGA TCGTCCGCAC GTGCTGCGCC AGGCCATCCA TTGCTGTCGC
AATTTCGGCA CGGTGTCGAT CGTCGGCGTC TATGGCGGAT TGCTGGACAA GATCCCGATG
GGCTCGGCCA TCAACCGCGG GCTGACCTTC CGCATGGCGC AGACCCCGGT GCAGCACTAT
CTGCCGCAAT TATTGGGACG GATCGAAAAG GGTGAGATCG ACCCGTCCTT CGTGATCACC
CATCGGGCGA CTTTGGAAGA AGGTCCGGAA CTCTACAGCA CGTTCCGCGC CAAGCAGGAC
GGCTGCATCA AGGTGGTGAT GAAGCCCTCT TGA
 
Protein sequence
MKALTWHGKN DIRCESVPDP TIQDGRDAII KVTACAICGS DLHLFDGVMP TMENGDVLGH 
ETMGEVVEVG NDNKALKVGD RVVVPFTISC GQCFFCKRGF FSGCERSNPN AKMAESVWGH
SPAGLFGYSH MLGGFAGGQA EYLRVPYADV GPIKVPDGLS DEQVLFLSDI FPTGFMAADF
CDLKGGETVA IWGCGPVGQF AIKSAFLLGA ERVIAIDTVP ERLAMAQASG AITLDFMKED
IFDRIQDLTE GRGADACIDA VGTEPETGSG VDAVIDRIKV ATFIGTDRPH VLRQAIHCCR
NFGTVSIVGV YGGLLDKIPM GSAINRGLTF RMAQTPVQHY LPQLLGRIEK GEIDPSFVIT
HRATLEEGPE LYSTFRAKQD GCIKVVMKPS
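
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch in Python. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and that the spaces in the sequence blocks are only visual grouping. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of the record.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 uses the same codon-to-amino-acid mapping as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna):
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence as listed in this record.
first_row = "ATGAAAGCGC TGACCTGGCA CGGCAAGAAT GACATTCGCT GCGAGAGCGT GCCGGATCCG"
last_row = "GGCTGCATCA AGGTGGTGAT GAAGCCCTCT TGA"

print(translate(first_row))  # MKALTWHGKNDIRCESVPDP
print(translate(last_row))   # GCIKVVMKPS*
```

Each full row of the gene sequence holds 60 bases, so every row starts in frame and can be translated on its own. The two outputs match the start and end of the listed protein sequence, with the final `*` corresponding to the TGA stop codon. Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the listed GC content.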