Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1861 |
Symbol | |
ID | 4022343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 2082943 |
End bp | 2083764 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637962054 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_568997 |
Protein GI | 91976338 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0078052 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.313039 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCGTT TAGCGCGCGA CAATCAAAAA TGGAAACGCT GTATGAGCAT ATTCGATCTG AGCGGACGCG TCGCAGTCAT CACCGGTGGC AATGGTGGCA TCGGGCTTGG TATGGCGCAG GCGCTTGCCG CGGCGGGTTG CAACGTCTCG ATCTGGGGCC GCAATGCCGA CAAGAACAAG GCCGCGGTGC AGAGCCTGGC TGGCGCGCCC GGCAAGTCCG AAGCGCATGT TTGCGACGTG ACAGATCCGG CCTCGGTCAA GGCCGCAATG GACGCGACGC TGCAATCCTT CGGTCGCGTC GACGGCTGTT TCGCCAATGC CGGAATCGGC GGCGGGGGGC GGCAGGCTTT CATCGATCGC ACCGAGGAGC AGTGGCGCAC GATGTTCGCC ACCAATCTCG ACGGCGTGTT CCATGCCTTC CAGGCCGCGG CGCGACACAT GACCGAACGC GCCGCCAACG GCGATCCCTT CGGCCGTCTG GTGGCGACCT CGAGTCTCGC CTCGATCTTC GGCACTGCGC GCAACGAGCA CTACGCCGCG ACCAAGGCTG CGATCAACGC GCTGGTGCGC GCGCTCGCGG TCGAACTCGC GCGCCACAAC ATCACCGCCA ATGCGATCTT GCCGGGCTGG ATCAAGAGCG ACATGACCGC AGGGATCCTG GGCAACGACA AGTTCGTGGC GAACGTCATG CCGCGCATTC CGGTGCGTCG GTTCGGCGAG CCGGAAGATT TCGGCGGCAT CGCCGTGTAT CTGATGAGCA AGGCGTCGTC TTATCACACC GCCGACACGT TCGTGATCGA CGGCGGATAC ACGGCGTTCT GA
|
Protein sequence | MHRLARDNQK WKRCMSIFDL SGRVAVITGG NGGIGLGMAQ ALAAAGCNVS IWGRNADKNK AAVQSLAGAP GKSEAHVCDV TDPASVKAAM DATLQSFGRV DGCFANAGIG GGGRQAFIDR TEEQWRTMFA TNLDGVFHAF QAAARHMTER AANGDPFGRL VATSSLASIF GTARNEHYAA TKAAINALVR ALAVELARHN ITANAILPGW IKSDMTAGIL GNDKFVANVM PRIPVRRFGE PEDFGGIAVY LMSKASSYHT ADTFVIDGGY TAF
|
| |