Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3939 |
Symbol | |
ID | 3911746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4495440 |
End bp | 4496240 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885843 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_487543 |
Protein GI | 86751047 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0544906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTTG CGGGCAAGGT CGTCGTCGTC ACCGGCGGCG GCAATGGAAT CGGTCAGGCG ATGTGCGAGG CCTTTGCCAA GGCCGGCGCC GCCAAGGTCG TGGTCGCCGA TCTCGATGAG GCCCGCGCCG AGGCGGTGGC CGCCGCGATC GGCGGCGCGG CGTTCAAATG CGACGTCGCG AAAGAATCCG ACATCAAGCA CGTGATCGAC GAGACCGAGC GGCGCTTCGG CCCGATCGCG CTGTTCTGCT CCAATGCGGG GATCGGCGGC GGCTTCGATC CGCTGTCGGA GAATGCCGGC GGCGCGTCCG ACGAGCCGTT CATGAACAGC TGGATGATCC ACGTGATGGC GCATGTCTAT GCCGCACGGC ATCTGGTGCC GCTCTACAGA GCGCGCGGCG GCGGCTATTT CCTCAACACG ATTTCGGCGG CGGGCCTGTT GTCGCAGGTC GGCAGCCCGG CCTATTCGGC GACCAAGCAC GGCGCGGTGG GCTTCGCCGA AAATCTCGCG ATCTCGCACA AGGCGCACAA CATCAAGGTC TCGATCCTGT GCCCGCAGGG CGTCGACACC AACATGCTGC GCGGGCTGCC GAAGGGCCCG CAATCCGCCG ACGGCGATCT CAGCCCCGAG CAGGTCGCGC AGGACGTGAT CGAGGGGCTC GCCGAAGAGA GCTTCCTGAT CCTACCGCAC AAGCAGGTGA TCGACTACAT GCGCAAGAAG ACCGAGAACT ACGACCGCTG GATCGGCGGC ATGGCCAAGA TCCAGGCGAA ATTGCGCGAG ACGTTCGGCG CGAAGGGGTA G
|
Protein sequence | MQVAGKVVVV TGGGNGIGQA MCEAFAKAGA AKVVVADLDE ARAEAVAAAI GGAAFKCDVA KESDIKHVID ETERRFGPIA LFCSNAGIGG GFDPLSENAG GASDEPFMNS WMIHVMAHVY AARHLVPLYR ARGGGYFLNT ISAAGLLSQV GSPAYSATKH GAVGFAENLA ISHKAHNIKV SILCPQGVDT NMLRGLPKGP QSADGDLSPE QVAQDVIEGL AEESFLILPH KQVIDYMRKK TENYDRWIGG MAKIQAKLRE TFGAKG
|
| |