Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1657 |
Symbol | |
ID | 4895767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1749485 |
End bp | 1750483 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640112250 |
Product | short chain dehydrogenase |
Protein accession | YP_001043539 |
Protein GI | 126462425 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0430692 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGACA GGCCGGTCGT GGTCATCACG GGAGGAACGG CGGGCGTGGG GCGTGCCCTC GCCAGAGCCT ACGCGCACCG CGGCTGGCGG GTGGCGATCC TGGCGCGCGG ACAGGCGGGC CTCGCGGCCA CCGCCGCCGA CGTGGCCCGT GCCGGGGGCG AGCCCCTGAC CCTTGCGGCA GATGTCGCCG ACTCCGCCGC CGTCTTCGAC GCGGCCGACC GCGTGGTGGC CCGCTGGGGC CGCATCGACC TCTGGATCAA CAATGCGATG GTCACGGTCT TCGGCCCGGC CGAGCGCGCG ACGCCCGAGG AATATGCCCG CGTCACCGCC GTGACCTACC TCGGCACCGT CCATGGCACG CTCGCGGCCC TCCGCCACAT GCGCCCACGC GACCGCGGCA CCATCCTTCA GATCGGCTCG GCGCTGGCCT ACCGCTCGAT CCCCCTGCAG GCCGCCTATT GCGCGGCGAA GGCCGCCTGC CGGGGCTTCA CCGACTCGCT CCGGTCCGAG CTTCTGCACG AGAAGAGCCG CATCCGCCTC ACCATGGCGC AGCTCCCGGC CGTCGATACG CCGCAGTTCG ACTGGGCCCG CACCCATGTC CGCGGACGCC CGCAGCCCGT GCCCCCGATC CATACCCCCG AGGCCGTGGC CGAAGCGATC GCCGCCGCGG CCCGCACGGC CCCGCGCGAG ATCTGGATCG GCGCGCCCTC GATGCAGGCG ATCCTCGGCA CGATGGTGGC GCCGGGCCTC ATGGACCGGA TGATGGCGCG CAGCGCCTGG GACAGCCAGA TCGGTCCCGG AGACGCCCGC GCGGACAATC TCATGCAGCC GGTCGAGCGT GACATGGGCG CCGCGGGCCG CTTCGGCTCC GAAGCGTCCG ACCGGGTCGC GAGCCTCTCC GGCCCCACCG TCCGCGCAGG CCTCGCGGCT GGCGGCGTGC TGGTCGCGGG CGCGGCCCTC ACCGCCGCAG CGCTCCTCGC CCGCCGCCGC GACCCATGA
|
Protein sequence | MQDRPVVVIT GGTAGVGRAL ARAYAHRGWR VAILARGQAG LAATAADVAR AGGEPLTLAA DVADSAAVFD AADRVVARWG RIDLWINNAM VTVFGPAERA TPEEYARVTA VTYLGTVHGT LAALRHMRPR DRGTILQIGS ALAYRSIPLQ AAYCAAKAAC RGFTDSLRSE LLHEKSRIRL TMAQLPAVDT PQFDWARTHV RGRPQPVPPI HTPEAVAEAI AAAARTAPRE IWIGAPSMQA ILGTMVAPGL MDRMMARSAW DSQIGPGDAR ADNLMQPVER DMGAAGRFGS EASDRVASLS GPTVRAGLAA GGVLVAGAAL TAAALLARRR DP
|
| |