Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2079 |
Symbol | |
ID | 4897979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2199763 |
End bp | 2200803 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640112673 |
Product | aldo/keto reductase |
Protein accession | YP_001043954 |
Protein GI | 126462840 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.042571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCA TCCCCCTCGG CCGCACCGAT CTGACCGTTT CCGAACTCTG CCTCGGCACG ATGACCTGGG GCAGCCAGAA CAGCGAGGCC GAAGCCCATG CCCAGATCGA CCTCGCCCTC GACCACGGGG TGAATTTCCT CGACACGGCC GAGATGTATC CGACCAATCC CGTGACGGCC GAGACGGTGG GCGGCACCGA AACCATCATC GGTCGCTGGC TTGCCGCGCG GGGCGGGCGC GACCGGATCG TGCTGGCGAC CAAGATCACC GGCGAGGGCA GTGCGGCGGT ACGCGGCGGC GAGCCGGTGA CGCCCGAGAG CCTGCGCCGC GCGCTCGAGG GCTCGCTCGC GCGGCTCGGC ACCGATCATG TCGATCTCTA CCAGATCCAC TGGCCGAACC GGGGCTCCTA TCACTTCCGC AAGATGTGGG CCTATGTGCC GCCCACGGGG GTCGAGGCGG TGCGCGACAG CATGCTCGCG GTGCTCGAAG AGGCGCAGAA GCTGGTGGCC GAGGGCAAGG TGCGCCATTT CGGCCTCTCG AACGAGACGG TCTGGGGCGC GGCGCAGTGG CTCTCGCTGG CCGACCGGCA CGGGCTGCCG CGCATGGCCT CGGTCCAGAA CGAATATTCG CTGCTCTGCC GCCAGTTCGA CACCGACTGG GCCGAGCTTT CGGCGCTGGA GGAGATGCCG CTTCTGGCCT TCTCGCCCCT CGCTGCGGGG CTTCTGTCGG GCAAATATGC CGGAGACGTG ACGCCCGACG GCTCGCGCCG CGAACGCAAT GCCACGCTGG GCGGGCGCGT CACGCCCACC GTCTTCGAGG CGGTGGCGGG CTATCTCGGG ATCGCCGCGC GCCACGGGCT CGACCCCTGC CAGATGGCGC TCGCCTTCTG CCGCAAGCGT CCCTTCCCGG TGATCCCGAT CCTCGGCGCC ACCTCGCTCG ACCAGCTGCG CACCAACCTC GGCGCCTGTG ACCTCGAGCT GTCCCCCGAA GTCGAGGCCG AGATCGCGGC GGCCCATCGC ACCTGGCCCG CGCCCTACTG A
|
Protein sequence | MKRIPLGRTD LTVSELCLGT MTWGSQNSEA EAHAQIDLAL DHGVNFLDTA EMYPTNPVTA ETVGGTETII GRWLAARGGR DRIVLATKIT GEGSAAVRGG EPVTPESLRR ALEGSLARLG TDHVDLYQIH WPNRGSYHFR KMWAYVPPTG VEAVRDSMLA VLEEAQKLVA EGKVRHFGLS NETVWGAAQW LSLADRHGLP RMASVQNEYS LLCRQFDTDW AELSALEEMP LLAFSPLAAG LLSGKYAGDV TPDGSRRERN ATLGGRVTPT VFEAVAGYLG IAARHGLDPC QMALAFCRKR PFPVIPILGA TSLDQLRTNL GACDLELSPE VEAEIAAAHR TWPAPY
|
| |