Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3799 |
Symbol | |
ID | 4898264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 927145 |
End bp | 928434 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640114403 |
Product | putative L-sorbosone dehydrogenase |
Protein accession | YP_001045651 |
Protein GI | 126464538 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.410038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.17604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTCC TGGCGCGTGC CACCGCCATC GTCGGCAATA CGATGGTTCT GATGCGGCGG TTCGGATCGC CGGGCACCCA GGCGATCGGG CAGAGCCCGG CCATCCCCGA GGCCCAGAAG CAGGGCATCA TGACCCTCAA GATGCCCGTG GCCAAGGGCT GGGCGCCGGG CCATCTGCCC ACGCCCGCAC CGGGCCTCCA GGTCAATGCC TTCGCCCGGG ATCTGGAGCA TCCGCGCTGG ATCGAGGTGC TGCCCTCGGG CGACGTGCTG GTTGCCGAGG CACGCCAGCT TCCCACCCCG CCGAAGACCC TCCTCGACCG CGCGGCGCAG GCCACCATGC GCCGCATCCG CGCGCTCGGC GACAGCCCGA ACCGCATCAC CCTCCTGCGC GATCCGGAAG GCCGGGGCGA GGCGCAAGAG CGCGGGACTT TCCTCGAGAA CCAGAGCCAG CCCTTCGGCA TGGCGCTGGT GGGCGATACC TTCTACGTCG GCAACACCGA CGGCATTATG GCCTTCCCCT ACCGCCCGGG CGCCACGCGG CTCGAGGGGC CGGGGCGGCG CCTGACCACC TTCAAGCCCG GCGGGCACTG GACGCGCAGC CTGATCGTCT CGCCCGACGG GCGCCGGATC TATGCCGGCG TGGGCTCGCT CAGCAACATC GGCGACGACG GGATGGAGGC CGAGGAGGGC CGCGCCGCGA TCTGGGAGCT GGACCTCGCC AGCGGTCAGG CCCGGATCTA TGCCTCGGGC CTGCGCAACC CGGTGGGCCT CGCGTGGGAG CCCACGACGC GCGTGCTCTG GACCGTGGTC AACGAGCGCG ACGGGCTCGG CGACGAGACC CCGCCCGACT ATCTGACCTC CGTCGAGGAG GACGGCTTCT ACGGCTGGCC CTACTGCTAC TGGAACCGGA TCGTCGACGA TCGCGTGCCG CAGGATCCGG CGATGGTCGC CCGCGCGATC ACGCCCGACT ATGCGCTCGG CGGACACACG GCCTCGCTCG GCCTCTGCTG GGTGCCCGCG GGCACGCTGC CGGGCTTCGG CGACGGAATG GCCATCGGCC AGCACGGCTC GTGGAACCGT TCGAAACTCA GCGGTTACCG GCTGATCTTC GTGCCCTTCG CGAACGGCCG GCCCTCCGGC CCGCCGCGGG ACATCCTGAC CGGCTTCCTC TCCGACGACG AGAAGCTGGC CTACGGTCGC CCGGTCGGCG TGGCGGTGGG CCCCGACCGC CGCTCGCTTC TGCTCGCCGA CGATGTGGGC GACGTGATCT GGCGCGTGAC CGGCGCCTGA
|
Protein sequence | MDFLARATAI VGNTMVLMRR FGSPGTQAIG QSPAIPEAQK QGIMTLKMPV AKGWAPGHLP TPAPGLQVNA FARDLEHPRW IEVLPSGDVL VAEARQLPTP PKTLLDRAAQ ATMRRIRALG DSPNRITLLR DPEGRGEAQE RGTFLENQSQ PFGMALVGDT FYVGNTDGIM AFPYRPGATR LEGPGRRLTT FKPGGHWTRS LIVSPDGRRI YAGVGSLSNI GDDGMEAEEG RAAIWELDLA SGQARIYASG LRNPVGLAWE PTTRVLWTVV NERDGLGDET PPDYLTSVEE DGFYGWPYCY WNRIVDDRVP QDPAMVARAI TPDYALGGHT ASLGLCWVPA GTLPGFGDGM AIGQHGSWNR SKLSGYRLIF VPFANGRPSG PPRDILTGFL SDDEKLAYGR PVGVAVGPDR RSLLLADDVG DVIWRVTGA
|
| |