Gene Information |
Locus tag | Rsph17029_0421 |
Symbol | |
ID | 4895973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 435458 |
End bp | 436636 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640111005 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_001042309 |
Protein GI | 126461195 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
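The coordinate and length fields in the table above are related by simple arithmetic. As a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon, the reported lengths can be cross-checked as follows. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 435458
end_bp = 436636

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.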
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.464809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCC GCAGCCCGCT CGTCCTTCTG CTCCTGCCCG CCGGCGCCAT CGCCCAGCCG GTCGAGCAGG GTCCCCCCGC CACCGACTAC GAACCGGCCT TCGAGACCCA GACCCGCGCG CCGGCGCTGG AAGCAACCGG GGCCACGGCG GAGCCCTTCG TGCAGGGGCT CGAGCATCCC TGGGGCATCG CGGCCCTGCC GGAGGGCGGA TGGCTCGTGA CCGAGCGGCC CGGTCGGCTG CGGATGGTCT CCGAGGACGG CACGCTGTCC GACCCGATCA AGGGTCTGCC CGAGGTGGAT GCCCGCAAGC AGGGCGGTCT CCTCGACGTC GCCGCGGGCC CGACCTTCGC CGAAGACCGG ATGATCTACT GGACCTATGC CAAGGCGGTC GAGGGCGGCA CCATCACCGC CGCGGCACGA GGCGTGCTGT CGGAGGACGG AACGGAGGTG AGCGCGGTCG AGGACATCTT CCGGCAGGAG CCGCCCTCAC AGGCGCCGAT GCACTACGGC TCGCGCATCC TGTTCGACGG CGAGGGCCAT GCGATCATCA CCACGGGCGA ACATTCGATC GAGGCCGAGC GCGATCGGGC GCAGGATCTC GGCACCAGCT ACGGCAAGGT GATCCGGGTG GCGCTGGATG GCAGGACGCC CGAGGACAAT CCCTTCGCCG AGAGCGAGGG CCTCGGCACC ATCTGGAGCT ACGGCCACCG CAACATCCAG AGCGCGGCCT TCGACGCGGA GGGCCAGCTC TGGATCGTCG AGCACGGGCC CAAGGGCGGA GACGAGCTGA ACCTGATCCA GCCGGGCGCA AACTACGGCT GGCCCGAAGT GAGCTACGGG GTGAATTACG ACGGCTCGCC CGTGGGCACC GGAGAGCCGC GCGGCGAGGG CTTCACCGAG CCCACCTACT ACTGGGATCC GGTCATCGCG CCGGGCGACA TGACCTTCTA CCGGGGCACC GCGTTCGAGG GCTGGCAGGG CGACCTGCTC GTGGGCTCAA TGAAGCCCGG CGGTCTCGTC CGGCTGACGC TCGAGGAGGG CCGCGTCGCC GGCGAGGAGC GCCTGCTGGG CGACGTGGGC CGGGTCCGCG ATGTCGAGGA GACGGGGGAG GGTCACCTTC TCCTGCTGAT CGACGCGCCC GACGGCGGCA TCCTGCGGGT GACGCCCGAG GCCGGCTGA
|
Protein sequence | MIRRSPLVLL LLPAGAIAQP VEQGPPATDY EPAFETQTRA PALEATGATA EPFVQGLEHP WGIAALPEGG WLVTERPGRL RMVSEDGTLS DPIKGLPEVD ARKQGGLLDV AAGPTFAEDR MIYWTYAKAV EGGTITAAAR GVLSEDGTEV SAVEDIFRQE PPSQAPMHYG SRILFDGEGH AIITTGEHSI EAERDRAQDL GTSYGKVIRV ALDGRTPEDN PFAESEGLGT IWSYGHRNIQ SAAFDAEGQL WIVEHGPKGG DELNLIQPGA NYGWPEVSYG VNYDGSPVGT GEPRGEGFTE PTYYWDPVIA PGDMTFYRGT AFEGWQGDLL VGSMKPGGLV RLTLEEGRVA GEERLLGDVG RVRDVEETGE GHLLLLIDAP DGGILRVTPE AG
|
| |
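The sequence fields can be related to the GC content and protein sequence fields with a short script. The following is a sketch only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it assumes that the amino-acid assignments of translation table 11 match those of the standard genetic code (to my understanding the two tables differ only in which codons may act as start codons). For brevity it uses only the first 30 bases of the gene sequence, so the GC fraction it computes applies to that fragment and not to the whole gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field.
fragment = "ATGATCCGCC GCAGCCCGCT CGTCCTTCTG"
print(translate(fragment))
print(f"{gc_fraction(fragment):.1%}")
```

Pasting the full Gene sequence field in place of `fragment` would let the output be compared against the Protein sequence and GC content fields of the record.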