Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4125 |
Symbol | |
ID | 3970318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4587093 |
End bp | 4588259 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637927229 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_533970 |
Protein GI | 90425600 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.285901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCTC CACTCGTTTG GGTCACCGGA TCGCTGGTCG CGGCCATCGT ACTGACCGCC TCGTTCCTGA TCGCCACCAC GGCGAGCCGC GGCGAAACCA GTTCGTTCGC CTCCTCCGCC GGCCCGCTCG AGGTCCACAC CGTCGCCTCC GGCCTGGTCA ATCCATGGGC GCTGGCGTTC CTGCCGGACC GGCGCATGCT GGTCACCGAG AAGCCCGGCC GGATGCGGAT CGTGACGCCG CAGGGCCAGC TCTCGCCGCC GCTGCAGGGC GTCCCCGAGG TCTGGGCCTC CGGCCAAGGC GGGCTGCTCG ACGTCGTGGT CGACAAGTCG TTTGCTGATA ACAAGACTCT GTATTTCTGT TTCGCCGAAC GCACGGGCAA TGGCGGCCGC ACCGCGGTGG CGCGCGCCAA GCTCGACGAT GCGCGCCCGC CGCGGCTCGA CGATGTGAAG ATCATCTTCC GCCAGGACGG CCCGTTGTCG AAGGGCAATC ACTACGGCTG CCGGATCGCG CAGGCGCCGG ACGGCAATTT GTTCGTCACC CTCGGCGAGC ATTTCTTCGC CCGCGACCAG GCGCAGACCC TCGCCAACCA TCTCGGCAAG CTGATCCACA TCACCCCGGA CGGCGCCGCC GCGAAAGACA ATCCGTTTCT CGGCCGCTCC GACGCCAGGC CGGAGATCTG GAGCTACGGC CACCGCAATC CGCAGGGCCT CGCCTTCAAT CCGAGGTCGG GCGCGCTGTG GGAGATCGAG CACGGCCCGC GCGGCGGCGA CGAGGTCAAC ATCATCGCCC CCGGCAACAA CTACGGCTGG CCGGTGATCG GCTTCGGCAT CGATTACGAC GGCAGCAAGA TTCACGACAG CACCATCAAA GACGGCATGC AGCAGCCGAT CAAATACTGG GTGCCGTCGA TCGCGCCGAG CGGCATGGCG TTCTACATCG GCGCGCTGTT TCCAGCGTGG CGCGGCAGCC TGTTCACCGG CGCGCTCGCT GGCCAGATGC TGGTGCGGCT GTCGCTCGAC GGCGACAAAG TCACCGGCGA GGAGCGGCTG TTACACCAAC TCGGCGAACG CATCCGCGAC GTCCGTCAGG GACCGGACGG CGCGCTCTAT CTGCTCACCG ACAGCGCCAC GGGCCGCATC CTGCGGGTGA CGCCGGCCGG CAAGTAG
|
Protein sequence | MKAPLVWVTG SLVAAIVLTA SFLIATTASR GETSSFASSA GPLEVHTVAS GLVNPWALAF LPDRRMLVTE KPGRMRIVTP QGQLSPPLQG VPEVWASGQG GLLDVVVDKS FADNKTLYFC FAERTGNGGR TAVARAKLDD ARPPRLDDVK IIFRQDGPLS KGNHYGCRIA QAPDGNLFVT LGEHFFARDQ AQTLANHLGK LIHITPDGAA AKDNPFLGRS DARPEIWSYG HRNPQGLAFN PRSGALWEIE HGPRGGDEVN IIAPGNNYGW PVIGFGIDYD GSKIHDSTIK DGMQQPIKYW VPSIAPSGMA FYIGALFPAW RGSLFTGALA GQMLVRLSLD GDKVTGEERL LHQLGERIRD VRQGPDGALY LLTDSATGRI LRVTPAGK
|
| |