Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1294 |
Symbol | |
ID | 3908167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1476471 |
End bp | 1477646 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883188 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_484915 |
Protein GI | 86748419 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.298447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGC TCCTGGTCTG GGTTTCCGGC ACGCTGACGG TGGCGACGGT GATCATCGCC TCGTTCCTGA TCGCGACCAA GAGCAGCGGC GAGCCGGCCT CGTTCAAGTC CTCCGCGGGC CCGCTGGCGG TGAAGACCTT CGCACAGAAT CTCGACAGCC CCTGGGCGCT GGCGTTCCTG CCCGAGGGGC GCGTGCTGGT CACCGAGAAG CCGGGGCGGA TGCGGGTGGT GTCGGCGCAG GGTGCGCTGT CGCCGCCGGT CCGGGGCGTT CCCGAGGTCT GGGCCACCGG CCAGGGCGGG CTGCTCGACG TCGTCACCGA CACGAACTTC GCCGCCAACC GGACGATCTA TTTCTGCTAC GCTGAACGCA CCAGCCAGGG CGGCCGCAGC GGCGGACGCA CCGCCGTCGC CCGCGCGGCG CTCGTCGAGA GCGACGCGCC ACGGCTCGAC GACGTGAACG TGATCTTCCG TCAGGACGGC CCGCTGTCCT CGGGCAATCA CTATGGCTGC CGGATCGCGC AAGGCGCCGA CGGACACCTG TTCGTCACGC TCGGCGATCA TTTCAGCTTC CGCGATCAGG CGCAGAATCT CGGCAACCAT CTCGGCAAGA TCATCCGCAT CGCGCCGGAC GGCAGCGTGC CGGCCGGCAA TCCGTTCGTC GGGCGCGCCG ACGCCAGGCC GGAGATCTGG AGCTACGGCC ACCGCAACCC GCAATCGCTC GCCCTCAATC CGGCGAGCGG CGGGTTGTGG GAGATCGAGC ACGGCCCGCG CGGCGGCGAC GAGGTCAACA TCATTCGGCC CGGCAACAAT TATGGCTGGC CGGTGATCGG CTACGGAATC GACTACAGCG GCGCCACCAT CCACGAGGCG GCCGCGAAGT CCGGTATGGA GCAGCCGGTC AAATATTGGG TGCCGTCGAT CGCGCCGTCT GGGATGGCGT TCTACACCGC CAAGCTGTTT CCGACATGGG CCGGCAGCCT GTTCACCGGC GCGCTCGCCG GCAAGATGCT GGTGCGGCTG TCGCTCGCCG GCGACAAGGT GACCGGCGAA GAACGCCTGC TGGAGGCGCT GAACGAACGC ATCCGCGACG TCCGCCAGGG CCCCGACGGC GCGCTGTGGC TGCTGACCGA CAACGCCGCC GGACGCATCC TGCGCGTGAC GCCGGCCGCG GACTGA
|
Protein sequence | MKTLLVWVSG TLTVATVIIA SFLIATKSSG EPASFKSSAG PLAVKTFAQN LDSPWALAFL PEGRVLVTEK PGRMRVVSAQ GALSPPVRGV PEVWATGQGG LLDVVTDTNF AANRTIYFCY AERTSQGGRS GGRTAVARAA LVESDAPRLD DVNVIFRQDG PLSSGNHYGC RIAQGADGHL FVTLGDHFSF RDQAQNLGNH LGKIIRIAPD GSVPAGNPFV GRADARPEIW SYGHRNPQSL ALNPASGGLW EIEHGPRGGD EVNIIRPGNN YGWPVIGYGI DYSGATIHEA AAKSGMEQPV KYWVPSIAPS GMAFYTAKLF PTWAGSLFTG ALAGKMLVRL SLAGDKVTGE ERLLEALNER IRDVRQGPDG ALWLLTDNAA GRILRVTPAA D
|
| |