Gene RPB_1294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1294 
Symbol 
ID3908167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1476471 
End bp1477646 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content69% 
IMG OID637883188 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_484915 
Protein GI86748419 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.298447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGC TCCTGGTCTG GGTTTCCGGC ACGCTGACGG TGGCGACGGT GATCATCGCC 
TCGTTCCTGA TCGCGACCAA GAGCAGCGGC GAGCCGGCCT CGTTCAAGTC CTCCGCGGGC
CCGCTGGCGG TGAAGACCTT CGCACAGAAT CTCGACAGCC CCTGGGCGCT GGCGTTCCTG
CCCGAGGGGC GCGTGCTGGT CACCGAGAAG CCGGGGCGGA TGCGGGTGGT GTCGGCGCAG
GGTGCGCTGT CGCCGCCGGT CCGGGGCGTT CCCGAGGTCT GGGCCACCGG CCAGGGCGGG
CTGCTCGACG TCGTCACCGA CACGAACTTC GCCGCCAACC GGACGATCTA TTTCTGCTAC
GCTGAACGCA CCAGCCAGGG CGGCCGCAGC GGCGGACGCA CCGCCGTCGC CCGCGCGGCG
CTCGTCGAGA GCGACGCGCC ACGGCTCGAC GACGTGAACG TGATCTTCCG TCAGGACGGC
CCGCTGTCCT CGGGCAATCA CTATGGCTGC CGGATCGCGC AAGGCGCCGA CGGACACCTG
TTCGTCACGC TCGGCGATCA TTTCAGCTTC CGCGATCAGG CGCAGAATCT CGGCAACCAT
CTCGGCAAGA TCATCCGCAT CGCGCCGGAC GGCAGCGTGC CGGCCGGCAA TCCGTTCGTC
GGGCGCGCCG ACGCCAGGCC GGAGATCTGG AGCTACGGCC ACCGCAACCC GCAATCGCTC
GCCCTCAATC CGGCGAGCGG CGGGTTGTGG GAGATCGAGC ACGGCCCGCG CGGCGGCGAC
GAGGTCAACA TCATTCGGCC CGGCAACAAT TATGGCTGGC CGGTGATCGG CTACGGAATC
GACTACAGCG GCGCCACCAT CCACGAGGCG GCCGCGAAGT CCGGTATGGA GCAGCCGGTC
AAATATTGGG TGCCGTCGAT CGCGCCGTCT GGGATGGCGT TCTACACCGC CAAGCTGTTT
CCGACATGGG CCGGCAGCCT GTTCACCGGC GCGCTCGCCG GCAAGATGCT GGTGCGGCTG
TCGCTCGCCG GCGACAAGGT GACCGGCGAA GAACGCCTGC TGGAGGCGCT GAACGAACGC
ATCCGCGACG TCCGCCAGGG CCCCGACGGC GCGCTGTGGC TGCTGACCGA CAACGCCGCC
GGACGCATCC TGCGCGTGAC GCCGGCCGCG GACTGA
 
Protein sequence
MKTLLVWVSG TLTVATVIIA SFLIATKSSG EPASFKSSAG PLAVKTFAQN LDSPWALAFL 
PEGRVLVTEK PGRMRVVSAQ GALSPPVRGV PEVWATGQGG LLDVVTDTNF AANRTIYFCY
AERTSQGGRS GGRTAVARAA LVESDAPRLD DVNVIFRQDG PLSSGNHYGC RIAQGADGHL
FVTLGDHFSF RDQAQNLGNH LGKIIRIAPD GSVPAGNPFV GRADARPEIW SYGHRNPQSL
ALNPASGGLW EIEHGPRGGD EVNIIRPGNN YGWPVIGYGI DYSGATIHEA AAKSGMEQPV
KYWVPSIAPS GMAFYTAKLF PTWAGSLFTG ALAGKMLVRL SLAGDKVTGE ERLLEALNER
IRDVRQGPDG ALWLLTDNAA GRILRVTPAA D