Gene RPD_1840 details


Gene Information

Locus tag: RPD_1840
Symbol:
ID: 4022322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2059175
End bp: 2060551
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 64%
IMG OID: 637962034
Product: Beta-glucosidase
Protein accession: YP_568977
Protein GI: 91976318
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
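
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene information fields above.
START_BP = 2059175
END_BP = 2060551
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons print True for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2060551 - 2059175 + 1 = 1377 bp, and 1377 / 3 - 1 = 458 aa, which agrees with the listed gene and protein lengths.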


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.185877
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACGC TCACACCGCC GACCCAGATG CCGATGCCGG GTCATCCATC ACTTTTGCAC 
GTCAAGCCCG ATTTCATCTG GGGCGTGTCC AGTTCGAGCT TTCAGATCGA GGGCGCCACC
AACGAAGACG GCCGCGGCGC GAGCATCTGG GACACCTATT GCCGCACCGG ACAAGTCGCC
AACAACGACA CCGGCGACGT CGCCTGCGAC CATTATCATC GCTACAAGGA AGACGTCGCG
CTGATGAAGG CGCTCGGCGT GCAGGCCTAT CGTTTCTCCA TTGCGTGGCC GCGCGTGCTG
CCGCTAGGCG ACGGCGCGGT GAACGAAGCC GGCCTCGCCT TCTACGACCG GCTGATCGAC
GAACTTCAGG CCGCCGGGAT CGAGCCGTGG ATCTGCCTGT ATCACTGGGA CCTGCCGCAA
GCGCTGGAAG ACCGCGGCGG CTGGCTCAAC CGCGACATCG TCGGCTGGTT CGCCGACTAT
GCAAGGCTGA TCGGCGAGCG TTACGGCAAG CGGGTGAAGC GGTTCGCAAC CTTCAACGAA
CCGGGGATCT TCAGCCTGTT CAGCCGCTCC TTCGGGGCGC GCGACCGCAG CGCCGACGAC
AAGCTCCACC GCTGGATCCA TCACGTCAAT CTCGCCCATG GCGCCGCGGT CGATGCGCTG
CGCGAGACGG TGCCGGACGC ACAGATCGGC CTGGTTACAA ATTATCAACC AATCTTCCCA
TCGAGCGACA AGCCCGAGGA CATCGCCGAA GCCGCGCTGA TCGGCGACTA CTGGAATTGT
GCCTTCTCCG ATCCGCAATA TCTCGGCGAG TATCCGGCCC TGATCCGCGA CGCGCTCGCG
GCGCACGTCA GGCCGGGCGA CATGGAGCAG ATTCACCGGC CGCTCGACTG GTTCGGGCTG
AACCACTACA GCCCGGTCTA CATCAACTCC GATCCGAATG CGATCATCGG GCTCGGCTGG
GGCGCGAAGC CCGACAGCAT TCCGCGGACG CCCATCGACT GGACGATCGA ACCGGACGCC
TTCCGCGACA CGCTGATCGA GGTCAGCCGA CGCTACGGCA AGCCGGTCTA CGTCACCGAG
AACGGTTATG GCAGCAACAT CGAAAAGCCG GACGACACCG GCGCGGTGAT CGATCGCGGC
CGCGTCGCCT TCCTGCACGA CTACATCTCC GGCCTCGACG CGGCGATTGC CGCCGGCGCC
GACGTGCGCG GCTATTTCGT CTGGTCGCTG CTCGACAATT TCGAATGGGA GTCGGGCTAC
GGCGTGCGTT TCGGCCTGAC CTATATCGAC TACGCGACGC AGCGGCGGAT TCCGAAGGCG
TCGTTCAATT GGTACGCGGA CGTCATTCGT CAGGCCCGCG GCGGCGCGAC CGCGTAA
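
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any length or composition check. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the whole sequence; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first row of the listing, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence listing, used here only as sample input.
    first_row = (
        "ATGGACACGC TCACACCGCC GACCCAGATG "
        "CCGATGCCGG GTCATCCATC ACTTTTGCAC"
    )
    seq = clean_sequence(first_row)
    print(len(seq))                    # number of bases in the sample row
    print(round(gc_fraction(seq), 3))  # GC fraction of the sample row only
```

Pasting the full listing into `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` can then be compared with the listed GC content.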
 
Protein sequence
MDTLTPPTQM PMPGHPSLLH VKPDFIWGVS SSSFQIEGAT NEDGRGASIW DTYCRTGQVA 
NNDTGDVACD HYHRYKEDVA LMKALGVQAY RFSIAWPRVL PLGDGAVNEA GLAFYDRLID
ELQAAGIEPW ICLYHWDLPQ ALEDRGGWLN RDIVGWFADY ARLIGERYGK RVKRFATFNE
PGIFSLFSRS FGARDRSADD KLHRWIHHVN LAHGAAVDAL RETVPDAQIG LVTNYQPIFP
SSDKPEDIAE AALIGDYWNC AFSDPQYLGE YPALIRDALA AHVRPGDMEQ IHRPLDWFGL
NHYSPVYINS DPNAIIGLGW GAKPDSIPRT PIDWTIEPDA FRDTLIEVSR RYGKPVYVTE
NGYGSNIEKP DDTGAVIDRG RVAFLHDYIS GLDAAIAAGA DVRGYFVWSL LDNFEWESGY
GVRFGLTYID YATQRRIPKA SFNWYADVIR QARGGATA