Gene RPB_3627 details


Gene Information

Locus tag: RPB_3627
Symbol:
ID: 3911429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4162894
End bp: 4164270
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 64%
IMG OID: 637885529
Product: Beta-glucosidase
Protein accession: YP_487233
Protein GI: 86750737
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
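
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 4162894
END_BP = 4164270
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1377
print(expected_protein_length(GENE_LENGTH_BP))  # 458
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.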


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.904469
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.578785
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAGC TCGCACCGCC CACCAACGAG CCGATGCCGG GCCTGCCATC GCTGACCCAC 
GTCAGGCCCG ACTTCATCTG GGGTGCATCG ACGGCGAGCT TTCAGATCGA GGGCGCGGCC
AACGAGGATG GACGCGGTCA AAGCGTCTGG GACACCTATT GCCGCACCGG CCAGGTCGCC
AACAACGACA CCGGCGACGT CGCCTGCGAC CACTATCATC GCTACAAGGA AGACGTCGCG
CTGATGAAAG CGCTCGGCCT GCAGGCCTAT CGCTTCTCGG TGGCCTGGCC GCGCGTGCTG
CCGCAAGGCA CCGGCGCGGT GAACGAGGCC GGACTCGCCT TCTACGACCG GCTGATCGAC
GAACTCGAGG CCGCGGGAAT CGAGCCGTGG CTCTGTCTGT ATCACTGGGA TCTGCCGCAG
GCGCTCGAAG ATCGCGGCGG CTGGCTCAAT CGCGACATCG TCGACTGGTT CGCCGACTAT
GCCCGGTTGA TTGGCCAACG CTACGGCCGG CGCGTCAAGC GGTTCGTCAC CTTCAACGAG
CCCGGCATCT TCAGCCTGTT CAGCCGGTCT TTCGGCGCCC GCGACCGCAG CGCCGACGAC
AAGCTGCATC GCTGGATCCA CCACGTCAAT CTCGCCCACG GCGCGGCAGT CGACGCGCTG
CGCCAAACCG TCGCCGACGC CCAAATCGGG CTGGTCACCA ACTATCAGCC GATCTATCCG
TCGACCGACA AGCCCGAGGA CATCACCGAG GCCGCGCTGA TCGGCGATTA CTGGAACCGG
GCATTCTCCG ATCCGCAATA TCTCGGCGAA TATCCCTCGC TGATCCGCGA CGCGATCGCT
CCACACATCC AGCCCGGCGA CATAGCGCGA ATCCACCGTC CGCTCGACTG GTTCGGGCTG
AACCATTACA GCCCGGTGTA TATCAACTCC GATCCCAATG CGATCATCGG TCTCGGCTGG
GGCGCCAAAC CCGATGGCAT TCCGCGCACG CCGATCGACT GGACCATCGA GCCCGACGCC
TTTCGCGACA CGTTGATCGA GGTCAGCCGC CGCTACGGCA AGCCGGTCTA CGTCACCGAG
AACGGCTACG GCAGCAACAT CGAGAAGCCC GACGATACCG GCGCGGTGAT CGATCCCGGC
CGCATCGCCT TTCTGCGCGA CTACATCTCC GGCCTCGATG CGGCGATCGC CGCGGGCGCC
GACGTCCGAG GCTATTTCGT CTGGTCGCTG CTCGACAATT TCGAATGGGA GTCGGGCTAC
AAGGTCCGCT TCGGCCTCGT TTATGTCGAC TACGCGACGC AGCGACGAAT TCCGAAATCA
TCGTTCCGCT GGTACGCCGA CGTCATTCGC CGGGCCCGCG GCGAGACGAC AACTTAA
 
Protein sequence
MDKLAPPTNE PMPGLPSLTH VRPDFIWGAS TASFQIEGAA NEDGRGQSVW DTYCRTGQVA 
NNDTGDVACD HYHRYKEDVA LMKALGLQAY RFSVAWPRVL PQGTGAVNEA GLAFYDRLID
ELEAAGIEPW LCLYHWDLPQ ALEDRGGWLN RDIVDWFADY ARLIGQRYGR RVKRFVTFNE
PGIFSLFSRS FGARDRSADD KLHRWIHHVN LAHGAAVDAL RQTVADAQIG LVTNYQPIYP
STDKPEDITE AALIGDYWNR AFSDPQYLGE YPSLIRDAIA PHIQPGDIAR IHRPLDWFGL
NHYSPVYINS DPNAIIGLGW GAKPDGIPRT PIDWTIEPDA FRDTLIEVSR RYGKPVYVTE
NGYGSNIEKP DDTGAVIDPG RIAFLRDYIS GLDAAIAAGA DVRGYFVWSL LDNFEWESGY
KVRFGLVYVD YATQRRIPKS SFRWYADVIR RARGETTT
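
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only: it builds the codon-to-amino-acid mapping from the compact NCBI-style string for the standard genetic code, which, as far as amino acid assignments go, is the same mapping used by translation table 11 (the tables differ in permitted start codons). The names `CODON_TABLE`, `clean` and `translate` are introduced here for illustration.

```python
# Codon table in NCBI order: bases T, C, A, G for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGGACAAGC TCGCACCGCC CACCAACGAG"))  # MDKLAPPTNE
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over the whole record.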