Gene Information |
Locus tag | RPB_3627 |
Symbol | |
ID | 3911429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4162894 |
End bp | 4164270 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637885529 |
Product | Beta-glucosidase |
Protein accession | YP_487233 |
Protein GI | 86750737 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.904469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.578785 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAGC TCGCACCGCC CACCAACGAG CCGATGCCGG GCCTGCCATC GCTGACCCAC GTCAGGCCCG ACTTCATCTG GGGTGCATCG ACGGCGAGCT TTCAGATCGA GGGCGCGGCC AACGAGGATG GACGCGGTCA AAGCGTCTGG GACACCTATT GCCGCACCGG CCAGGTCGCC AACAACGACA CCGGCGACGT CGCCTGCGAC CACTATCATC GCTACAAGGA AGACGTCGCG CTGATGAAAG CGCTCGGCCT GCAGGCCTAT CGCTTCTCGG TGGCCTGGCC GCGCGTGCTG CCGCAAGGCA CCGGCGCGGT GAACGAGGCC GGACTCGCCT TCTACGACCG GCTGATCGAC GAACTCGAGG CCGCGGGAAT CGAGCCGTGG CTCTGTCTGT ATCACTGGGA TCTGCCGCAG GCGCTCGAAG ATCGCGGCGG CTGGCTCAAT CGCGACATCG TCGACTGGTT CGCCGACTAT GCCCGGTTGA TTGGCCAACG CTACGGCCGG CGCGTCAAGC GGTTCGTCAC CTTCAACGAG CCCGGCATCT TCAGCCTGTT CAGCCGGTCT TTCGGCGCCC GCGACCGCAG CGCCGACGAC AAGCTGCATC GCTGGATCCA CCACGTCAAT CTCGCCCACG GCGCGGCAGT CGACGCGCTG CGCCAAACCG TCGCCGACGC CCAAATCGGG CTGGTCACCA ACTATCAGCC GATCTATCCG TCGACCGACA AGCCCGAGGA CATCACCGAG GCCGCGCTGA TCGGCGATTA CTGGAACCGG GCATTCTCCG ATCCGCAATA TCTCGGCGAA TATCCCTCGC TGATCCGCGA CGCGATCGCT CCACACATCC AGCCCGGCGA CATAGCGCGA ATCCACCGTC CGCTCGACTG GTTCGGGCTG AACCATTACA GCCCGGTGTA TATCAACTCC GATCCCAATG CGATCATCGG TCTCGGCTGG GGCGCCAAAC CCGATGGCAT TCCGCGCACG CCGATCGACT GGACCATCGA GCCCGACGCC TTTCGCGACA CGTTGATCGA GGTCAGCCGC CGCTACGGCA AGCCGGTCTA CGTCACCGAG AACGGCTACG GCAGCAACAT CGAGAAGCCC GACGATACCG GCGCGGTGAT CGATCCCGGC CGCATCGCCT TTCTGCGCGA CTACATCTCC GGCCTCGATG CGGCGATCGC CGCGGGCGCC GACGTCCGAG GCTATTTCGT CTGGTCGCTG CTCGACAATT TCGAATGGGA GTCGGGCTAC AAGGTCCGCT TCGGCCTCGT TTATGTCGAC TACGCGACGC AGCGACGAAT TCCGAAATCA TCGTTCCGCT GGTACGCCGA CGTCATTCGC CGGGCCCGCG GCGAGACGAC AACTTAA
|
Protein sequence | MDKLAPPTNE PMPGLPSLTH VRPDFIWGAS TASFQIEGAA NEDGRGQSVW DTYCRTGQVA NNDTGDVACD HYHRYKEDVA LMKALGLQAY RFSVAWPRVL PQGTGAVNEA GLAFYDRLID ELEAAGIEPW LCLYHWDLPQ ALEDRGGWLN RDIVDWFADY ARLIGQRYGR RVKRFVTFNE PGIFSLFSRS FGARDRSADD KLHRWIHHVN LAHGAAVDAL RQTVADAQIG LVTNYQPIYP STDKPEDITE AALIGDYWNR AFSDPQYLGE YPSLIRDAIA PHIQPGDIAR IHRPLDWFGL NHYSPVYINS DPNAIIGLGW GAKPDGIPRT PIDWTIEPDA FRDTLIEVSR RYGKPVYVTE NGYGSNIEKP DDTGAVIDPG RIAFLRDYIS GLDAAIAAGA DVRGYFVWSL LDNFEWESGY KVRFGLVYVD YATQRRIPKS SFRWYADVIR RARGETTT
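The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source database: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; spaces between blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record above (Start bp, End bp).
length = gene_length(4162894, 4164270)
print(length)                  # 1377
print(protein_length(length))  # 458

# First two 10-base blocks of the gene sequence, as a small illustration only;
# the 64% figure in the record refers to the whole gene.
print(gc_percent("ATGGACAAGC TCGCACCGCC"))
```

Both computed lengths agree with the Gene Length (1377 bp) and Protein Length (458 aa) fields, which supports the inclusive-coordinate assumption.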