Gene Rpal_1936 details


Gene Information

Locus tag: Rpal_1936
Symbol:
ID: 6409596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2087851
End bp: 2089227
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 64%
IMG OID: 642711822
Product: beta-galactosidase
Protein accession: YP_001990934
Protein GI: 192290329
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
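
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the gene record.
START_BP = 2087851
END_BP = 2089227
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value stated in the record.
coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)
```

Under these assumptions both checks pass: 2089227 - 2087851 + 1 = 1377 bp, and 1377 / 3 - 1 = 458 aa.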


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.111209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCTCAT CGCTCATCCC TCCCGCCTCT GAGCCCGTGG CCGGTTTGCC GGTGTTGTCG 
CACATCCGCA AAGACTTCAT CTGGGGCGTG TCGACCGCGA GCTTCCAGAT CGAAGGCGCC
GCCAACGAGG ACGGCCGCGG CCAGAGCATC TGGGACGTGT ATTGCCGCTC CGGCTATGTC
GCCAACAACG ACACCGGCGA TGTCGCTTGC GATCACTATC ATCGCTACCA GGAAGACGTC
GCGCTGATGA AGACGCTCGG CATTCAGGCG TATCGATTCT CGATCGCCTG GCCGCGGATC
TTCCCGCAAG GCACCGGCGC GATCAACGAG CCCGGCCTCG CCTTCTATGA TCGCCTGATT
GATGCGCTGG AGGCCGCCGG CATCGAGCCG TGGATCTGCC TGTATCATTG GGATCTGCCG
CAAGCGATGG AAGAGCGCGG CGGCTGGATG AATCGCGATA TCGTCGGCTG GTTCGCCGAC
TACGCGCGGG TGATCGGCGA GCGTTACGGC AAGCGCGTGA AGCGGTTTGC CACCTTCAAT
GAGCCCGGCA TCTTCAGCCT GTTCAGCCGC TCGTTCGGCG CGCGCGACCG CAGCGCCGAC
GAGAAGCTGC ACCGCTGGAT TCACCACGTC AATCTGGCGC ACGGCGCCGC CGTCGACGTG
CTGCGCGAGA CCGTGGCGGA TGCAAAGATC GGATTGGTCA CCAACTATCA GCCGGTGTAC
CCGTCGAGCG ACAAGCCGGA AGACGTCGCT GAGGCCAAGC TGATCAGCGA CTACTGGAAT
CGCGCTTTCG CCGATCCGCA ATATCGCGGT GAGTATCCAG GCCTGATCCG CGACGCCGTC
AAGCCGTATA TCCAGCCCGG CGACATGGAG CGCATCCACC GGCCGCTCGA CTGGTTCGGC
CTGAACCATT ACAGCCCCGT CTATATCAAC TCCGATCCGA ACGCGATCGT CGGACTCGGC
TGGGGGCCGA AGCCCGAGGG TATTCCGCGC TCGCCGATCG ACTGGACGAT CGAGCCGGAT
GCCTTCCGCG ATACGCTGAT CGAGATCAGC CGCCGCTACG GCAAGCCGGT TTACGTCACC
GAGAACGGTT ACGGCAGCAA TATCGAGAAG CCAGACGCCA ACGGCGAAGT GGTCGATCCT
GGCAGGATCG GCTTCCTGCG CGACTACATC ACCGCCCTCG ACCAGGCGGT CGCAGCCGGC
GCCGATGTGC GCGGCTACAT GGTGTGGTCG CTGCTCGACA ATTTCGAGTG GGAGTCCGGC
TACAGCGTGC GCTTTGGGCT GATCTACATC GACTATGCGA CGCTGCGCCG GATTCCGAAG
GCGTCGTTCA AGTGGTTCGC CGACGTGATC CGTCACGCCC GCGGCGCGAG CGCCTAA
 
Protein sequence
MASSLIPPAS EPVAGLPVLS HIRKDFIWGV STASFQIEGA ANEDGRGQSI WDVYCRSGYV 
ANNDTGDVAC DHYHRYQEDV ALMKTLGIQA YRFSIAWPRI FPQGTGAINE PGLAFYDRLI
DALEAAGIEP WICLYHWDLP QAMEERGGWM NRDIVGWFAD YARVIGERYG KRVKRFATFN
EPGIFSLFSR SFGARDRSAD EKLHRWIHHV NLAHGAAVDV LRETVADAKI GLVTNYQPVY
PSSDKPEDVA EAKLISDYWN RAFADPQYRG EYPGLIRDAV KPYIQPGDME RIHRPLDWFG
LNHYSPVYIN SDPNAIVGLG WGPKPEGIPR SPIDWTIEPD AFRDTLIEIS RRYGKPVYVT
ENGYGSNIEK PDANGEVVDP GRIGFLRDYI TALDQAVAAG ADVRGYMVWS LLDNFEWESG
YSVRFGLIYI DYATLRRIPK ASFKWFADVI RHARGASA
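
Both sequences are printed in blocks of 10 characters separated by spaces, so the whitespace has to be removed before the text is used for length or composition checks. The following is a minimal sketch of that cleanup together with a simple G+C fraction; the helper names are hypothetical, and whether the record's "GC content" figure was computed in exactly this way is an assumption.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# Demonstration on the first two blocks of the gene sequence only.
sample = clean_sequence("ATGGCCTCAT CGCTCATCCC")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the stated gene length, and `gc_fraction` can then be compared with the reported GC content.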