Gene RPC_1611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1611 
Symbol 
ID3973150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1744323 
End bp1746617 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content70% 
IMG OID637924727 
Productglycoside hydrolase family protein 
Protein accessionYP_531492 
Protein GI90423122 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCA GTCAACCGGG ACCGAGAGCC GCAAGATCCA CCGCTGCAAC GCAAGCCAGC 
GCACAACCGA GCCCGCCGTC GGCCGCGCCG CCCAGCGACG CGGTGGTGCG CGCCCGCGCC
GAGGCGCTGA TCGCCCGGAT GACGCCGGAA GAGAAGGCCG GCCAGATCAC CCAGTATTTC
GACTTCCTCA CCGCGCAGGA CGAAGCCAAG CGGGTGTCCA CGGAGGTGGC CGCCGGCCGC
GCCGGATCGT TGCTGTTCGT CGCCGACCCC GTCGAAATCA ATCGGCTGCA GCGCATCGCC
GTGGAACAGA CCCGGCTGGG GATTCCGCTG TTGTTCGGCT ACGACCTGAT TCACGGCTTT
CGCACCATCC TGCCGGTGCC GCTCGCCATG GCGGCGAGCT GGGATCCGGA CCTGGTCGAG
CGGGCGCAGG CGGTGGCCGC GGCGGAGGCG CGGGCGGTCG GGCTGCATTG GGCGTTCGCC
CCGATGGTCG ATATCGCGCG CGACCCGCGC TGGGGACGGA TGATCGAAGG CGCCGGCGAG
GATCCGTATC TCGGCGCGGC GATGGCGGCG GCGCAGGTGC GCGGCTTTCA GGGCCCGTAT
CTCGGCAGCC CCGATCGGGT GATCGCCGGG CCTAAGCATT TCGCCGGCTA TGGCGCGGCG
CTCGGCGGCC GCGACTACGA CGAAGTGAAC CTGTCCGACA ACGAATTGTG GAACGTGTAT
CTGCCGCCGT TCAAGGCCGC GGTCGACGCC GGCGCCGGCA ATATCATGAC CGCCTATATG
GGGCTGAACG GCGTGCCTGC CACCGGCAAC CATTGGCTGC TGACCGACGT GTTGCGAAAG
GCCTGGGGCT TTGCCGGCTT CGTCGTCACC GACGCAGGCG CCGCGGCCAG CCTGCAGACC
CACGGCGTCG CCCGCGATCT GGCCGATGCC GGCGTCAAGG CGCTCAGCGC GGGCGTCGAC
ATGGAAATGG CGCCGCCGTT TGGCGAGGCC GCCTTCAAGA CGCTGCCGGG CGCGCTCGCC
GCGGGCCGCA TCACGACGCC ACAGTTGGAC GATGCGGTGC GGCGTGTGCT CGAAGCCAAG
ATCCGGCTGG GGCTGTTCGA ACAGCCCTAT GTCGACGTCG CGCGCGCCAG CGAGGTGCTC
GCCGATCCCG CGCATCGCGC CGTGGCGCGG CAGGCGGCGG AGCGCTCCGC GGTGCTGTTG
CGCAATGAAG GCGCGTTGTT GCCGCTCGAT CCGCACGCCT TGCGGCGCAT TGCGGTGCTC
GGGCCGCTGG CCGACGCCGC GCGGGAGACC GTCGGACCCT GGGTGTTCCA GCAGGACGAC
AGCGAGACGG TGACCGTGCT GGCCGGCATC AGGGCCCGGC TCGGCGACGC CGTACGCATC
GACACCACGC CGGGGGTCAG CATTCCGGCG CGGCAGTTCT CCTCGATCTT CGAGGGCCCG
GAGCACGCGC GGGCGCCGCG CATCGCGGTC GACGACGACG CCGAAATCGA GCGCGCGGTC
AACTACGCGC GCGGCGCCGA TGTGGCCATT GTGGTGCTCG GCGAAGCCCA GATCATGATC
GGCGAGAACG CCTCGCGCTC GTCGCTGGAT CTGCCCGGCC GGCAGCAGCA ACTGCTCGAC
GCCGTGCTCG CCACCGCAAC GCCGACCGTG GTGCTGCTGA TGAGCGCGCG GCCGCTCGAT
CTGCGCGGCA GCGCGCCGCA GGCCTTGATG ACGATCTGGT ATCCCGGCTC GCAAGGCGGC
GCCGCGGTCG CCGGCCTGTT GTTCGGCGAC GTCGCGCCGG GCGGCAAATT GCCGTTCAAC
TGGCCGCGCA ACATCGGGCA ATTGCCGCTG CCCTATGCGC GGCTCAACTC GCATCAGCCG
AGCAGCGCCG AGCAGCGCTA CTGGAACGAG CCGAACACGC CGCTGTATGC GTTCGGCTAC
GGCCTCAGCT ATTCGTCGTT CAGCTATGCG AAGCTGCACA TCGACCGACA AAAAATCACC
CCCGCCGAGC GCATGAGCGT CAGCGTCGAA TTGACCAACA CCGGCCGCCG CGTCGCCGAC
GAGGTCGCGC AGCTCTACAT CCACCAGCGC TACGGCGCCT CGGCCCGCCC GGTGCGCGAG
CTGAAAGGAT TTCAACGCGT CACGCTGGCG CCCGGCGAAA CCCGAACGCT GCGCTTCACG
CTCGGCCCCG AGCACCTCCG CTACTGGACC GCCTCCGCGC GCGCCTTTGT GCACGACGAC
TCGGTGTTCG ATGTGTTCGT CGGCGGCGAT TCGTCCGCGT CACTTTCGGC GAGCTTCGAG
GTCTGCAAGG CGTAA
 
Protein sequence
MKTSQPGPRA ARSTAATQAS AQPSPPSAAP PSDAVVRARA EALIARMTPE EKAGQITQYF 
DFLTAQDEAK RVSTEVAAGR AGSLLFVADP VEINRLQRIA VEQTRLGIPL LFGYDLIHGF
RTILPVPLAM AASWDPDLVE RAQAVAAAEA RAVGLHWAFA PMVDIARDPR WGRMIEGAGE
DPYLGAAMAA AQVRGFQGPY LGSPDRVIAG PKHFAGYGAA LGGRDYDEVN LSDNELWNVY
LPPFKAAVDA GAGNIMTAYM GLNGVPATGN HWLLTDVLRK AWGFAGFVVT DAGAAASLQT
HGVARDLADA GVKALSAGVD MEMAPPFGEA AFKTLPGALA AGRITTPQLD DAVRRVLEAK
IRLGLFEQPY VDVARASEVL ADPAHRAVAR QAAERSAVLL RNEGALLPLD PHALRRIAVL
GPLADAARET VGPWVFQQDD SETVTVLAGI RARLGDAVRI DTTPGVSIPA RQFSSIFEGP
EHARAPRIAV DDDAEIERAV NYARGADVAI VVLGEAQIMI GENASRSSLD LPGRQQQLLD
AVLATATPTV VLLMSARPLD LRGSAPQALM TIWYPGSQGG AAVAGLLFGD VAPGGKLPFN
WPRNIGQLPL PYARLNSHQP SSAEQRYWNE PNTPLYAFGY GLSYSSFSYA KLHIDRQKIT
PAERMSVSVE LTNTGRRVAD EVAQLYIHQR YGASARPVRE LKGFQRVTLA PGETRTLRFT
LGPEHLRYWT ASARAFVHDD SVFDVFVGGD SSASLSASFE VCKA