Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1611 |
Symbol | |
ID | 3973150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1744323 |
End bp | 1746617 |
Gene Length | 2295 bp |
Protein Length | 764 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637924727 |
Product | glycoside hydrolase family protein |
Protein accession | YP_531492 |
Protein GI | 90423122 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACCA GTCAACCGGG ACCGAGAGCC GCAAGATCCA CCGCTGCAAC GCAAGCCAGC GCACAACCGA GCCCGCCGTC GGCCGCGCCG CCCAGCGACG CGGTGGTGCG CGCCCGCGCC GAGGCGCTGA TCGCCCGGAT GACGCCGGAA GAGAAGGCCG GCCAGATCAC CCAGTATTTC GACTTCCTCA CCGCGCAGGA CGAAGCCAAG CGGGTGTCCA CGGAGGTGGC CGCCGGCCGC GCCGGATCGT TGCTGTTCGT CGCCGACCCC GTCGAAATCA ATCGGCTGCA GCGCATCGCC GTGGAACAGA CCCGGCTGGG GATTCCGCTG TTGTTCGGCT ACGACCTGAT TCACGGCTTT CGCACCATCC TGCCGGTGCC GCTCGCCATG GCGGCGAGCT GGGATCCGGA CCTGGTCGAG CGGGCGCAGG CGGTGGCCGC GGCGGAGGCG CGGGCGGTCG GGCTGCATTG GGCGTTCGCC CCGATGGTCG ATATCGCGCG CGACCCGCGC TGGGGACGGA TGATCGAAGG CGCCGGCGAG GATCCGTATC TCGGCGCGGC GATGGCGGCG GCGCAGGTGC GCGGCTTTCA GGGCCCGTAT CTCGGCAGCC CCGATCGGGT GATCGCCGGG CCTAAGCATT TCGCCGGCTA TGGCGCGGCG CTCGGCGGCC GCGACTACGA CGAAGTGAAC CTGTCCGACA ACGAATTGTG GAACGTGTAT CTGCCGCCGT TCAAGGCCGC GGTCGACGCC GGCGCCGGCA ATATCATGAC CGCCTATATG GGGCTGAACG GCGTGCCTGC CACCGGCAAC CATTGGCTGC TGACCGACGT GTTGCGAAAG GCCTGGGGCT TTGCCGGCTT CGTCGTCACC GACGCAGGCG CCGCGGCCAG CCTGCAGACC CACGGCGTCG CCCGCGATCT GGCCGATGCC GGCGTCAAGG CGCTCAGCGC GGGCGTCGAC ATGGAAATGG CGCCGCCGTT TGGCGAGGCC GCCTTCAAGA CGCTGCCGGG CGCGCTCGCC GCGGGCCGCA TCACGACGCC ACAGTTGGAC GATGCGGTGC GGCGTGTGCT CGAAGCCAAG ATCCGGCTGG GGCTGTTCGA ACAGCCCTAT GTCGACGTCG CGCGCGCCAG CGAGGTGCTC GCCGATCCCG CGCATCGCGC CGTGGCGCGG CAGGCGGCGG AGCGCTCCGC GGTGCTGTTG CGCAATGAAG GCGCGTTGTT GCCGCTCGAT CCGCACGCCT TGCGGCGCAT TGCGGTGCTC GGGCCGCTGG CCGACGCCGC GCGGGAGACC GTCGGACCCT GGGTGTTCCA GCAGGACGAC AGCGAGACGG TGACCGTGCT GGCCGGCATC AGGGCCCGGC TCGGCGACGC CGTACGCATC GACACCACGC CGGGGGTCAG CATTCCGGCG CGGCAGTTCT CCTCGATCTT CGAGGGCCCG GAGCACGCGC GGGCGCCGCG CATCGCGGTC GACGACGACG CCGAAATCGA GCGCGCGGTC AACTACGCGC GCGGCGCCGA TGTGGCCATT GTGGTGCTCG GCGAAGCCCA GATCATGATC GGCGAGAACG CCTCGCGCTC GTCGCTGGAT CTGCCCGGCC GGCAGCAGCA ACTGCTCGAC GCCGTGCTCG CCACCGCAAC GCCGACCGTG GTGCTGCTGA TGAGCGCGCG GCCGCTCGAT CTGCGCGGCA GCGCGCCGCA GGCCTTGATG ACGATCTGGT ATCCCGGCTC GCAAGGCGGC GCCGCGGTCG CCGGCCTGTT GTTCGGCGAC GTCGCGCCGG GCGGCAAATT GCCGTTCAAC TGGCCGCGCA ACATCGGGCA ATTGCCGCTG CCCTATGCGC GGCTCAACTC GCATCAGCCG AGCAGCGCCG AGCAGCGCTA CTGGAACGAG CCGAACACGC CGCTGTATGC GTTCGGCTAC GGCCTCAGCT ATTCGTCGTT CAGCTATGCG AAGCTGCACA TCGACCGACA AAAAATCACC CCCGCCGAGC GCATGAGCGT CAGCGTCGAA TTGACCAACA CCGGCCGCCG CGTCGCCGAC GAGGTCGCGC AGCTCTACAT CCACCAGCGC TACGGCGCCT CGGCCCGCCC GGTGCGCGAG CTGAAAGGAT TTCAACGCGT CACGCTGGCG CCCGGCGAAA CCCGAACGCT GCGCTTCACG CTCGGCCCCG AGCACCTCCG CTACTGGACC GCCTCCGCGC GCGCCTTTGT GCACGACGAC TCGGTGTTCG ATGTGTTCGT CGGCGGCGAT TCGTCCGCGT CACTTTCGGC GAGCTTCGAG GTCTGCAAGG CGTAA
|
Protein sequence | MKTSQPGPRA ARSTAATQAS AQPSPPSAAP PSDAVVRARA EALIARMTPE EKAGQITQYF DFLTAQDEAK RVSTEVAAGR AGSLLFVADP VEINRLQRIA VEQTRLGIPL LFGYDLIHGF RTILPVPLAM AASWDPDLVE RAQAVAAAEA RAVGLHWAFA PMVDIARDPR WGRMIEGAGE DPYLGAAMAA AQVRGFQGPY LGSPDRVIAG PKHFAGYGAA LGGRDYDEVN LSDNELWNVY LPPFKAAVDA GAGNIMTAYM GLNGVPATGN HWLLTDVLRK AWGFAGFVVT DAGAAASLQT HGVARDLADA GVKALSAGVD MEMAPPFGEA AFKTLPGALA AGRITTPQLD DAVRRVLEAK IRLGLFEQPY VDVARASEVL ADPAHRAVAR QAAERSAVLL RNEGALLPLD PHALRRIAVL GPLADAARET VGPWVFQQDD SETVTVLAGI RARLGDAVRI DTTPGVSIPA RQFSSIFEGP EHARAPRIAV DDDAEIERAV NYARGADVAI VVLGEAQIMI GENASRSSLD LPGRQQQLLD AVLATATPTV VLLMSARPLD LRGSAPQALM TIWYPGSQGG AAVAGLLFGD VAPGGKLPFN WPRNIGQLPL PYARLNSHQP SSAEQRYWNE PNTPLYAFGY GLSYSSFSYA KLHIDRQKIT PAERMSVSVE LTNTGRRVAD EVAQLYIHQR YGASARPVRE LKGFQRVTLA PGETRTLRFT LGPEHLRYWT ASARAFVHDD SVFDVFVGGD SSASLSASFE VCKA
|
| |