Gene Information |
Locus tag | Rsph17029_1520 |
Symbol | |
ID | 4897834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1590131 |
End bp | 1591453 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640112110 |
Product | Beta-glucosidase |
Protein accession | YP_001043402 |
Protein GI | 126462288 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
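The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1590131, 1591453)
print(length)                     # 1323
print(protein_length_aa(length))  # 440
```

Under these assumptions the computed values agree with the Gene Length (1323 bp) and Protein Length (440 aa) fields.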
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0705421 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTTT CCCGCGCCGA CTTCCCCGCC GATTTCCTGT TCGGGGTGGC CACCTCGGCC TACCAGATCG AGGGCCACGG CGCGGGGGGC GCAGGACGCA CCCACTGGGA CGATTTCGCC GCCACCCCCG GCAACGTGGC TCATGGCGAG GATGGCCGCC GCGCCTGCGA CCATTACCAC CGGTGGGAGG AGGATCTCGA TCTCGTGCGC GATGCGGGCT TCGACAGCTA CCGCTTCTCG GCCTCCTGGG CGCGGGTAAT GCCCGAGGGC CGCGGCACGG TGAATGCCGA GGGACTCGAC TTCTACGACC GTCTCGTCGA CGGCATGCTC GCCCGCGGCC TTAAGCCCGC CCTCACGCTC TACCACTGGG AGCTGCCCTC GGCGCTGCAG GATCTGGGCG GCTGGCGCAA CCGCGACATC GCAGGCTGGT TCGCCGATTT TGCCGAGGTG CTGCTCGGGC GCATCGGCGA CCGGGTCTGG TCCACCGCGC CCGTGAACGA GCCCTGGTGC GTGGCCTGGC TGTCGCACTT CCTCGGCCAT CATGCGCCGG GACTGCGCGA CATCCGCGCC GCGGCCCGCG CTATGCACCA TGTGCTCCTC GCCCATGGCG CCGCCGTCGA GGCCGCGCGC GGGCTCGGCG TGGGCAATCT CGGCGCGGTC TGCAACTTCG AACATGCGAT CCCCGCCGAC GGCAGCGAGG CTTCGGCCGC AGCGACCCGC CGGCACGACG CCCTGATCAA CCGCTGGTTC GTCTCGGCCC TCTTCAACCG CCAGTATCCC GAGGAGGCTC TGGACGGGAT CGCGCCGCAC CTGCCCAGCG GATGGGAGAA GGACCGCGAC CGCATCGCCC AGCCGCTCGA CTGGTTCGGT ATCAACTACT ACACCCGCAA GCTGGTGGCG GCCGCACCCG GCCCCTGGCC GGGCCTGTCC GAGGTGGAGG GCCCCCTGCC GCGCACCCGG ATCGGCTGGG AAATCCATCC AGAGGGCCTG AGCGACATCC TGCTCCGCAT TCACGAGGGC TACACCCGCG GTCTGCCGCT CATCGTGACC GAGAACGGCA TGGCCGCCGC CGACCGGGTT CAGGCGGGCG AGGTGCAGGA CCCCGACCGC ATCGCCTATC TCGAGGGCCA TCTCGCCGCG GTGCGCACCG CTCTCGCGCA GGGCGTGCCG GTCCGGGGCT ACCATGTCTG GTCGCTTCTC GACAATTTCG AGTGGGCCTT CGGCTACGAC CAGCGCTTCG GTCTGGTTCA TGTCGACTTC CAGAACTTGC AGCGCACCCC GAAAGCATCC TATCACGCCC TCGCCCGCGC GCTGGCGCGG TAA
|
Protein sequence | MTFSRADFPA DFLFGVATSA YQIEGHGAGG AGRTHWDDFA ATPGNVAHGE DGRRACDHYH RWEEDLDLVR DAGFDSYRFS ASWARVMPEG RGTVNAEGLD FYDRLVDGML ARGLKPALTL YHWELPSALQ DLGGWRNRDI AGWFADFAEV LLGRIGDRVW STAPVNEPWC VAWLSHFLGH HAPGLRDIRA AARAMHHVLL AHGAAVEAAR GLGVGNLGAV CNFEHAIPAD GSEASAAATR RHDALINRWF VSALFNRQYP EEALDGIAPH LPSGWEKDRD RIAQPLDWFG INYYTRKLVA AAPGPWPGLS EVEGPLPRTR IGWEIHPEGL SDILLRIHEG YTRGLPLIVT ENGMAAADRV QAGEVQDPDR IAYLEGHLAA VRTALAQGVP VRGYHVWSLL DNFEWAFGYD QRFGLVHVDF QNLQRTPKAS YHALARALAR
|
| |
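The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG), and it strips the spaces that group the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 nucleotides of the record are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "ATGACCTTTT CCCGCGCCGA CTTCCCCGCC"
print(translate(prefix))  # MTFSRADFPA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would give the corresponding whole-gene values for comparison with the Protein sequence and GC content fields.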