Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1654 |
Symbol | bglA |
ID | 5713219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1719861 |
End bp | 1721168 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641267570 |
Product | beta-glucosidase A |
Protein accession | YP_001532997 |
Protein GI | 159044203 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.484784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000345699 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTTTCG ATCGCAACAC CTATCCCGAC GGTTTTCTCT TCGGGGTCGC CACCTCGGCC TACCAGATCG AAGGTCACGG GCAAGGTGGT GCCGGGCGCA CCCATTGGGA CGATTTCTCC GCCACACCTG GCAACGTGGC GCGCGCCGAG CACGGCGCAC GCGCCTGCGG ACATCTGGAC CGGCTGGAAG AAGATCTCGA CCTGATTGCA GGGCTCGGCG TCGATGCCTA CCGCTTCTCG ACCAGTTGGG CACGGGTTCT GCCCGAGGGG CGCGGCGCGC CGAACATGGA GGGGCTCGAT TTCTACGACC GGCTGGTCGA CGGGTTGCTC GCGCGGGGTA TCAAACCGGC CGCCACGCTC TATCACTGGG AACTTCCCTC GGCGCTCGCC GATCTCGGCG GTTGGCGCAA CCGGGATATC GCGTCCTGGT TCGGCGATTT TACCGACACG ATCATGGATC GCATCGGCGA CCGGGTCTGG TCCGCCGCCC CGATCAACGA GCCTTGGTGC GTCGGCTGGC TCAGCCATTT CCAGGGTCAC CACGCTCCCG GCCTGCGCGA TATCCGTGCC ACTGCCCGCG CCATGCATCA CATCCTGCTC GCCCATGGCA CCGCCATTGC TCGGATGCGC GACATGGGGA TGCGCAACCT CGGGGCCGTG GTGAACATGG AGTACGCGCA ACCTTTGGAC GACAGCCCGA CCGCGATGGC CGCTGCCGAG CTTTACGACG CCATCTACAA CCAGTTCTTC CTGTCAGGTA TGTTCCACAA CACCTACCCG GAGCCTGTGC TCGCAGGACT TGCCCCGCAT CTGCCCGACA GATGGCAGGA TGATTTCGAC ACGATTGCCA CGCCGCTCGA CTGGGTCGGG CTGAACTATT ACACGCGCAA GATCATCGGC CCCGGTGACA GCCCCTGGCC CGCTTATCGC GAGATCGACG GGCCCCTTCC CAAGACCCAG ATGGGATGGG AGGTGTTTCC AGAAGGGCTG CATGCGCTGC TCACCATGAT GCAGGCGCGC TTTACGGGCG ACTTGCCGAT CTACATCACC GAGAACGGCA TGGCCTCGGC CCTCCCGGTC AACGACGCGG ACCGCCTTGC CTATCTCGAT GCGCATCTGG CGCAGGTCCG ACGCGCTATT GCGGACGGCG TTCCGGTGGA CGGCTATTTC ATCTGGTCGC TGATGGACAA TTACGAGTGG TCCTTCGGCT ACGAGAAACG CTTTGGTCTC GTGCATGTTG ATTTCGACAC GCTGGTGCGC ACGCCGAAAG CCTCTTACCG GGCGCTGGCA TCTGCGTTAA ATCGTTAG
|
Protein sequence | MSFDRNTYPD GFLFGVATSA YQIEGHGQGG AGRTHWDDFS ATPGNVARAE HGARACGHLD RLEEDLDLIA GLGVDAYRFS TSWARVLPEG RGAPNMEGLD FYDRLVDGLL ARGIKPAATL YHWELPSALA DLGGWRNRDI ASWFGDFTDT IMDRIGDRVW SAAPINEPWC VGWLSHFQGH HAPGLRDIRA TARAMHHILL AHGTAIARMR DMGMRNLGAV VNMEYAQPLD DSPTAMAAAE LYDAIYNQFF LSGMFHNTYP EPVLAGLAPH LPDRWQDDFD TIATPLDWVG LNYYTRKIIG PGDSPWPAYR EIDGPLPKTQ MGWEVFPEGL HALLTMMQAR FTGDLPIYIT ENGMASALPV NDADRLAYLD AHLAQVRRAI ADGVPVDGYF IWSLMDNYEW SFGYEKRFGL VHVDFDTLVR TPKASYRALA SALNR
|
| |