Gene Dshi_1654 details


Gene Information

Locus tag: Dshi_1654
Symbol: bglA
ID: 5713219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1719861
End bp: 1721168
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 63%
IMG OID: 641267570
Product: beta-glucosidase A
Protein accession: YP_001532997
Protein GI: 159044203
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
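
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; both are assumptions, not statements from the record. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1719861
END_BP = 1721168
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the length includes exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # compare with Gene Length
print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under those assumptions the span equals the listed gene length and the codon count equals the listed protein length.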


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.484784
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0000345699
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTTTCG ATCGCAACAC CTATCCCGAC GGTTTTCTCT TCGGGGTCGC CACCTCGGCC 
TACCAGATCG AAGGTCACGG GCAAGGTGGT GCCGGGCGCA CCCATTGGGA CGATTTCTCC
GCCACACCTG GCAACGTGGC GCGCGCCGAG CACGGCGCAC GCGCCTGCGG ACATCTGGAC
CGGCTGGAAG AAGATCTCGA CCTGATTGCA GGGCTCGGCG TCGATGCCTA CCGCTTCTCG
ACCAGTTGGG CACGGGTTCT GCCCGAGGGG CGCGGCGCGC CGAACATGGA GGGGCTCGAT
TTCTACGACC GGCTGGTCGA CGGGTTGCTC GCGCGGGGTA TCAAACCGGC CGCCACGCTC
TATCACTGGG AACTTCCCTC GGCGCTCGCC GATCTCGGCG GTTGGCGCAA CCGGGATATC
GCGTCCTGGT TCGGCGATTT TACCGACACG ATCATGGATC GCATCGGCGA CCGGGTCTGG
TCCGCCGCCC CGATCAACGA GCCTTGGTGC GTCGGCTGGC TCAGCCATTT CCAGGGTCAC
CACGCTCCCG GCCTGCGCGA TATCCGTGCC ACTGCCCGCG CCATGCATCA CATCCTGCTC
GCCCATGGCA CCGCCATTGC TCGGATGCGC GACATGGGGA TGCGCAACCT CGGGGCCGTG
GTGAACATGG AGTACGCGCA ACCTTTGGAC GACAGCCCGA CCGCGATGGC CGCTGCCGAG
CTTTACGACG CCATCTACAA CCAGTTCTTC CTGTCAGGTA TGTTCCACAA CACCTACCCG
GAGCCTGTGC TCGCAGGACT TGCCCCGCAT CTGCCCGACA GATGGCAGGA TGATTTCGAC
ACGATTGCCA CGCCGCTCGA CTGGGTCGGG CTGAACTATT ACACGCGCAA GATCATCGGC
CCCGGTGACA GCCCCTGGCC CGCTTATCGC GAGATCGACG GGCCCCTTCC CAAGACCCAG
ATGGGATGGG AGGTGTTTCC AGAAGGGCTG CATGCGCTGC TCACCATGAT GCAGGCGCGC
TTTACGGGCG ACTTGCCGAT CTACATCACC GAGAACGGCA TGGCCTCGGC CCTCCCGGTC
AACGACGCGG ACCGCCTTGC CTATCTCGAT GCGCATCTGG CGCAGGTCCG ACGCGCTATT
GCGGACGGCG TTCCGGTGGA CGGCTATTTC ATCTGGTCGC TGATGGACAA TTACGAGTGG
TCCTTCGGCT ACGAGAAACG CTTTGGTCTC GTGCATGTTG ATTTCGACAC GCTGGTGCGC
ACGCCGAAAG CCTCTTACCG GGCGCTGGCA TCTGCGTTAA ATCGTTAG
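
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is measured or analysed. A minimal sketch of that clean-up plus a GC-fraction calculation follows; the helper names are hypothetical, and only the first row of the sequence is embedded to keep the example short. Pasting all rows into the same functions would give the figure to compare with the GC content listed for the gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as printed (six blocks of ten bases).
FIRST_ROW = "ATGAGTTTCG ATCGCAACAC CTATCCCGAC GGTTTTCTCT TCGGGGTCGC CACCTCGGCC"

first_row_clean = clean_sequence(FIRST_ROW)
print(len(first_row_clean))
print(round(gc_fraction(first_row_clean), 3))
```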
 
Protein sequence
MSFDRNTYPD GFLFGVATSA YQIEGHGQGG AGRTHWDDFS ATPGNVARAE HGARACGHLD 
RLEEDLDLIA GLGVDAYRFS TSWARVLPEG RGAPNMEGLD FYDRLVDGLL ARGIKPAATL
YHWELPSALA DLGGWRNRDI ASWFGDFTDT IMDRIGDRVW SAAPINEPWC VGWLSHFQGH
HAPGLRDIRA TARAMHHILL AHGTAIARMR DMGMRNLGAV VNMEYAQPLD DSPTAMAAAE
LYDAIYNQFF LSGMFHNTYP EPVLAGLAPH LPDRWQDDFD TIATPLDWVG LNYYTRKIIG
PGDSPWPAYR EIDGPLPKTQ MGWEVFPEGL HALLTMMQAR FTGDLPIYIT ENGMASALPV
NDADRLAYLD AHLAQVRRAI ADGVPVDGYF IWSLMDNYEW SFGYEKRFGL VHVDFDTLVR
TPKASYRALA SALNR
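
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; the two tables differ only in which codons may act as start codons, and that does not matter here because the gene begins with ATG. The `translate` helper is hypothetical and does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame DNA sequence, ignoring whitespace."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and the final row of the gene sequence, copied as printed.
FIRST_30 = "ATGAGTTTCG ATCGCAACAC CTATCCCGAC"
LAST_ROW = "ACGCCGAAAG CCTCTTACCG GGCGCTGGCA TCTGCGTTAA ATCGTTAG"

print(translate(FIRST_30))
print(translate(LAST_ROW))
```

The first call reproduces the opening block of the protein sequence, and the second reproduces its final residues followed by the stop codon.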