Gene Sare_0636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0636 
Symbol 
ID5708000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp716685 
End bp718475 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content69% 
IMG OID641270157 
Productglycoside hydrolase 15-related 
Protein accessionYP_001535550 
Protein GI159036297 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.252902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGAGA TCAGTGACTA TGCGTTTCTC GGCGACTGCC AGGGCGCGGC GTTGGTGTCG 
TCGGACGGGT CCGTGGACTG GTGGTGCCCG CCGCGGTTCG ACGCGCCGAG TGTCTTCGCG
CGTCTGCTCG GATCCGCGGG CGGCCACTGG ACGATTCGCC CGCGCGGGGA GTACACGACG
TCACGCCGCT ATCTCGACGG GACGATGGTC GTGCAGACCG AGTTCCACAC CCAGGACGGG
GTGCTGCGGC TGACGGATGC CCTCAGCCTT GGAGTCGGGG AGCGCGGTCA CGGGATCGGT
CTCACCTCCC CGCACGTCCT CCTGCGCCGG GTCGAAGCGG TCGTCGGCAC GGCGCAGGTG
ACGGTCGAGG TCGCGGCCCG GCCGGAGTAC GGCCTGGTCA GCCCCACCGT TGCCGACGCC
ACGACGGGGA TCGAGATCTC CGGAGGCGCC GACCGACTGA CCCTCACCGG CGACCGTGAT
CTGACCATCG ACGGTTCACT GGTGACCGGA GAATTCACCC TGGCCGAGGG TGACAGCGCG
GTGTTCGCGC TGCACCACCG ACGGGCTGCC GATCCACCGA CGGTTCCTCT CGACGGGCGT
GCGGCGCTGC GGGACACCAT CGCCGCCTGG CAGTCATGGG ACCGTATCCA CCAGGGATAC
CAGGGACCCT ACCGGGAGCA GGTCCGCCGC AGCGCCCTCG TCCTCCAGGC GTTGACCTAC
CAGCCGACCG GAGCCGTGAT CGCCGCTGCC ACCACATCCC TGCCCGAGGA GGTCGGCGGC
GAGGCGAACT GGGACTATCG ATTCGGCTGG CTCCGCGACG GCGCCTTCAC CCTGAAGGCA
CTCTGGGTCG CCGCATGCCC GGACGAAGCA CACCGGTTCT TCGACTGGAT GGCCGAGTCG
GTGGGCTCGG TGGACGCGGA GGACCATGTC CCGATCATGT TCGGTGCGGC TGGTGAGCGT
GACCTCACCG AGCACACCCT CGAGCACCTT GAGGGATACC GCGACAGCCG CCCGGTACGG
ATCGGCAACG ATGCCTGGCG GCAGAAGCAA CTCGACGTCA TCGGCGAGGT CCTCGAGTGC
GCCTGGGTCC TGCGGGAGCA GCTCACCGAC CTGTCGCCCG CCGCCGCCGG CCTGCTGCGG
TCCCTGGCCG ACCGGGCTGT GGACACCTGG CAGGAGCCGG ACGCCGGTAT CTGGGAAGGC
CGCGAGGGCC AGCGGCACTA CCTGACCTCG AAGGTGATGT GCTGGCTGGC CCTCGACCGG
GCCGTCAAAC TGGCACAGCT CCTGGACGCC GAGGCGAAGG TCCCGCAGTG GTCGGAGGCG
ATGACCCAGG TCCGCGCGGC GATCCTCACC GAGGGGTGGA GTACCTCGAG CAAGGCGTTC
ACCGGCGCGT TCGGCTCCGA CCACCTGGAC GCGGGGGTCC TGGTCATGCC CATCCTCGGG
TTCCTCCCCG CCGACGACGA GCGAGTACTT GCCACAGTGG ATGCCATCGA ACGAGACCTG
GTCCAGGCCG GCCTCGTGCA ACGCTGGACC AGGGCAGGTG ACGAGGGCGC CTTCATCATC
TGCTCGTACT GGTTGGTCCA GGCCCTCGCC CTGGCCGGGC GCATCGATCA GGCTCGGCAG
GTCTTCGACA CCGTCACCGC ACGGGCGAAC GACCTCGGCC TGCTCGCCGA GGAGATCGAC
CGACGCGAGG GCAGCCTGAT CGGAAACTTC CCCCAGGCGT TGTCCCACAT CGGCCTGATC
AACGCCGCGT GGACCATCGG CCAGGTCGAG GCCGCAGCAA CCAGATCGTA G
 
Protein sequence
MAEISDYAFL GDCQGAALVS SDGSVDWWCP PRFDAPSVFA RLLGSAGGHW TIRPRGEYTT 
SRRYLDGTMV VQTEFHTQDG VLRLTDALSL GVGERGHGIG LTSPHVLLRR VEAVVGTAQV
TVEVAARPEY GLVSPTVADA TTGIEISGGA DRLTLTGDRD LTIDGSLVTG EFTLAEGDSA
VFALHHRRAA DPPTVPLDGR AALRDTIAAW QSWDRIHQGY QGPYREQVRR SALVLQALTY
QPTGAVIAAA TTSLPEEVGG EANWDYRFGW LRDGAFTLKA LWVAACPDEA HRFFDWMAES
VGSVDAEDHV PIMFGAAGER DLTEHTLEHL EGYRDSRPVR IGNDAWRQKQ LDVIGEVLEC
AWVLREQLTD LSPAAAGLLR SLADRAVDTW QEPDAGIWEG REGQRHYLTS KVMCWLALDR
AVKLAQLLDA EAKVPQWSEA MTQVRAAILT EGWSTSSKAF TGAFGSDHLD AGVLVMPILG
FLPADDERVL ATVDAIERDL VQAGLVQRWT RAGDEGAFII CSYWLVQALA LAGRIDQARQ
VFDTVTARAN DLGLLAEEID RREGSLIGNF PQALSHIGLI NAAWTIGQVE AAATRS