Gene Sare_1755 details


Gene Information

Locus tag: Sare_1755
Symbol:
ID: 5705388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2027229
End bp: 2028728
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 75%
IMG OID: 641271258
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001536633
Protein GI: 159037380
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
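
The coordinates and lengths listed in this record agree with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the gene record.
START_BP = 2027229
END_BP = 2028728

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = END_BP - START_BP + 1

# A CDS is read in triplets; the last codon is the stop and encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)  # 1500 500 499
```

Under these assumptions the result matches the reported gene length of 1500 bp and protein length of 499 aa.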


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.438991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0371464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCATGC CCGAAGGCAA CCTCCAGTCC CTGGCCGCCA CCGTTCTCCA ACCCGGCTTC 
GTCGGTACCA CCGCCCCGAC CTGGGTGCGC CGCTGGCTGG GCGACGGGCT CGGCGCGGTG
GTGCTCTTCG CCCGCAACGT GGTCGACTCC GACCAGGTCG CCGCGCTGAC CGCGACGTTG
CGCGCGGAAC GACCGGACGT CATCGTCGCC ATCGACGAGG AGGCCGGCGA CGTGACCCGG
ATCGAGTCGG GGCTGGGCAG CTCCCGGCCC GGCAACCTGG CGCTCGGCGT GGTCGACGAT
CCGGCCCTGA CCGAGGCAGT GGCCAGCGAC CTCGGCGCGG AGCTGGCGGC GCTCGGGATC
ACCCTGAACT ACGCGCCGGA CGCCGACGTC AACTCCGATC CGGACAACCC GGTGATCGGC
GTCCGTTCCT TCGGTGCCGA CCCGGCCCGT GTCGCCCGGC ACACCGCCGC CTGGGTGCGG
GGCCTGCAGG CCAGCGGCGT CGCGGCCTGC GCCAAGCACT TTCCCGGGCA CGGTGACACC
CGGATCGACT CCCACCACGA CCTGCCCCGG ATCGTCGGCG ACCGAACCCG GCTGGATGCC
GTGGAATTGG CACCGTTCCG CGCCGCGTTG TCCGCCGGCG TGCAGGCGGT GATGAGCGGC
CACCTGCTCG TACCCGTACT GGATCCGGAT CTGCCGGCCA GCCTGAGCCG CCGGATCCTC
ACCGGCCTGC TCCGCGACGA GTTGGGATTC GCCGGGGTCG TGGTGACCGA CGCGGTGGAG
ATGCGCGCGG TCGCCGACCG CTACGGCTTC GCGGGTGCCG CGGTGCGTGC CCTGGCCGCG
GGCGCCGACG CCATCTGCGT TGGCGGCGAG CGCGCCGACG AGGACGCGGC CCAGCAGCTA
CGGGACGCGA TCGTGGCCGC TGTCGTGGCC GGGGAACTGC CCGAGGAACG GCTCGTCGAG
GCAGCCAAAC GGGTCAGCCT GCTCGCCTCC TGGACCGCCG CCAGCCGCGG GGCCCGGCCG
GCGCGGCAGC CGGCACCCGG CGGTGGCTCG GCCGTCGGAT TCGCCGCCGC CCGGCGGGCC
GTCCGGATCA CGACGGGCGG TGCCGGGCGG GGGACGCTGC CCCTGACCGG CCCCGCCCAC
GTGGTGGAGT TCGAGTCCCC CCGGAACATC GCGATCGGCG CGGAGACACC GTGGGGCGTC
GCGGCACCGC TGGCCGAGCT GCTGCCGGGC ACCACCGCTG TCCGGTATGC CGAGGACGAC
GCGCCCACCG ATCCCGTCGC CGGAGCGCAC GGTCGCCACG TCGTCCTCGT CGTTCGGGAC
CTGCACCGCC ACCCGTTGGT GCGGGCGGCC GTGACGCGTG CCCTGGCCGC CCGCCCGGAC
GCCGTGGTCG TCGAGCTGGG TGTGCCCGAA CTCGTCACCG GGGCGGTGCA CGTGGCGACC
CACGGTGCGA CCCGTGCCAG CAGCCGGGCC GCGGCGGAGG TCCTGACCGG GGCCGGCTGA
 
Protein sequence
MTMPEGNLQS LAATVLQPGF VGTTAPTWVR RWLGDGLGAV VLFARNVVDS DQVAALTATL 
RAERPDVIVA IDEEAGDVTR IESGLGSSRP GNLALGVVDD PALTEAVASD LGAELAALGI
TLNYAPDADV NSDPDNPVIG VRSFGADPAR VARHTAAWVR GLQASGVAAC AKHFPGHGDT
RIDSHHDLPR IVGDRTRLDA VELAPFRAAL SAGVQAVMSG HLLVPVLDPD LPASLSRRIL
TGLLRDELGF AGVVVTDAVE MRAVADRYGF AGAAVRALAA GADAICVGGE RADEDAAQQL
RDAIVAAVVA GELPEERLVE AAKRVSLLAS WTAASRGARP ARQPAPGGGS AVGFAAARRA
VRITTGGAGR GTLPLTGPAH VVEFESPRNI AIGAETPWGV AAPLAELLPG TTAVRYAEDD
APTDPVAGAH GRHVVLVVRD LHRHPLVRAA VTRALAARPD AVVVELGVPE LVTGAVHVAT
HGATRASSRA AAEVLTGAG
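
As a quick consistency check between the two sequences, the first few codons of the gene can be translated and compared with the start of the protein. The sketch below is illustrative only: `PARTIAL_CODON_TABLE` and `translate_prefix` are hypothetical names, and the table holds just the codons needed for this prefix, not the full translation table 11. It relies on the fact that table 11 permits GTG as a start codon, read as methionine when it initiates translation.

```python
# Only the codons that occur in the first 18 bases of the gene sequence;
# this is a deliberately partial table, not the complete table 11.
PARTIAL_CODON_TABLE = {
    "GTG": "V", "ACC": "T", "ATG": "M",
    "CCC": "P", "GAA": "E", "GGC": "G",
}
# Table 11 start codons relevant here (the full table lists more).
START_CODONS = {"ATG", "GTG"}


def translate_prefix(blocked_dna: str) -> str:
    """Translate a space-blocked DNA prefix, treating codon 1 as the start."""
    dna = "".join(blocked_dna.split()).upper()  # drop the 10-base block spacing
    usable = len(dna) - len(dna) % 3            # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating GTG is read as Met
        else:
            residues.append(PARTIAL_CODON_TABLE[codon])
    return "".join(residues)


prefix = "GTGACCATGC CCGAAGGC"  # first 18 bases of the gene sequence
print(translate_prefix(prefix))  # MTMPEG
```

The output matches the first six residues of the protein sequence (MTMPEG). A full check would need the complete codon table, for example from an established bioinformatics library.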