Gene Sare_2600 details

Gene Information

Locus tag: Sare_2600
Symbol:
ID: 5707894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2963555
End bp: 2967370
Gene Length: 3816 bp
Protein Length: 1271 aa
Translation table: 11
GC content: 70%
IMG OID: 641272062
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001537432
Protein GI: 159038179
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
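
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither assumption is stated on this page, and the function name is illustrative only.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the gene length counts the stop codon.
    """
    span = end_bp - start_bp + 1          # bases covered by the coordinates
    codons = gene_length_bp // 3          # codons, including the stop codon
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0       # a CDS should be a whole number of codons
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields of this record.
print(check_gene_lengths(2963555, 2967370, 3816, 1271))
```

Under these assumptions the span is 2967370 - 2963555 + 1 = 3816 bp, which is 1272 codons, or 1271 amino acids plus one stop codon.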


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0793818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00832929
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGCATCCCG TACCTGATCG CAGGCGGGGC GGCTTCCGTA CCCGCACCGC TGCCCTGCTT 
GCCCTCACGC TCACCGCGTC CCTGCTGGTC ACACCCGGGG CCGCGCGAGC CCTGCCAGAT
CCGGTCACGG CCAGCAGCAC CGGCATCGAG CCGACACCGG CCGGCGAACG ACGGACGGTC
GAGGACTACC AGGACGGCGT CCCGCCCGAG GTTCTGCTCT TCGCCTCCAC CGAGGCCGAA
TGGCCCGACG TGGCCACCGT CGCCACCGGG GACCGTCCGG GCGCGGGCAG CGACGACGAC
GCTTTCTCCG TCCGGTACGA CATCGAGGGC TGGGGCGGAT TCACCCACAA CTTCATCGCC
CCGGACGGCC ACCAGGACTG GCGGGAGTAC GACGGGTTCT CGTTCTGGGT GAAGGGAGAC
GGCAGCGGCC GGCGGGTCCA GTTCGAGATC AAGGATGGCG GCGCGCACGG TGAGGCGTCG
GAGCTGTGGG AGTCGTTCTT CACTGACGAC TCCACCGGCT GGAAGCAGGT CCGGACGACC
TTCCCCGAGT TCGCCTGGCG GACCAGCTAC CAGCCGGGCG ACGGCCCGAA GGACCGGGAG
CTACAGCTCG ACCGGATGTG GGGCTTCGCG GTGAACCTCC CGCAGGGCGC CGGTGAGCTG
CGGTTCGACC AGGTGGAGCT GTTCACCAAC GTGGCGACGG TCGCCGACTT CGAAGCCCCG
CAGCCGACGC TCAACCCGCC GGCTGGGCAG CCCGGCATCA TCACCTTCGG CGGAGACCCG
TCCCGGGTTC CGGAACTGAG CTACGTCGAC GCGCCCCGCG ACGGCGTGCC CGCCGACAAC
CAGGCCCTGG CGGTTGCCGT CGACACCACC ACCAGTTGGG CCGGATTCGC CCACCACCCG
AGCTTCGACA CCGATCCGCA GGACTGGAGC CGATTCGGCG GATTCCGGTT CTGGTACTTC
AGCCCGCTGA CCGTGCCACC GGCCGCGCCC GGCGCCGGAC GGCCGATCGA CCTGGAGATC
AAGGACGGCG GGGTGGACGC CGAACACAGC GAGCTCTGGA CCACCAGCTT CACCGAGGAC
TGGGTCGGCT GGCGCCTGGT CGAGATCCCG TTCTCCCAGT TCAGGTACCG CACCGACGAC
CAACCCGTCG GCGGCATCAA CCAGGAACTG GACCTCGACC GAATGTGGGG CTACGCCATG
CAGCCCCGCT CCGGGTACGC CGACACCTTC CGCATCGACG ACGTCCAGGT ATACGGCGTC
CCGCAGGTGG GACCGACGGT ACGAGTGGAC ACCACCCCGG CGGTCAGCCT GCTCGACGAG
GGTGAGTCGG CCACCGTCAC GGTCCGGATC ACCAACACCG ACGACGCGCC GCTGGACAAC
GAGGTGACCC TGCGGTACGC CACCGGCACC GGCACGGCCA CCGTCGGCGA GGACTACCAG
CCGGTCGAGG GCGAATTCGT CTTCCCCGCC GGCACCGCCT CCGGAACCAC CCGGCAGATC
ACCGTCCAGA CGCTCTCCGA CGACGACGCG GAGACGGCGG AGAACATCCC GCTGACGCTG
TCCGGCAGCG GACTCGCCGT CGCCGAGGAC CTGCCGAACA TCGTGCTCAA CGCGCACGGC
CTGCCCTATT TGGACGAGAC CCGGCCGCTC GACGAGCGGG TCGCCGACCT GCTCGCACGG
ATGTCGGTCG AGGAGAAGGT CGGCCAGATG ACCCAGGCCG AACGTAACGC CCTCGAGTCC
CCCGACGACC TGGCCACCTG GCGGCTCGGC TCGCTGCTGT CCGGCGGCGG CTCGACGCCC
AACCCGAACA CTGCGGAGTC CTGGGCGGAC ATGGTCGACG GCTACCAGAC GCGCGCGCTG
CAGACCCGGT TGCAGATCCC GCTGGTCTAC GGCGTCGACG CGGTACACGG CCACAGCAAC
CTCCGAGGTG CCACGATCTT CCCGCACAAC ATCGGTCTCG GCGCGGCTCG TGACCCGGAA
CTGGTCGAGC GGGCGGGGCA CATCACCGCC AGGGAGACCC GGGCGACCGG TCCGCAGTGG
TCGTTCGCGC CCTGCGCCTG CGTGGCCCGC GACGACCGCT GGGGGCGCAC CTACGAGGCG
TACGGTGAGG ACCCGGCGCT GGTGGTCGCC AACGAGACGG TCATCGACGG GCTCCAGGGC
CACACGCTGG CCGACCGCAG GCACGCCGAC CGGGTACTCG CCTCGGTGAA GCACTACGCC
GGCGACGGCG GGACCGAGTA CCAGCCGGGT AACGGGGGGT ACCCGATCGA CCAGGGCGTC
GCCGTCATGA GCCGGGAAGA GTTCGACCGG GTCCACCTGG AGCCGTACAT CCCGGCGGTG
CGCGAACACC ACGCGGGAAC GATCATGCCG TCGTACTCCA GCGTCGACTT CACCGACGAC
GGGGTGGGCA ACCCGGTCAA GATGCACGCC CACAAGGAAC TCCTCACCGA TGTCCTGAAA
GAGGAGTTCG GCTTCGACGG CTTCCTGATC AGCGACTACG CCGCTATCGA CCAGATTCCC
GGGGACTACG CCAGCGACGT GCGCACCTCG ATCAACGCCG GCCTGGACAT GATCATGGTG
CCGAACGAGT ACCAACGATT CGAGGAGACG CTGCTCGGCG AGATCGAGGC CGGCAACGTG
TCGATGTCGC GCATCGACGA CGCGGTCAGT CGAATCCTGA CCCAGAAGTT CCACCTCGGG
CTCTTCGAGC AGCCGTTCAC CGACCGGACC CATTTGGCCG ACGTGGGCTC GCCCGAACAC
CGGGCGGTGG CCCGCGAGGC CGCCGCGAAG TCCCAGGTGC TGCTGCGCAA CGACGGTCAG
ATACTGCCGC TGGTCGCCAC CGGCAAGCTC TACGTCGCCG GCGACAACGC CGACGACATC
GGCGCACAGT CTGGTGGCTG GACGATCACC TGGCAGGGCG GCACTGGCGA CATCACACCC
GGCACCAGCA TCCTCGACGG CATTCGGCAG GTCGCGCCGG ACACCGAGGT CACCTACAGC
GCCGACGCCT CCGCACCGCT GGCCGGGCAC GACCGGGCCG TCGTCGTGGT CGGCGAGCGG
CCGTACGCGG AAGGCGTGGG CGACGTCGGC AACAACGGCT TCACCATGAC GGTGAGCGCC
GCCGAGCAGG ACATCGTCGA CAGGGTCTGC TCAACGGTGG ATGACTGTGT GGTGCTGGTG
GTGTCCGGTC GTCCGCTCGT GCTCGACGAC GCGCTCGCCC CAGCCGACGC CGTGGTGGCC
TCCTGGCTGC CCGGCACCGA GGGCGCCGGC GTCGCCGACG TACTCTTCGG CGAGCGGCCG
TTCACCGGTC GACTGCCGGT GACCTGGCCG CGTTCCCTGG CCCAGGAGCC GATCAACGTC
GGCGACACCA GCTACGACCC GCTGTACCCG TATGGCTGGG GCCTGCGTAC CGACCCGACC
CGGGACCGGC TGCGCGAGCT GCGTGCCGAG CTGGCGGAGG TCAGGCAGGA CGGCTGGACC
CGCGCCGCGG TGGCGTTGCT GGAGCGGTCG CTGCGCACCG AGAGCTCCTG GCATGCCGAC
GGCTCGGTCC GCGACGAACC TCAGCTGGTC AAGGACCTTA CGGTGATCTC CACCCTGATG
GCTTTCACCA GTCGAGACAG CGCGGCACAC CACGAATTGC TGGTGTCGTC GCTGCGAGAC
GTCGCGCAGG CGGCGATCGC GCGGGAGGGT GTCACCGCCA CCTCGGTCAC GCGCACCTCC
GCACTGACCG CCGACGCGGA GCACGCGCTC CTGACCGACA AACCGACCGT GGCGACGCTG
AAACTCGCCG CGGCCTGGCG GATCGCGGCT GACTGA
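
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be stripped before any calculation. As a minimal sketch (the function name is hypothetical and not part of any database tool), the GC fraction of such a block-formatted sequence could be computed like this:

```python
def gc_fraction(spaced_sequence):
    """Return (length, GC fraction) for a sequence printed in spaced blocks."""
    # Remove spaces and line breaks, then normalise case.
    seq = "".join(spaced_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return len(seq), gc / len(seq)


# First line of the gene sequence in this record, used as a small example.
first_line = "GTGCATCCCG TACCTGATCG CAGGCGGGGC GGCTTCCGTA CCCGCACCGC TGCCCTGCTT"
length, fraction = gc_fraction(first_line)
print(length, round(fraction, 2))
```

Passing the whole gene sequence instead of the first line gives the figure to compare with the GC content field.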
 
Protein sequence
MHPVPDRRRG GFRTRTAALL ALTLTASLLV TPGAARALPD PVTASSTGIE PTPAGERRTV 
EDYQDGVPPE VLLFASTEAE WPDVATVATG DRPGAGSDDD AFSVRYDIEG WGGFTHNFIA
PDGHQDWREY DGFSFWVKGD GSGRRVQFEI KDGGAHGEAS ELWESFFTDD STGWKQVRTT
FPEFAWRTSY QPGDGPKDRE LQLDRMWGFA VNLPQGAGEL RFDQVELFTN VATVADFEAP
QPTLNPPAGQ PGIITFGGDP SRVPELSYVD APRDGVPADN QALAVAVDTT TSWAGFAHHP
SFDTDPQDWS RFGGFRFWYF SPLTVPPAAP GAGRPIDLEI KDGGVDAEHS ELWTTSFTED
WVGWRLVEIP FSQFRYRTDD QPVGGINQEL DLDRMWGYAM QPRSGYADTF RIDDVQVYGV
PQVGPTVRVD TTPAVSLLDE GESATVTVRI TNTDDAPLDN EVTLRYATGT GTATVGEDYQ
PVEGEFVFPA GTASGTTRQI TVQTLSDDDA ETAENIPLTL SGSGLAVAED LPNIVLNAHG
LPYLDETRPL DERVADLLAR MSVEEKVGQM TQAERNALES PDDLATWRLG SLLSGGGSTP
NPNTAESWAD MVDGYQTRAL QTRLQIPLVY GVDAVHGHSN LRGATIFPHN IGLGAARDPE
LVERAGHITA RETRATGPQW SFAPCACVAR DDRWGRTYEA YGEDPALVVA NETVIDGLQG
HTLADRRHAD RVLASVKHYA GDGGTEYQPG NGGYPIDQGV AVMSREEFDR VHLEPYIPAV
REHHAGTIMP SYSSVDFTDD GVGNPVKMHA HKELLTDVLK EEFGFDGFLI SDYAAIDQIP
GDYASDVRTS INAGLDMIMV PNEYQRFEET LLGEIEAGNV SMSRIDDAVS RILTQKFHLG
LFEQPFTDRT HLADVGSPEH RAVAREAAAK SQVLLRNDGQ ILPLVATGKL YVAGDNADDI
GAQSGGWTIT WQGGTGDITP GTSILDGIRQ VAPDTEVTYS ADASAPLAGH DRAVVVVGER
PYAEGVGDVG NNGFTMTVSA AEQDIVDRVC STVDDCVVLV VSGRPLVLDD ALAPADAVVA
SWLPGTEGAG VADVLFGERP FTGRLPVTWP RSLAQEPINV GDTSYDPLYP YGWGLRTDPT
RDRLRELRAE LAEVRQDGWT RAAVALLERS LRTESSWHAD GSVRDEPQLV KDLTVISTLM
AFTSRDSAAH HELLVSSLRD VAQAAIAREG VTATSVTRTS ALTADAEHAL LTDKPTVATL
KLAAAWRIAA D
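
The protein sequence can be checked against the gene sequence by translating it. The sketch below is illustrative only: the function and constant names are hypothetical, the amino-acid assignments are those of NCBI translation table 11 (identical to the standard code), and the first codon is reported as M on the assumption that it is a valid start codon, which GTG is under table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(spaced_dna):
    """Translate a coding sequence printed in spaced blocks.

    Stops at the first stop codon. The first codon is emitted as 'M'
    (assumption: it is a start codon). Characters other than A, C, G, T
    raise KeyError.
    """
    dna = "".join(spaced_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate_cds("GTGCATCCCG TACCTGATCG CAGGCGGGGC"))
```

The first ten codons translate to MHPVPDRRRG, which matches the first block of the protein sequence.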