Gene Sare_3626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3626 
Symbol 
ID5708173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4183114 
End bp4184841 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content74% 
IMG OID641273051 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001538415 
Protein GI159039162 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.124243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000631945 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCGACAC CCTCGCAACA CGGCGGCGTC GTAGTCGTCG CACTGGCCGT CCTGACCTCG 
CTGCTGCTCG CCGGCTGCAC CGGCGGGCGG GGGCGCGCAC AGCCGACTCC GGCGGCGAGT
GACCCGACGG CAGGCCCGTC ACCCTCGGTC CCGGTCACTG ATCCGGGCGC CCGCGCCGCC
ACCCTGGCGG GCTCGCTGGC TGACGAGGAC CTCGTCGGCC AGGTGCTGAT GCCCTTCGCC
TACGGCGACG CCGCCGATCG GGTCTCGTCC GGTTCGGCCG CCGGTAACCA GAAACTCGCC
GGCGTCGACA CTCCGGCCCA AATGATCGCG AAGTACCGCC TCGGCGGGCT CATTCTCGTC
GGCTTCAGCG CGGACGACCC GACCAGCGGC AATCAGGAGA CCACCAACGT CGACAACCCC
GACCAGGTCC GGGGGCTGAC CGCCGGGCTG CGGTCCGTCG CCGCCGACCT GGCCACCGGC
GAGGCGCCGT TCCTGATCGG CACGGATCAG GAGTACGGGG TGGTCACCCG GATCACCGAG
GGCGTTACGC AGCTGCCCAG CGCGTTGGCC GCCGGGGCGG CCGGCAAGCC TGACCTGACC
GAGGCCGCCT GGCAGGCCGC CGGCACCGAA CTCGCCGCGA TGGGCGTCAA CGTGGACTTC
GCCCCCGTCG CCGACGTGCT CGTCACGCCG AGCACCGTGA TCGGCTCACG GTCGTACGGT
GCCGACCCGT CGGCGGTGGC CGCACAGGTC AGCGGTGCGG TACGCGGCCT GCAGTCGGCC
GGTGTCGCGG CCACCCTCAA ACATTTCCCC GGCCACGGGC ACAGCGCCAC CGACTCCCAC
GAGGCACTGC CGAGGTTGGA ACAGCCCCGC GCCGTACTCG AGTTGGAGGC ATGGAGTCCC
TTCGCGGCCG GCATCGGGGC CGGTGCCCTC GCGGTGATGT CCGGGCACCT CGACGTCCGT
GCGGTCGACC CGGGGACCCC GGCGACGTTC TCGCACACCC TCCTCACCGA GGTGCTCCGC
GGTCAGCTCG GCTTCCAGGG CGTCGTGATC ACCGACGGGA TGAACATGGC GCCCGCCAAG
CGCTGGTCGC CCGGCGAGGC CGCGGTGCGT GCCCTCAAGG CCGGCAACGA CCTGATCCTG
ATGCCGCCGC ACGTCGGCCA GGCGTACGAC GGGCTGCTCG CCGCGCTGCG CGACGGCTCG
CTGCCCCGGA CCCGGCTGGT CGAGGCGGTG ACCCGCGTGT TGACCATGAA GTTCACCCTG
GCCGGTGCGG CCACCCCCGC GCTGGACGTC ATCGGTACGC CAGCCCACCT GGCGGCGGCC
ACCGAACTCG CCACCGCCGC GGTGACCGCA CTGCGTGGCC AGTGTGGCAG CCTGGTGTCC
GGGCCGATCA CCGTGACCGC CTCCGCCGGC CGGAAGCACA CCCGGGCGGT GCTGATCAAG
GAGCTGACCG CGGCCGGGGT GCCGGTGGTC GACACCGGCG GTGCCGTGGT CCACCTGGTC
GGCTACGGCG ACGGCACCGA CGACCTGAGC GCCGACGCCG CCGTGACCGT CGCCATGGAC
ACCCCGTACC TGCTGGCCGA GGCGGATTCC CCGGCGCTGC TGGCGACCTA CTCGTCGAGC
CCGGCGGCGA TGACCGGGCT GGCCCGGGTG CTGGCCGGTA CGGCCACCCC CGCCGGCCGT
TCGCCGGTGC CGGTGCCCGG CCTGCCCGCG ACGAGCTGCG GCAACTGA
 
Protein sequence
MPTPSQHGGV VVVALAVLTS LLLAGCTGGR GRAQPTPAAS DPTAGPSPSV PVTDPGARAA 
TLAGSLADED LVGQVLMPFA YGDAADRVSS GSAAGNQKLA GVDTPAQMIA KYRLGGLILV
GFSADDPTSG NQETTNVDNP DQVRGLTAGL RSVAADLATG EAPFLIGTDQ EYGVVTRITE
GVTQLPSALA AGAAGKPDLT EAAWQAAGTE LAAMGVNVDF APVADVLVTP STVIGSRSYG
ADPSAVAAQV SGAVRGLQSA GVAATLKHFP GHGHSATDSH EALPRLEQPR AVLELEAWSP
FAAGIGAGAL AVMSGHLDVR AVDPGTPATF SHTLLTEVLR GQLGFQGVVI TDGMNMAPAK
RWSPGEAAVR ALKAGNDLIL MPPHVGQAYD GLLAALRDGS LPRTRLVEAV TRVLTMKFTL
AGAATPALDV IGTPAHLAAA TELATAAVTA LRGQCGSLVS GPITVTASAG RKHTRAVLIK
ELTAAGVPVV DTGGAVVHLV GYGDGTDDLS ADAAVTVAMD TPYLLAEADS PALLATYSSS
PAAMTGLARV LAGTATPAGR SPVPVPGLPA TSCGN