Gene Sare_1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1076 
Symbol 
ID5704344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1206680 
End bp1208464 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content72% 
IMG OID641270591 
Productglycoside hydrolase 15-related 
Protein accessionYP_001535975 
Protein GI159036722 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000337356 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGCCCGCA ACGAACGGAC GAGCCGATCC GAGGCACCCC ACTCACCGAA CGTGCTCCGG 
GAGTACGCGC TACTCGCCGA CGGCCAGCGG GCGGCGCTGG TCGGCCCGGA CGGGAACATC
GTGTGGCTGT GTGCCCCGCG GTGGGCCGAC CCGCCGCTGT TCAGCAACCT CCTCGGCGGT
CGTGGCAGCT ACCTGGTGAC CCCGACGAAC CGACGCTTCG TGTGGGGTGG GCAGTACCAG
CCCGAGTCGT TGATCTGGAT CAACAGGTGG GTGACCACCG ACGGAATCAT CGAAACCCGA
GAGGCCCTGG CATTCCCCGG CGACGACCAG CGGGTGGTCC TGCTGCGCCA GGTCCACGCC
CTGGACCAGG ACGCCGCTGT TCGCGTGCAG CTCGACCCCA GAGCCGACTT CGGCCGAGAG
CCGATCCGTC AGGTGCGGCA GGATGGGGGG CTTTGGTGCG CCCAGACCGG GAACCTGTAC
CTGCGGCACA GCAGCGGGCA GCCGCTGGGC ACCGGCGCCG GGGGGCTGCT CTGCGGCGAG
TTGCGGGTGC CCGCTGGAGG GCGGGCCGAC CTGGTGCTGG AAATCTCCAC CCGGCCGCTC
GACGACGAGC CACCCCAACC ACCCGAGCTG TGGCGGATCA CGGAGAAATC CTGGCAGAGA
GTGCTGCACC CGCTGTCGCG CGGCACCGCG GGACGAGACG CGGTGTTCGC CTACACGGTG
CTGCGCGGGC TGACCCGGCC CGGCGGAGGA ATGGTGGCTG CGGTGACCGC GGGACTGCCG
GAGCGGGCGC TCGGCGGCCG CAACTACGAC TACCGCTACG CCTGGATCCG CGACCAGGCG
TTCGCCGGAC AGGCCGCCGC CCTGATCGGC CGGCACGAGC TCCTCGACGA CGCGGTAGCG
TTCCTCACCG ACCGGGTCCT CGCCGAGGGC GACCGGCTTG CCCCCGCCTA CACGATCGAC
GGCGGCCCGG TACCCCCGGA ACAGGAGCTG ACGTTCCTGC CCGGATACCC GGGCGCCAAG
GCGCGGACCG GCAACTGGGT GGGTGGGCAG TTCCAGTTGG ACGCCTACGG CGAGGTGCTG
TTGGTGCTGG CGACCGCCGC GAGCCACGGA CGGCTGGAAG CCACCTCCTG GCAGGCGCTG
ACGCTCGCCG CCCAGGTCAT CGAGGAACGC TGGCAACAAG CCGACTCGGG TATCTGGGAG
CTCCCTGCCC GACAGTGGAC CCACTCGAAG CTGACCTGTG TGGCCGGGCT ACGGGCTGCG
GCACGGATCG CGCCGGGCGG GCTGGCCGGC CGCTGGGTGG CCCTCGCCGA CACCATCGTC
GCCGACACCG CGGCCCACGC CCTCCACCCC TCCGGGCGCT GGCAACGCGC CTACGACGAC
CCGCGGGTCG ACTCGGCGTT GCTGCTGCCG GGAATCCGAG GTGCCCTGCC AGAGGGTGAT
CCACGTACCG AGGCGACCCG CCGCGCTGTC CTGGCCGAGC TGCAGCAGGA CGGCTACCTG
TACCGGTTTC GGCCGGATCG TCGCCCGCTC GGGGACGCCG AGGGGGCCTT CCTGCTCTGT
GGGTTCGCCG CGGCGCTCGC CGAGTGGCAG GCCGGCGACG CCGTCGCCGC GAACCGGTGG
TTCGAACGCA ACCGGGCGGG GTGCGGCCCG CCGGGGCTGT TCACCGAGGA GTTCGACGTG
GCACAGCGGC AGCTACGGGG CAACCTGCCG CAGGCGTTCG TGCACGCGCT GATGCTCGAA
ACCGCGGTGC GCCTCGGCCA CGCCGCCCCC TGCACGGCCG ACTAG
 
Protein sequence
MARNERTSRS EAPHSPNVLR EYALLADGQR AALVGPDGNI VWLCAPRWAD PPLFSNLLGG 
RGSYLVTPTN RRFVWGGQYQ PESLIWINRW VTTDGIIETR EALAFPGDDQ RVVLLRQVHA
LDQDAAVRVQ LDPRADFGRE PIRQVRQDGG LWCAQTGNLY LRHSSGQPLG TGAGGLLCGE
LRVPAGGRAD LVLEISTRPL DDEPPQPPEL WRITEKSWQR VLHPLSRGTA GRDAVFAYTV
LRGLTRPGGG MVAAVTAGLP ERALGGRNYD YRYAWIRDQA FAGQAAALIG RHELLDDAVA
FLTDRVLAEG DRLAPAYTID GGPVPPEQEL TFLPGYPGAK ARTGNWVGGQ FQLDAYGEVL
LVLATAASHG RLEATSWQAL TLAAQVIEER WQQADSGIWE LPARQWTHSK LTCVAGLRAA
ARIAPGGLAG RWVALADTIV ADTAAHALHP SGRWQRAYDD PRVDSALLLP GIRGALPEGD
PRTEATRRAV LAELQQDGYL YRFRPDRRPL GDAEGAFLLC GFAAALAEWQ AGDAVAANRW
FERNRAGCGP PGLFTEEFDV AQRQLRGNLP QAFVHALMLE TAVRLGHAAP CTAD