Gene Sare_2637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2637 
Symbol 
ID5706900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3002567 
End bp3004405 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content68% 
IMG OID641272097 
Productglycoside hydrolase 15-related 
Protein accessionYP_001537467 
Protein GI159038214 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.872982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.317086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGT ACCCGGCCAT CGAGGATCAC GGCCTCATCG GTGACCTGCA GACCGCGGCC 
CTGATCACCC GAAACGGCAC GATCGACTGG TTCTGCGCGC CCCGGTTCGA CTCCCCGAGT
ATCTTCGCGG GTCTGCTCGA CCGCCGCCGC GGCGGCTACT TCTCCGTCGT CCCGGACGGC
GTGCAGTACA TCAGCAAGCA GCTCTACCTG CCGAACACCC CGATCCTGAT CACCAGGTTC
ATCAGCGCCG ACGGGGTCAG CGAAATCATC GACTTCATGC CGGTCACCGG CGAACACCCG
ACCGACCAGC ACCGAATCGT CCGGATCATC AACATGGTGC GCGGCACCAT GCGGTTCCGC
CTCGAGTGCT GCCCACGGTT CGACTACGGC CGGCAACGGC ACGTGTTGCG GGTGCACCGC
GACGGCAGCG TCTTCCACGC CCCGTCAGGC ACCCTGACCA TCCACACCTC CCGCCGGGGC
GGGCTGCTGG TTCCGGCGGA CAACCTGCGC GACGTCAACG GGGACCTTCT CGCCTTCGCG
ACCCTGCACG AGGGCGACAC CGGCGCGCTG GTGGCGGAGA CGGCCGCTCC CGGCCACCCC
CGACTCTTCC CCGTCGAGGA GGCCCAGGAG CTGTTCGAGG CAACGCGCGA ATACTGGCGC
CGCTGGCTCG ACCGGTCCCG GTACACCGGC CGGTGGCGGG AGATGGTCGA ACGCTCGGCG
ATCACGCTGA AACTGATGAC CTACGCGCCA ACCGGGGCCA TGATCGCCGC GCCGACCGCC
GCGCTGCCCG AGGAGATCGG CGGCACCCGC AACTGGGACT ACCGCTACAC CTGGATCCGG
GACACCTCGT TCTCGGTGCA CGCCCTGCTC GGCCTGGGCT TCACCGAGGA GGCCGCTCGG
TACACGAACT GGCTCGACGA GCGCATCCGC GAAGCCGGCG ACCACGCGAC CCCCTTGAAG
ATCATGTACC GGGTGGACGG CTCCTCCGAC CTACACGAGG AGATCCTCGG CCACCTGGAG
GGCTACCTGG GGTCGGGGCC GGTCCGGATC GGCAACGGCG CCGCCGACCA GCTCCAACTC
GACATCCACG GCGAGGCCCT GGACGCGATG CTCTTCGCCG ACCAGCAGGG CATCCGGGTC
ACGCACCAGG TGTGGCGCAG CACCGTGCGG ATGGTCGACT GGCTCTGCGA CAACTGGAAC
CAGCCCGACG CCGGCATCTG GGAGAGCCGC ACCCATCCGC GCGACTACAC GTTCGGCCGG
CTCATGTCCT GGGTGGCGAT GGACCGGGCG ATAAGACTGT CCGCCCGCCA CGGCCGTCCC
GGCGACATCG CGCGATGGGA ACGCGAACGC GACGCCATCT ACAACCAGAT CATGGCACGG
GGTTACGACC GGACCCGTGG CTCCTTCGTG CAGGCGTACG GCGAGCGGGT ACTCGACGCC
GCGCTGCTGA TCATGCCGTC GGTCGGGTTC GTCACCCCGA ACGACCCGCT GTGGCAGTCC
ACCCTGCGTG CGATCGACGA GGATCTCGTC TCGGACAGCC TGGTCCACCG GTACAAGCCC
TCGGAGGCAC CGGACGGCCT TCCCGGCGAC GAGGGCACCT TCAACATGTG CACCTTCTGG
TATGTCGCCG CCCTCGCCTG CTCGGGACGG TTGGAGGACG CCCTGCTCAC CTTCGAGAAG
ATGTTCACCT TCAGCAACCA CCTTCGGCTC TACGCCGAGG AGATCGCCCT CACCGGGCAG
CAGATCGGCA ACTTCCCGCA GGCGTTCAGC CACCTCTCCC TGATCACCAC CGCCCTGGCG
CTCGACGGCA TGCTCGACAA CAGTGACCGG GGGCGGTGA
 
Protein sequence
MPEYPAIEDH GLIGDLQTAA LITRNGTIDW FCAPRFDSPS IFAGLLDRRR GGYFSVVPDG 
VQYISKQLYL PNTPILITRF ISADGVSEII DFMPVTGEHP TDQHRIVRII NMVRGTMRFR
LECCPRFDYG RQRHVLRVHR DGSVFHAPSG TLTIHTSRRG GLLVPADNLR DVNGDLLAFA
TLHEGDTGAL VAETAAPGHP RLFPVEEAQE LFEATREYWR RWLDRSRYTG RWREMVERSA
ITLKLMTYAP TGAMIAAPTA ALPEEIGGTR NWDYRYTWIR DTSFSVHALL GLGFTEEAAR
YTNWLDERIR EAGDHATPLK IMYRVDGSSD LHEEILGHLE GYLGSGPVRI GNGAADQLQL
DIHGEALDAM LFADQQGIRV THQVWRSTVR MVDWLCDNWN QPDAGIWESR THPRDYTFGR
LMSWVAMDRA IRLSARHGRP GDIARWERER DAIYNQIMAR GYDRTRGSFV QAYGERVLDA
ALLIMPSVGF VTPNDPLWQS TLRAIDEDLV SDSLVHRYKP SEAPDGLPGD EGTFNMCTFW
YVAALACSGR LEDALLTFEK MFTFSNHLRL YAEEIALTGQ QIGNFPQAFS HLSLITTALA
LDGMLDNSDR GR