Gene Sare_3942 details

Gene Information

Locus tag: Sare_3942
Symbol:
ID: 5708213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4484852
End bp: 4486909
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 67%
IMG OID: 641273367
Product: Amylo-alpha-16-glucosidase
Protein accession: YP_001538723
Protein GI: 159039470
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3408] Glycogen debranching enzyme
TIGRFAM ID:
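
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a short calculation. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for a CDS, assuming one stop codon excluded from the protein."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(4484852, 4486909)
print(length)                     # 2058
print(protein_length_aa(length))  # 685
```

Both values agree with the Gene Length and Protein Length fields of the record.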


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.300307
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGGCA ACACGGTGCG GATCCTCGAC GGCAACACCT TTGTCGTCTC CGAGGAGACC 
GGGGACATCG AGGCGACGCC GACCGAGCCG ACCGGGCTCT TCTCCCTCGA CACCCGATTC
CTGTCCACCT GGGTGCTGAC GATCAACGGC GAGCGACTCA ACCCGCTGTC CTACGACGAC
CTGCAGTACT ACGAGGCGCG GTTCTTCCTG GTACCGGGGG TGGCGACGCA TTACGTCGAC
GCCAAGCTCT CGGTTGTCCG GGAGCGGACC GTGGGGGGCA GCTTCCGCGA GACACTCACC
ATCCTCAACC ACGACGAGAA GCCAGTCGAT CTGGAGATCC GGATGGACGC CGCAGCCGAT
TTCGCCGACC TGTTCGAGGT GAAGGACGAA CTGCTGAGCA AGAAGGGCGA GATCTACGCC
GAGGCGGAGT CGGACCGGTT GCGGCTGGGC TACCGGCGCG GCAACTTCAA GCGCGAAACC
ATGATCACGT CGTCGGTGCC GGCAAAGTTC GACAAGGGCG GTTTCGCGTA CACTCTCCGC
CTAGAGGCGA ATGAGCAGTG GGTGGCGAAC ATCGACGTGC GTACCCTGAC GCTCGGCCCG
GGCGGTCGGG ATCTGCGGAT GGGTCTGCGA GCGCACAGTG CCGAACGGCT CGCCCTGCAA
CAGGACCTGG AGGCGTGGAT CGCGAACGCA CCCACGGTCA ACAGTGAACG CGAGGACCTG
ACCGCGACGT ACGGTCGCTG CCTGGTCGAC CTTGCAGCCC TGCGCTTCGT GCCGCTCTCG
CTGGGCGGGG CGGCGTTGCC GGCCGCCGGC CTGCCGTGGT TCATGACGAT GTTCGGTCGG
GACAGCATCC TGACCTGCCT GCAGGTGCTG CCGTTCGCCC CGGAGATGTC CAGGACGACG
TTGCAGATCC TCGCCACCCT CCAGGGCACC CGGTTCGATG ATTTCCGGGA GGAGGACCCG
GGTCGGATTC TGCACGAGAT GCGCTACGGC GAGACGGCCG CCTTCGAGGA GCAACCACAC
TCGCCGTACT ACGGCTCGGT GGACGCGACA CCACTGTTCA TCGTGCTCCT CGACGAGTAC
GAGCGGTGGA GCGGCGACGG CGCGCTGGCC AAGGCGTTGC AGGTCGAGTG TCGGGCGGCG
TTGAAGTGGA TCGACCACTA CGCCGACCTG GTAGGTAATG GTTATATCTG GTATGAACGG
CGCAACACCG ACACCGGGCT GGAGAACCAG TGTTGGAAGG ACTCCTGGGA CTCCATCTCC
TACCGGGACG GTACGTTACC TCCGTTTCCG CGCGCCACCT GCGAGGTCCA GGGGTACGCG
TACGACGCGA AGGTGCGGGC GGCTCGGCTG GCCCGTGAGT TCTGGGACGA CCCGGCGTTC
GCCACGCAGT TGGAGCGGGA GGCGGCCGAC CTGAAGGAGC GGTTCAACCG CGAGTGGTGG
GTGGAGGACG GTGGGTACTA CGCCCTTGCC CTGGATCCGG ACGGCCGGCA GTGCGATGTG
CTCAGCTCCA ACATCGGACA CCTGCTGTGG AGTGGGATCG TCGATGACGA TCGAGCCCCG
AAGCTCGCCG AACACCTGGT TGGGCCACGG CTTTTCTCCG GCTGGGGGGT TCGGACGCTG
GCCGAGGGCG AGGCCCGCTA CAACCCGATC GGCTACCACA ACGGCACCAT CTGGCCGTTC
GACAACTCGT TCGTCGCCTG GGGCCTGCGC AACTACGGCT TCGCGGAGGA GGCGGCGACG
ATCGCCAACG GCATCCTCGA CGCTGCCCGC TACTTCTCCG GCCGGCTGCC CGAGGCGTTC
GGCGGCTACC CACGGGAGTT GACCAAGTTC CCGGTCGAGT ACCCCACCGC GTGCTCGCCC
CAGGCGTGGT CCACCGGCGC GCCGTTGTTG CTGCTGCGGA CGATGCTCGG GCTGGAGCCG
CACGAGGGCC ACCTGGCCGT CGAGCCCCGC CTGCCGGTGG GGATGGGGCG AATCGAAGTG
CTGGACATAC CCGGCCGCTG GGGTCGGGTG GACGCGTTCG CCCGGGGCCG GCTCGACCTC
ACCAGTCGCG GCGACTGA
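
The GC content field can in principle be recomputed from the gene sequence. A minimal sketch follows, assuming the sequence is pasted in the 10-base blocked layout used above; `gc_fraction` is a hypothetical helper, not part of any database tooling, and only the first three blocks are used as sample input.

```python
def gc_fraction(sequence):
    """Fraction of G and C among the unambiguous bases of a DNA string.

    Spaces, newlines, and any character other than A, C, G, T are skipped.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    return sum(b in "GC" for b in bases) / len(bases)


# First three blocks of the gene sequence, used only as sample input.
first_blocks = "ATGCCGGGCA ACACGGTGCG GATCCTCGAC"
print(round(gc_fraction(first_blocks), 3))  # 0.667
```

Passing the full gene sequence instead of the sample blocks gives the value to compare against the GC content field.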
 
Protein sequence
MPGNTVRILD GNTFVVSEET GDIEATPTEP TGLFSLDTRF LSTWVLTING ERLNPLSYDD 
LQYYEARFFL VPGVATHYVD AKLSVVRERT VGGSFRETLT ILNHDEKPVD LEIRMDAAAD
FADLFEVKDE LLSKKGEIYA EAESDRLRLG YRRGNFKRET MITSSVPAKF DKGGFAYTLR
LEANEQWVAN IDVRTLTLGP GGRDLRMGLR AHSAERLALQ QDLEAWIANA PTVNSEREDL
TATYGRCLVD LAALRFVPLS LGGAALPAAG LPWFMTMFGR DSILTCLQVL PFAPEMSRTT
LQILATLQGT RFDDFREEDP GRILHEMRYG ETAAFEEQPH SPYYGSVDAT PLFIVLLDEY
ERWSGDGALA KALQVECRAA LKWIDHYADL VGNGYIWYER RNTDTGLENQ CWKDSWDSIS
YRDGTLPPFP RATCEVQGYA YDAKVRAARL AREFWDDPAF ATQLEREAAD LKERFNREWW
VEDGGYYALA LDPDGRQCDV LSSNIGHLLW SGIVDDDRAP KLAEHLVGPR LFSGWGVRTL
AEGEARYNPI GYHNGTIWPF DNSFVAWGLR NYGFAEEAAT IANGILDAAR YFSGRLPEAF
GGYPRELTKF PVEYPTACSP QAWSTGAPLL LLRTMLGLEP HEGHLAVEPR LPVGMGRIEV
LDIPGRWGRV DAFARGRLDL TSRGD
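
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code during elongation and differs only in which codons may act as initiators, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence listed above.
print(translate("ATGCCGGGCA ACACGGTGCG GATCCTCGAC"))  # MPGNTVRILD
print(translate("ACCAGTCGCG GCGACTGA"))               # TSRGD
```

The two outputs match the first and last residues of the protein sequence shown above.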