Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1076 |
Symbol | |
ID | 5704344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1206680 |
End bp | 1208464 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270591 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001535975 |
Protein GI | 159036722 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000337356 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGCCCGCA ACGAACGGAC GAGCCGATCC GAGGCACCCC ACTCACCGAA CGTGCTCCGG GAGTACGCGC TACTCGCCGA CGGCCAGCGG GCGGCGCTGG TCGGCCCGGA CGGGAACATC GTGTGGCTGT GTGCCCCGCG GTGGGCCGAC CCGCCGCTGT TCAGCAACCT CCTCGGCGGT CGTGGCAGCT ACCTGGTGAC CCCGACGAAC CGACGCTTCG TGTGGGGTGG GCAGTACCAG CCCGAGTCGT TGATCTGGAT CAACAGGTGG GTGACCACCG ACGGAATCAT CGAAACCCGA GAGGCCCTGG CATTCCCCGG CGACGACCAG CGGGTGGTCC TGCTGCGCCA GGTCCACGCC CTGGACCAGG ACGCCGCTGT TCGCGTGCAG CTCGACCCCA GAGCCGACTT CGGCCGAGAG CCGATCCGTC AGGTGCGGCA GGATGGGGGG CTTTGGTGCG CCCAGACCGG GAACCTGTAC CTGCGGCACA GCAGCGGGCA GCCGCTGGGC ACCGGCGCCG GGGGGCTGCT CTGCGGCGAG TTGCGGGTGC CCGCTGGAGG GCGGGCCGAC CTGGTGCTGG AAATCTCCAC CCGGCCGCTC GACGACGAGC CACCCCAACC ACCCGAGCTG TGGCGGATCA CGGAGAAATC CTGGCAGAGA GTGCTGCACC CGCTGTCGCG CGGCACCGCG GGACGAGACG CGGTGTTCGC CTACACGGTG CTGCGCGGGC TGACCCGGCC CGGCGGAGGA ATGGTGGCTG CGGTGACCGC GGGACTGCCG GAGCGGGCGC TCGGCGGCCG CAACTACGAC TACCGCTACG CCTGGATCCG CGACCAGGCG TTCGCCGGAC AGGCCGCCGC CCTGATCGGC CGGCACGAGC TCCTCGACGA CGCGGTAGCG TTCCTCACCG ACCGGGTCCT CGCCGAGGGC GACCGGCTTG CCCCCGCCTA CACGATCGAC GGCGGCCCGG TACCCCCGGA ACAGGAGCTG ACGTTCCTGC CCGGATACCC GGGCGCCAAG GCGCGGACCG GCAACTGGGT GGGTGGGCAG TTCCAGTTGG ACGCCTACGG CGAGGTGCTG TTGGTGCTGG CGACCGCCGC GAGCCACGGA CGGCTGGAAG CCACCTCCTG GCAGGCGCTG ACGCTCGCCG CCCAGGTCAT CGAGGAACGC TGGCAACAAG CCGACTCGGG TATCTGGGAG CTCCCTGCCC GACAGTGGAC CCACTCGAAG CTGACCTGTG TGGCCGGGCT ACGGGCTGCG GCACGGATCG CGCCGGGCGG GCTGGCCGGC CGCTGGGTGG CCCTCGCCGA CACCATCGTC GCCGACACCG CGGCCCACGC CCTCCACCCC TCCGGGCGCT GGCAACGCGC CTACGACGAC CCGCGGGTCG ACTCGGCGTT GCTGCTGCCG GGAATCCGAG GTGCCCTGCC AGAGGGTGAT CCACGTACCG AGGCGACCCG CCGCGCTGTC CTGGCCGAGC TGCAGCAGGA CGGCTACCTG TACCGGTTTC GGCCGGATCG TCGCCCGCTC GGGGACGCCG AGGGGGCCTT CCTGCTCTGT GGGTTCGCCG CGGCGCTCGC CGAGTGGCAG GCCGGCGACG CCGTCGCCGC GAACCGGTGG TTCGAACGCA ACCGGGCGGG GTGCGGCCCG CCGGGGCTGT TCACCGAGGA GTTCGACGTG GCACAGCGGC AGCTACGGGG CAACCTGCCG CAGGCGTTCG TGCACGCGCT GATGCTCGAA ACCGCGGTGC GCCTCGGCCA CGCCGCCCCC TGCACGGCCG ACTAG
|
Protein sequence | MARNERTSRS EAPHSPNVLR EYALLADGQR AALVGPDGNI VWLCAPRWAD PPLFSNLLGG RGSYLVTPTN RRFVWGGQYQ PESLIWINRW VTTDGIIETR EALAFPGDDQ RVVLLRQVHA LDQDAAVRVQ LDPRADFGRE PIRQVRQDGG LWCAQTGNLY LRHSSGQPLG TGAGGLLCGE LRVPAGGRAD LVLEISTRPL DDEPPQPPEL WRITEKSWQR VLHPLSRGTA GRDAVFAYTV LRGLTRPGGG MVAAVTAGLP ERALGGRNYD YRYAWIRDQA FAGQAAALIG RHELLDDAVA FLTDRVLAEG DRLAPAYTID GGPVPPEQEL TFLPGYPGAK ARTGNWVGGQ FQLDAYGEVL LVLATAASHG RLEATSWQAL TLAAQVIEER WQQADSGIWE LPARQWTHSK LTCVAGLRAA ARIAPGGLAG RWVALADTIV ADTAAHALHP SGRWQRAYDD PRVDSALLLP GIRGALPEGD PRTEATRRAV LAELQQDGYL YRFRPDRRPL GDAEGAFLLC GFAAALAEWQ AGDAVAANRW FERNRAGCGP PGLFTEEFDV AQRQLRGNLP QAFVHALMLE TAVRLGHAAP CTAD
|
| |