Gene Information |
Locus tag | Sare_3626 |
Symbol | |
ID | 5708173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4183114 |
End bp | 4184841 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641273051 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001538415 |
Protein GI | 159039162 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.124243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000631945 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCGACAC CCTCGCAACA CGGCGGCGTC GTAGTCGTCG CACTGGCCGT CCTGACCTCG CTGCTGCTCG CCGGCTGCAC CGGCGGGCGG GGGCGCGCAC AGCCGACTCC GGCGGCGAGT GACCCGACGG CAGGCCCGTC ACCCTCGGTC CCGGTCACTG ATCCGGGCGC CCGCGCCGCC ACCCTGGCGG GCTCGCTGGC TGACGAGGAC CTCGTCGGCC AGGTGCTGAT GCCCTTCGCC TACGGCGACG CCGCCGATCG GGTCTCGTCC GGTTCGGCCG CCGGTAACCA GAAACTCGCC GGCGTCGACA CTCCGGCCCA AATGATCGCG AAGTACCGCC TCGGCGGGCT CATTCTCGTC GGCTTCAGCG CGGACGACCC GACCAGCGGC AATCAGGAGA CCACCAACGT CGACAACCCC GACCAGGTCC GGGGGCTGAC CGCCGGGCTG CGGTCCGTCG CCGCCGACCT GGCCACCGGC GAGGCGCCGT TCCTGATCGG CACGGATCAG GAGTACGGGG TGGTCACCCG GATCACCGAG GGCGTTACGC AGCTGCCCAG CGCGTTGGCC GCCGGGGCGG CCGGCAAGCC TGACCTGACC GAGGCCGCCT GGCAGGCCGC CGGCACCGAA CTCGCCGCGA TGGGCGTCAA CGTGGACTTC GCCCCCGTCG CCGACGTGCT CGTCACGCCG AGCACCGTGA TCGGCTCACG GTCGTACGGT GCCGACCCGT CGGCGGTGGC CGCACAGGTC AGCGGTGCGG TACGCGGCCT GCAGTCGGCC GGTGTCGCGG CCACCCTCAA ACATTTCCCC GGCCACGGGC ACAGCGCCAC CGACTCCCAC GAGGCACTGC CGAGGTTGGA ACAGCCCCGC GCCGTACTCG AGTTGGAGGC ATGGAGTCCC TTCGCGGCCG GCATCGGGGC CGGTGCCCTC GCGGTGATGT CCGGGCACCT CGACGTCCGT GCGGTCGACC CGGGGACCCC GGCGACGTTC TCGCACACCC TCCTCACCGA GGTGCTCCGC GGTCAGCTCG GCTTCCAGGG CGTCGTGATC ACCGACGGGA TGAACATGGC GCCCGCCAAG CGCTGGTCGC CCGGCGAGGC CGCGGTGCGT GCCCTCAAGG CCGGCAACGA CCTGATCCTG ATGCCGCCGC ACGTCGGCCA GGCGTACGAC GGGCTGCTCG CCGCGCTGCG CGACGGCTCG CTGCCCCGGA CCCGGCTGGT CGAGGCGGTG ACCCGCGTGT TGACCATGAA GTTCACCCTG GCCGGTGCGG CCACCCCCGC GCTGGACGTC ATCGGTACGC CAGCCCACCT GGCGGCGGCC ACCGAACTCG CCACCGCCGC GGTGACCGCA CTGCGTGGCC AGTGTGGCAG CCTGGTGTCC GGGCCGATCA CCGTGACCGC CTCCGCCGGC CGGAAGCACA CCCGGGCGGT GCTGATCAAG GAGCTGACCG CGGCCGGGGT GCCGGTGGTC GACACCGGCG GTGCCGTGGT CCACCTGGTC GGCTACGGCG ACGGCACCGA CGACCTGAGC GCCGACGCCG CCGTGACCGT CGCCATGGAC ACCCCGTACC TGCTGGCCGA GGCGGATTCC CCGGCGCTGC TGGCGACCTA CTCGTCGAGC CCGGCGGCGA TGACCGGGCT GGCCCGGGTG CTGGCCGGTA CGGCCACCCC CGCCGGCCGT TCGCCGGTGC CGGTGCCCGG CCTGCCCGCG ACGAGCTGCG GCAACTGA
|
Protein sequence | MPTPSQHGGV VVVALAVLTS LLLAGCTGGR GRAQPTPAAS DPTAGPSPSV PVTDPGARAA TLAGSLADED LVGQVLMPFA YGDAADRVSS GSAAGNQKLA GVDTPAQMIA KYRLGGLILV GFSADDPTSG NQETTNVDNP DQVRGLTAGL RSVAADLATG EAPFLIGTDQ EYGVVTRITE GVTQLPSALA AGAAGKPDLT EAAWQAAGTE LAAMGVNVDF APVADVLVTP STVIGSRSYG ADPSAVAAQV SGAVRGLQSA GVAATLKHFP GHGHSATDSH EALPRLEQPR AVLELEAWSP FAAGIGAGAL AVMSGHLDVR AVDPGTPATF SHTLLTEVLR GQLGFQGVVI TDGMNMAPAK RWSPGEAAVR ALKAGNDLIL MPPHVGQAYD GLLAALRDGS LPRTRLVEAV TRVLTMKFTL AGAATPALDV IGTPAHLAAA TELATAAVTA LRGQCGSLVS GPITVTASAG RKHTRAVLIK ELTAAGVPVV DTGGAVVHLV GYGDGTDDLS ADAAVTVAMD TPYLLAEADS PALLATYSSS PAAMTGLARV LAGTATPAGR SPVPVPGLPA TSCGN
|
| |
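The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4183114
END_BP = 4184841
GENE_LENGTH_BP = 1728
PROTEIN_LENGTH_AA = 575

# First and last codons copied from the "Gene sequence" field.
FIRST_CODON = "GTG"
LAST_CODON = "TGA"


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # The coordinate span should match the reported gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # The codon count minus the stop should match the reported protein length.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # GTG is an accepted alternative start codon under translation table 11
    # and is translated as M at the initiation position; TGA is a stop codon.
    print(FIRST_CODON, LAST_CODON)
```

Under these assumptions, 4184841 - 4183114 + 1 gives 1728 bp, and 1728 / 3 - 1 gives 575 aa, matching the Gene Length and Protein Length fields.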