Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1755 |
Symbol | |
ID | 5705388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2027229 |
End bp | 2028728 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641271258 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001536633 |
Protein GI | 159037380 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.438991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0371464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCATGC CCGAAGGCAA CCTCCAGTCC CTGGCCGCCA CCGTTCTCCA ACCCGGCTTC GTCGGTACCA CCGCCCCGAC CTGGGTGCGC CGCTGGCTGG GCGACGGGCT CGGCGCGGTG GTGCTCTTCG CCCGCAACGT GGTCGACTCC GACCAGGTCG CCGCGCTGAC CGCGACGTTG CGCGCGGAAC GACCGGACGT CATCGTCGCC ATCGACGAGG AGGCCGGCGA CGTGACCCGG ATCGAGTCGG GGCTGGGCAG CTCCCGGCCC GGCAACCTGG CGCTCGGCGT GGTCGACGAT CCGGCCCTGA CCGAGGCAGT GGCCAGCGAC CTCGGCGCGG AGCTGGCGGC GCTCGGGATC ACCCTGAACT ACGCGCCGGA CGCCGACGTC AACTCCGATC CGGACAACCC GGTGATCGGC GTCCGTTCCT TCGGTGCCGA CCCGGCCCGT GTCGCCCGGC ACACCGCCGC CTGGGTGCGG GGCCTGCAGG CCAGCGGCGT CGCGGCCTGC GCCAAGCACT TTCCCGGGCA CGGTGACACC CGGATCGACT CCCACCACGA CCTGCCCCGG ATCGTCGGCG ACCGAACCCG GCTGGATGCC GTGGAATTGG CACCGTTCCG CGCCGCGTTG TCCGCCGGCG TGCAGGCGGT GATGAGCGGC CACCTGCTCG TACCCGTACT GGATCCGGAT CTGCCGGCCA GCCTGAGCCG CCGGATCCTC ACCGGCCTGC TCCGCGACGA GTTGGGATTC GCCGGGGTCG TGGTGACCGA CGCGGTGGAG ATGCGCGCGG TCGCCGACCG CTACGGCTTC GCGGGTGCCG CGGTGCGTGC CCTGGCCGCG GGCGCCGACG CCATCTGCGT TGGCGGCGAG CGCGCCGACG AGGACGCGGC CCAGCAGCTA CGGGACGCGA TCGTGGCCGC TGTCGTGGCC GGGGAACTGC CCGAGGAACG GCTCGTCGAG GCAGCCAAAC GGGTCAGCCT GCTCGCCTCC TGGACCGCCG CCAGCCGCGG GGCCCGGCCG GCGCGGCAGC CGGCACCCGG CGGTGGCTCG GCCGTCGGAT TCGCCGCCGC CCGGCGGGCC GTCCGGATCA CGACGGGCGG TGCCGGGCGG GGGACGCTGC CCCTGACCGG CCCCGCCCAC GTGGTGGAGT TCGAGTCCCC CCGGAACATC GCGATCGGCG CGGAGACACC GTGGGGCGTC GCGGCACCGC TGGCCGAGCT GCTGCCGGGC ACCACCGCTG TCCGGTATGC CGAGGACGAC GCGCCCACCG ATCCCGTCGC CGGAGCGCAC GGTCGCCACG TCGTCCTCGT CGTTCGGGAC CTGCACCGCC ACCCGTTGGT GCGGGCGGCC GTGACGCGTG CCCTGGCCGC CCGCCCGGAC GCCGTGGTCG TCGAGCTGGG TGTGCCCGAA CTCGTCACCG GGGCGGTGCA CGTGGCGACC CACGGTGCGA CCCGTGCCAG CAGCCGGGCC GCGGCGGAGG TCCTGACCGG GGCCGGCTGA
|
Protein sequence | MTMPEGNLQS LAATVLQPGF VGTTAPTWVR RWLGDGLGAV VLFARNVVDS DQVAALTATL RAERPDVIVA IDEEAGDVTR IESGLGSSRP GNLALGVVDD PALTEAVASD LGAELAALGI TLNYAPDADV NSDPDNPVIG VRSFGADPAR VARHTAAWVR GLQASGVAAC AKHFPGHGDT RIDSHHDLPR IVGDRTRLDA VELAPFRAAL SAGVQAVMSG HLLVPVLDPD LPASLSRRIL TGLLRDELGF AGVVVTDAVE MRAVADRYGF AGAAVRALAA GADAICVGGE RADEDAAQQL RDAIVAAVVA GELPEERLVE AAKRVSLLAS WTAASRGARP ARQPAPGGGS AVGFAAARRA VRITTGGAGR GTLPLTGPAH VVEFESPRNI AIGAETPWGV AAPLAELLPG TTAVRYAEDD APTDPVAGAH GRHVVLVVRD LHRHPLVRAA VTRALAARPD AVVVELGVPE LVTGAVHVAT HGATRASSRA AAEVLTGAG
|
| |