Gene Information |
Locus tag | Sare_3186 |
Symbol | |
ID | 5705799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3676525 |
End bp | 3677928 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641272617 |
Product | cellulose-binding family II protein |
Protein accession | YP_001537984 |
Protein GI | 159038731 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3469] Chitinase |
TIGRFAM ID | |
|
|
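The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the listed 1404 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3676525, 3677928)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```

Both values agree with the Gene Length and Protein Length rows of this record.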
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00151543 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCGTT CCAGATCCCT GATCCTGTCC CTGGTCACCG TTATGACAGC CACGCTCGGG GCAGCCTGGG TGGCGCTGCC GGCCTACGCC GCCGGCCCGA CCGCGACCTT CGTCAAGGTC TCCGACTGGG GCACCGGGTG GGAGGCAAAG TACACGATCA CCAACGGGGG AAGCAGCGCC GTCGACGGCT GGAGTCTCGG CTTCGACCTG CCCGCCGGCA CGACGATCGG CAACTACTGG GAGGCACTAC TCAGTTCCTC CGGTCAGCGG CACACGTTCA GCAACCGGTC CTGGAACGGC ACCCTCGCGC CGGGTGGTTC GGTCTCCTTC GGCTTCATCG GCAGCGGGCC CGGCACCCCG ACCACCTGCC AGCTGAACGG CGCGGACTGC GCCGGTCCGC CGCAGCCCAC GACTCCACCA CCGACCACCC TGCCGCCCAC CACGATGCCA CCCACGCCCC CGCCGCCCAC CACGATGCCG CCCACAACCC CGCCACCGAC CCACGAGTTA CCCGCGCACA TCCTCACCGG ATACTGGCAC AACTTCGACA ACCCGGCCGT CGAGCTACGC CTACGGGACG TCCCCACCGA GTACGACGTG GTCGCCGTCG CGTTCGCCGC GGCGACAACC ACCCCCGGCG AGGTGACCTT CGCGGTCGAC CCGGGCTTGT CGGCATCACT GGGCGGCTAT TCCGACGCGG ACTTCTCGGC CGACGTGCAG GCGCTCAAGA GCCAGGGCAG GAAGGTCGTA ATCTCGGTTG GCGGCGAGGC GGGACGGGTT GCCGTCGACG ACGCGGCGGC TGCGGTCGCC TTCAGCGATT CGGTCCACGC ACTGATCCAA CGGTATGGCT TCGACGGTGT GGACATCGAC CTGGAGAACG GACTCAATCC GACCTACATG GCGCAGGCCC TCCGGTCGCT GCGGGCCAAG GTTGGCGCTG GCCTTGTCAT CACGATGGCG CCCCAGACCA TCGACATGCA GAACCCCGCC ACCAGCTACT TCAAGCTGGC ACTGGACATC AAGGATATTG TGACGGTAGT AAACACCCAG TACTACAACT CCGGTGCGAT GCTCGGCTGC GACCAGAGGT TTGCCTACAG CCAGGGCTCG GTGAACTTCA TCGTTGCGCT GGCTTGCATC CAACTGGAGG CGGGGCTGCG GCCAGACCAG GTCGGGCTCG GTCTGCCAGC CGGCCCGGGG GCAGCCGGCG GAGGCATCGT CGCACCCAGT GTGGTCAACG CCGCGCTGGA CTGCCTGACC AGGGGGACAC ACTGCGGCAG CTTCCGCCCA CCCCGCACCT ACCCGGGGTT GCGCGGCGCG ATGACCTGGT CGGTGAACTG GGACGTAACC AACGGCACCA CCTTTGCCCA GACCGTCGGC CCACACCTGG ACACCCTGCC CTGA
|
Protein sequence | MKRSRSLILS LVTVMTATLG AAWVALPAYA AGPTATFVKV SDWGTGWEAK YTITNGGSSA VDGWSLGFDL PAGTTIGNYW EALLSSSGQR HTFSNRSWNG TLAPGGSVSF GFIGSGPGTP TTCQLNGADC AGPPQPTTPP PTTLPPTTMP PTPPPPTTMP PTTPPPTHEL PAHILTGYWH NFDNPAVELR LRDVPTEYDV VAVAFAAATT TPGEVTFAVD PGLSASLGGY SDADFSADVQ ALKSQGRKVV ISVGGEAGRV AVDDAAAAVA FSDSVHALIQ RYGFDGVDID LENGLNPTYM AQALRSLRAK VGAGLVITMA PQTIDMQNPA TSYFKLALDI KDIVTVVNTQ YYNSGAMLGC DQRFAYSQGS VNFIVALACI QLEAGLRPDQ VGLGLPAGPG AAGGGIVAPS VVNAALDCLT RGTHCGSFRP PRTYPGLRGA MTWSVNWDVT NGTTFAQTVG PHLDTLP
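The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names are hypothetical, and how the database itself derives its GC content figure is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a short demo.
demo = clean_sequence("ATGAAGCGTT CCAGATCCCT")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.5
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared against the GC content row, on the assumption that the listed percentage was computed over the same span.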
|