Gene Sare_3186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3186 
Symbol 
ID5705799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3676525 
End bp3677928 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content68% 
IMG OID641272617 
Productcellulose-binding family II protein 
Protein accessionYP_001537984 
Protein GI159038731 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3469] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00151543 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCGTT CCAGATCCCT GATCCTGTCC CTGGTCACCG TTATGACAGC CACGCTCGGG 
GCAGCCTGGG TGGCGCTGCC GGCCTACGCC GCCGGCCCGA CCGCGACCTT CGTCAAGGTC
TCCGACTGGG GCACCGGGTG GGAGGCAAAG TACACGATCA CCAACGGGGG AAGCAGCGCC
GTCGACGGCT GGAGTCTCGG CTTCGACCTG CCCGCCGGCA CGACGATCGG CAACTACTGG
GAGGCACTAC TCAGTTCCTC CGGTCAGCGG CACACGTTCA GCAACCGGTC CTGGAACGGC
ACCCTCGCGC CGGGTGGTTC GGTCTCCTTC GGCTTCATCG GCAGCGGGCC CGGCACCCCG
ACCACCTGCC AGCTGAACGG CGCGGACTGC GCCGGTCCGC CGCAGCCCAC GACTCCACCA
CCGACCACCC TGCCGCCCAC CACGATGCCA CCCACGCCCC CGCCGCCCAC CACGATGCCG
CCCACAACCC CGCCACCGAC CCACGAGTTA CCCGCGCACA TCCTCACCGG ATACTGGCAC
AACTTCGACA ACCCGGCCGT CGAGCTACGC CTACGGGACG TCCCCACCGA GTACGACGTG
GTCGCCGTCG CGTTCGCCGC GGCGACAACC ACCCCCGGCG AGGTGACCTT CGCGGTCGAC
CCGGGCTTGT CGGCATCACT GGGCGGCTAT TCCGACGCGG ACTTCTCGGC CGACGTGCAG
GCGCTCAAGA GCCAGGGCAG GAAGGTCGTA ATCTCGGTTG GCGGCGAGGC GGGACGGGTT
GCCGTCGACG ACGCGGCGGC TGCGGTCGCC TTCAGCGATT CGGTCCACGC ACTGATCCAA
CGGTATGGCT TCGACGGTGT GGACATCGAC CTGGAGAACG GACTCAATCC GACCTACATG
GCGCAGGCCC TCCGGTCGCT GCGGGCCAAG GTTGGCGCTG GCCTTGTCAT CACGATGGCG
CCCCAGACCA TCGACATGCA GAACCCCGCC ACCAGCTACT TCAAGCTGGC ACTGGACATC
AAGGATATTG TGACGGTAGT AAACACCCAG TACTACAACT CCGGTGCGAT GCTCGGCTGC
GACCAGAGGT TTGCCTACAG CCAGGGCTCG GTGAACTTCA TCGTTGCGCT GGCTTGCATC
CAACTGGAGG CGGGGCTGCG GCCAGACCAG GTCGGGCTCG GTCTGCCAGC CGGCCCGGGG
GCAGCCGGCG GAGGCATCGT CGCACCCAGT GTGGTCAACG CCGCGCTGGA CTGCCTGACC
AGGGGGACAC ACTGCGGCAG CTTCCGCCCA CCCCGCACCT ACCCGGGGTT GCGCGGCGCG
ATGACCTGGT CGGTGAACTG GGACGTAACC AACGGCACCA CCTTTGCCCA GACCGTCGGC
CCACACCTGG ACACCCTGCC CTGA
 
Protein sequence
MKRSRSLILS LVTVMTATLG AAWVALPAYA AGPTATFVKV SDWGTGWEAK YTITNGGSSA 
VDGWSLGFDL PAGTTIGNYW EALLSSSGQR HTFSNRSWNG TLAPGGSVSF GFIGSGPGTP
TTCQLNGADC AGPPQPTTPP PTTLPPTTMP PTPPPPTTMP PTTPPPTHEL PAHILTGYWH
NFDNPAVELR LRDVPTEYDV VAVAFAAATT TPGEVTFAVD PGLSASLGGY SDADFSADVQ
ALKSQGRKVV ISVGGEAGRV AVDDAAAAVA FSDSVHALIQ RYGFDGVDID LENGLNPTYM
AQALRSLRAK VGAGLVITMA PQTIDMQNPA TSYFKLALDI KDIVTVVNTQ YYNSGAMLGC
DQRFAYSQGS VNFIVALACI QLEAGLRPDQ VGLGLPAGPG AAGGGIVAPS VVNAALDCLT
RGTHCGSFRP PRTYPGLRGA MTWSVNWDVT NGTTFAQTVG PHLDTLP