Gene Sare_3485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3485 
Symbol 
ID5703548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4018755 
End bp4020374 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content67% 
IMG OID641272912 
Productchitinase 
Protein accessionYP_001538278 
Protein GI159039025 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.686156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0640253 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGAT CGCTTCGCCG GGCCCTCTGG GCTGGCGCCG TGGTCGTGTT GGCGGCCGCG 
GCTGTTCCGA TGGCCTCGGC CTACGGGGCC GGCAGTGTCA CCGCCACGTT CGACAAGGTG
CAGGACTGGG GGACCGGCCA CCAGACGAAG GTGACGGTCA CCAACGGCTC GGACACGTCG
GTGAGCGACT GGCGTATCGA GTTCGACCTC CCCGCCGGGA CCAGCATCGG CACCTTCTGG
GACGCCGACG TCACCCGCAC CGGGAACCAC TACGTCGCGG TCAAGAAGAG CTGGGCCGGC
CCCCTCGCCC CAGGTGCCAG CTTCAGCTGG GGCTACAACG GGACCGGCCC CTACCAAGCG
CCACTGAACT GCACGATCAA CGGTGCCACC TGCTCCGGTG GTGCCCCGCC GACAACGGCA
CCCCCCACAA CCGCACCCCC CACAACGGCG CCGCCGACGA CCGCGCCCCC AACGACCCCG
CCGCCGACCA CCTCACCCCC AGGTGGTGAC CACAAGGTCG TCGGCTACTT CGCACAATGG
GGCGTCTACG CGCGCAACTA CCACGTCAAG AACATCCACA CCAGCGGCTC GGCGGCGAAG
CTGACCCACA TCATGTACGC GTTCGGCAAC ACCACCAATG GACGCTGCAC GATCGGCGAC
AGCTACGCCG ACTACGAAAA GGCGTACACC GCGGCGGACA GTGTGGACGG GGTCGCGGAC
ACCTGGGACC AACCGTTGCG GGGTAGCTTC AACCAGCTGC GCAAACTCAA GGAGATGTAC
CCGCACCTCA AGGTGATCTG GTCCTTCGGT GGCTGGACCT GGTCCGGCGG GTTCACCCAG
GCGGCGCAGA ACCCGGCTGC GTTCGCCGAG AGCTGCTACA ACCTGGTCGA GGACCCGCGC
TGGGCGGACG TCTTCGACGG CATCGACATC GACTGGGAGT ATCCGAACGC CTGTGGCCTC
ACCTGCGACT CCAGCGGGCC GGCGGCATTC AAGAACGTGG TGAACGCGCT GCGTTCACGG
TTCGGCCCAT CGGCTCTGGT CACCGCCGCG ATCACCGCTG ACGCCAGTAA CGGTGGCAAG
ATCGACGCTG CCGACTACGC CGGCGCGGCA CCGAACCTCG ACTGGATCAT GGCGATGACC
TACGACTACT TCGGTGCCTT CAACCCGCAG GGCCCGACCG CCCCGCACTC GCCGCTCTAC
TCGTACCCCG GCATCCCGCA GCAGGGGTTC TGGTCCGACG CGGCGATCCA GAAGTTGAAG
AGCAAGGGCG TTCCGGCCGA CAAGCTGCTG CTCGGCATCG GCTTCTACGG TCGGGGCTGG
ACCGGCGTCA CCCAAACCGC GCCGGGTGGT TCCGCCACCG GGGCCGCGCC GGGAACCTAC
GAGCAGGGCA TCGAGGACTA CAAGGTCCTC AAGAACACCT GCCCGGCGAC CGGGATGGTC
GGCGGTACGG CGTACGCCAA GTGTGGCAAC AACTGGTGGA GCTATGACAC CCCCGCCACC
ATCGGCGGCA AGATGACCTA CGCGAAGAAC GAGGGCCTCG GCGGCGCGTT CTTCTGGGAG
CTCTCCGGCG ACACGACCAA CGGTGAGTTG ATCGGCGCGA TCAAGGGCGG TCTCGGCTAG
 
Protein sequence
MKRSLRRALW AGAVVVLAAA AVPMASAYGA GSVTATFDKV QDWGTGHQTK VTVTNGSDTS 
VSDWRIEFDL PAGTSIGTFW DADVTRTGNH YVAVKKSWAG PLAPGASFSW GYNGTGPYQA
PLNCTINGAT CSGGAPPTTA PPTTAPPTTA PPTTAPPTTP PPTTSPPGGD HKVVGYFAQW
GVYARNYHVK NIHTSGSAAK LTHIMYAFGN TTNGRCTIGD SYADYEKAYT AADSVDGVAD
TWDQPLRGSF NQLRKLKEMY PHLKVIWSFG GWTWSGGFTQ AAQNPAAFAE SCYNLVEDPR
WADVFDGIDI DWEYPNACGL TCDSSGPAAF KNVVNALRSR FGPSALVTAA ITADASNGGK
IDAADYAGAA PNLDWIMAMT YDYFGAFNPQ GPTAPHSPLY SYPGIPQQGF WSDAAIQKLK
SKGVPADKLL LGIGFYGRGW TGVTQTAPGG SATGAAPGTY EQGIEDYKVL KNTCPATGMV
GGTAYAKCGN NWWSYDTPAT IGGKMTYAKN EGLGGAFFWE LSGDTTNGEL IGAIKGGLG