Gene Sare_1632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1632 
Symbol 
ID5703476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1870041 
End bp1871057 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content71% 
IMG OID641271140 
Productchitin-binding domain-containing protein 
Protein accessionYP_001536515 
Protein GI159037262 
COG category[S] Function unknown 
COG ID[COG3397] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAGC GCTTTGCCCT GCCATTGATG ACGATGGGAG CTGTCACGGC CACGATGGCC 
GTCGCCGCGC CCGCCCAGGC GCATGGCTAC GTTTCGGGAC CGCCCAGCCG TCAGGCGCTC
TGCGCGCAGG GTTCGGTACC CGACTGTGGG CCCATCTCGT TCGAGCCGCA GAGCGTCGAA
GGACCAAAGG GCCTGACGAG CTGCAGCGGC GGCATCTCCG AGTTCGCCGT GCTCGACGAC
GAGAGCCGGG CCTGGCCCGC GGCCACGGTC GGCCGGTCGG TCACCTTCGA CTGGATCAAG
ACCGCCCCGC ACAAGACCAG CAACTGGGAG TACTTCATCG GCGACGAGCT GTTGGCCACG
TTCGACGGTG GTGGCGTGCA GCCGCCGTCC ACGCTCTCGC ACACGGTCGA CCTGGGCGAC
CACGTGGGCC GGCAGAAGGT TCTCGCGGTG TGGAACATCG CCGACACCCC CATGGCGTTC
TACTCCTGCA TCGACGTGAA TATCGACGGC GGCCCTTCGC CGACGCCGAC GGGCACCGCG
TCGCCGACCC CGACCGCCTC GCCGACGAGC ACCGCGTCAC CGACCCCGAC CGCCTCACCG
ACGAGCACCG CCTCGCCGAC CCCGACCGCC TCGCCGACGA GCACCGCGTC ACCGACCCCG
ACCGGCACGC CGTCTCCGAC CTCGACCGGG ACTCCCGCGC CGGAAAGCTG GCAGGTCGGT
ACCACCTACC AGATCGGTGA CGAGGTGACG TACGACGGGG TGAGCTACCG GGCTCGGCAG
GCGCACACCG CGACACCCGG GTGGGAGCCG CCGCGCGTAC CAGCGCTCTG GACTGCCGTG
ACACCACCAC CCGCGACCGG CGACCCGGCA CCCGGCGACG GTTGGGCGGT TGGCATCGCC
TACCAGATCG GTGACGAGGT GACGTACGAC GGGGTGAGCT ACCTGGCTCG GCAGGCGCAC
ACCGCGACAC CCGGGTGGGA GCCGCCGCAC GTGCCGTCGC TGTGGATCCG AATCTGA
 
Protein sequence
MRKRFALPLM TMGAVTATMA VAAPAQAHGY VSGPPSRQAL CAQGSVPDCG PISFEPQSVE 
GPKGLTSCSG GISEFAVLDD ESRAWPAATV GRSVTFDWIK TAPHKTSNWE YFIGDELLAT
FDGGGVQPPS TLSHTVDLGD HVGRQKVLAV WNIADTPMAF YSCIDVNIDG GPSPTPTGTA
SPTPTASPTS TASPTPTASP TSTASPTPTA SPTSTASPTP TGTPSPTSTG TPAPESWQVG
TTYQIGDEVT YDGVSYRARQ AHTATPGWEP PRVPALWTAV TPPPATGDPA PGDGWAVGIA
YQIGDEVTYD GVSYLARQAH TATPGWEPPH VPSLWIRI