Gene Sare_4476 details

Gene Information

Locus tag: Sare_4476
Symbol:
ID: 5706916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5057490
End bp: 5058644
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 69%
IMG OID: 641273892
Product: hypothetical protein
Protein accession: YP_001539241
Protein GI: 159039988
COG category:
COG ID:
TIGRFAM ID: [TIGR02570] CRISPR-associated protein, GSU0053 family, N-terminal domain
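
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 5057490
END_BP = 5058644


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is an untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the script reproduces the listed Gene Length (1155 bp) and Protein Length (384 aa).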


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGGCG ACGAGTTGGC GGCACGGCTG ATTGCTGCCG TTGGCGAGCA GAGGCGTGAG 
AGCGGCGTGG TGGTGGAGGC GGTGTATCAG CCGGTCGGGG GTGCTGGTGG CAAGGTGATG
CCTCCGACCT TCCCGGTGGT GGAGCGCGGT GGCTCGCCCT ATCTGCTGGA GGAACGGTGG
GTTGATGGCG ACCGGGTGGG CACGGTGGTG ATCGACCAGG TGCCGAGCCA GGCCAACCGG
GTCGAGGAGG CGCTGCTGGC GGCTCGGGAC ACGGGGCGAC TGTCGGTGCC GATCTTCGAG
ATGATGGTGG ACGGGCTGCG GCTGACGTCG CTGCAGTTCC CGCACCGCTA TGCGGATGCG
TACCTGCGTG ACAGCGAGGT CGACGGCGTA CGTTTCGATG ACAGCACGGC CGGTAAGGCG
CTGCGGTCGG TGACGACACG TGACGTTCGT CCGCTGTACG CCCGGGAGCC GTACTCGCTG
TTGTTCGGGG CGTGGGATTC GCATCGCAGG GGCCGGTGGC CACGGTTCGC GCGGCTGTAC
CAGTCGATGA TGTACGGCCT GGATCCGATC GTTGGCGATC GACGCAGCGG GCGGTTCGAC
CCGTTGAATC TCACGGGCGG TGTGGACAAC AAGAACAAGG CTGAGACGGA CTGGCGGTTC
CTCCCGGAGG GGCAGAAGGC CAAGGGCGGC CGGCTGAGCG AGATCGGCCA CGGCCACATC
GCCCCCAACC CCGCTCATGG TGGGGTGACG GTCCGGGAGG TACGCCGGTC GGCGTGGATC
TCCTTCGCCG GCCTGGAGCG GCTGCGGTTC GGGGAGGTCT CCGAGGAGGC TGCTGGGCTC
GCGCGAGCGG CGCTGGCGGC GTTGGCGCTG GTCGGGGATC GGTTGGCTTT CGGGCGGCCG
TCGCTGTCGC TGCGGTCCGG CTGCGAGTTG ACCCGGATCA CCGAGACGGT GGCGTTCGAA
GTCGCCGGCG GGGAGAAGGA GCCGGTCGAG GTGTCGGTCG GTGACGCTGT CGCGGCGTTC
GTCCAGCTGC GGGCACAGGC GGGGGCGGCG GGTGTGCCGA TGGCAGACGA TGTGGTGGCT
GTGACGCCGA TTCGGCAGTT GCGCGAGGCG ATGGTGTACG CGCGCACCCA GGCTGTCCCA
GACTCCGAGG AGTAG
 
Protein sequence
MTGDELAARL IAAVGEQRRE SGVVVEAVYQ PVGGAGGKVM PPTFPVVERG GSPYLLEERW 
VDGDRVGTVV IDQVPSQANR VEEALLAARD TGRLSVPIFE MMVDGLRLTS LQFPHRYADA
YLRDSEVDGV RFDDSTAGKA LRSVTTRDVR PLYAREPYSL LFGAWDSHRR GRWPRFARLY
QSMMYGLDPI VGDRRSGRFD PLNLTGGVDN KNKAETDWRF LPEGQKAKGG RLSEIGHGHI
APNPAHGGVT VREVRRSAWI SFAGLERLRF GEVSEEAAGL ARAALAALAL VGDRLAFGRP
SLSLRSGCEL TRITETVAFE VAGGEKEPVE VSVGDAVAAF VQLRAQAGAA GVPMADDVVA
VTPIRQLREA MVYARTQAVP DSEE
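
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, assuming the gene sequence is given 5' to 3' in the coding orientation, as it appears above. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because the sequence starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGACAGGCG ACGAGTTGGC GGCACGGCTG"))
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the listed protein sequence gives a quick consistency check; `gc_fraction` can be compared with the GC content field in the same way.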