Gene Sare_2852 details


Gene Information

Locus tag: Sare_2852
Symbol:
ID: 5708242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3233728
End bp: 3234966
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 62%
IMG OID: 641272307
Product: restriction endonuclease S subunits-like protein
Protein accession: YP_001537677
Protein GI: 159038424
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0584008
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAC CGTGGCCGGT TTCTACAGTC GGCGAGCAGT TCGAGGTGCA CCTGGGCAAG 
ATGCTCGACT CGGCCAGGAA CGTTGGCTTC CCAAAACCGT ACGTCGGAAA TCGGGCAGTG
CAATGGGGTT GGATCGACCT GTCGGCTGTT GGGGTCGCGC CACTGACGCA ATCAGACATA
CGACGCTTTC GCCTGCGCAA CGGTGATCTT CTCGTCTGCG AGGGGGGTGA GATCGGCCGC
GGCGCAATTT GGCGAGACCA GCTTTCTGAG TGTTACTACC AAAAGGCGCT GCACCGAATG
CGTCCCAAGA ACGGTTATGA CGTTCGGCTG ATGCTGGCGC TGCTTGAATA CTGGTCGACA
GGTGGAGTGT TCCCCAACTA CGTGACGCAG ACGAGCATCG CCCACCTTCC GCGCGACAAG
TTCATTGAGA TGCCGTTGCC GCTGCCATCG GCAGCCGAGC AGGCCCGAAT CGGTGAGGTG
ATCCAGGACG TCAATGACCT CATCCACGCC TTGCGGCGGA TGATCGCCAA GAAGCAGGCG
ATCAGGCAGG GCCTGCGGCA ACAGTTACTG ACAGGCAGGA CGCGCCTTCC CGGATACAGC
GGATCGTGGC GCGAGGTGTC GCTTGGCAGA TACGTCAGCT ACGTCAACAC GGTTGCACTG
TCGCGGGCGC AGCTTGATGG CGAGTCGCCC GTCCGATACG TGCACTACGG CGACATCCAT
GCTCGGGATA GCCCCATGTT GGACGCGGCG CGCGAGGCAC TGCCACGAGC GAGTTCGACG
TTGTTGCGGA ACGCCGGCCG GCTCAAGGTG GGGGATCTCG TGTTCGCCGA CGTGTCTGAG
GACCCGGACG GAGTGGGCAA GTCGGTCGAG GTGACCTCGG TACCCGATGT GGGAGTGGTT
CCCGGTCTTC ACACCATCGC GGCACGGTTC GAGAAGGCGG TGTTGGCGGA CGGTTTCAAG
GCGTACCTAC AGTTCGTACC GTCGTTTCGT GAAACTCTGC ACCGCCTTGT TGTGGGTACC
AAGGTGCTGG CGACGACGCG TTCGCTCATT TCCAGTATCA CCCTGACGTT GCCGAACGTC
GACGAGCAGC GCGCCATCGC GTCGGTCCTC ACGGATGCTG ATCGTGAGAT CGCTGTTCTC
CGCGTTCGGC TGGCGAAGGC GAGGGATGTC AAGCAGGGCA TGATGCAGGA GCTGCTCGCA
GGCCGTACGC GGTTGCCCGG AACGGGGAGC ACGGCATGA
 
Protein sequence
MTQPWPVSTV GEQFEVHLGK MLDSARNVGF PKPYVGNRAV QWGWIDLSAV GVAPLTQSDI 
RRFRLRNGDL LVCEGGEIGR GAIWRDQLSE CYYQKALHRM RPKNGYDVRL MLALLEYWST
GGVFPNYVTQ TSIAHLPRDK FIEMPLPLPS AAEQARIGEV IQDVNDLIHA LRRMIAKKQA
IRQGLRQQLL TGRTRLPGYS GSWREVSLGR YVSYVNTVAL SRAQLDGESP VRYVHYGDIH
ARDSPMLDAA REALPRASST LLRNAGRLKV GDLVFADVSE DPDGVGKSVE VTSVPDVGVV
PGLHTIAARF EKAVLADGFK AYLQFVPSFR ETLHRLVVGT KVLATTRSLI SSITLTLPNV
DEQRAIASVL TDADREIAVL RVRLAKARDV KQGMMQELLA GRTRLPGTGS TA
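
The numeric fields in this record can be cross-checked against the sequences. The Python sketch below is illustrative only and is not part of the source database. The helper names (clean, gc_percent, translate) are hypothetical. It assumes the sequence is pasted in the 10-base block layout shown above, that the CDS is in frame, and that it starts with ATG, so the alternative start codons permitted by translation table 11 need no special handling.

```python
from itertools import product

# Codon order used by the NCBI genetic code tables: T, C, A, G.
BASES = "TCAG"
# Amino acids for translation table 11 in that codon order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate an in-frame CDS and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Coordinates are inclusive, so the gene length is End - Start + 1.
gene_length = 3234966 - 3233728 + 1
# One codon is the stop codon and is not counted in the protein length.
protein_length = gene_length // 3 - 1

# First 30 bases of the gene sequence, copied from the record above.
first_block = clean("ATGACGCAAC CGTGGCCGGT TTCTACAGTC")
print(gene_length, protein_length, translate(first_block))
```

Passing the full gene sequence through clean and translate allows a comparison with the listed protein sequence, and gc_percent on the cleaned gene sequence can be compared with the GC content field.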