Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2852 |
Symbol | |
ID | 5708242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3233728 |
End bp | 3234966 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641272307 |
Product | restriction endonuclease S subunits-like protein |
Protein accession | YP_001537677 |
Protein GI | 159038424 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0584008 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAC CGTGGCCGGT TTCTACAGTC GGCGAGCAGT TCGAGGTGCA CCTGGGCAAG ATGCTCGACT CGGCCAGGAA CGTTGGCTTC CCAAAACCGT ACGTCGGAAA TCGGGCAGTG CAATGGGGTT GGATCGACCT GTCGGCTGTT GGGGTCGCGC CACTGACGCA ATCAGACATA CGACGCTTTC GCCTGCGCAA CGGTGATCTT CTCGTCTGCG AGGGGGGTGA GATCGGCCGC GGCGCAATTT GGCGAGACCA GCTTTCTGAG TGTTACTACC AAAAGGCGCT GCACCGAATG CGTCCCAAGA ACGGTTATGA CGTTCGGCTG ATGCTGGCGC TGCTTGAATA CTGGTCGACA GGTGGAGTGT TCCCCAACTA CGTGACGCAG ACGAGCATCG CCCACCTTCC GCGCGACAAG TTCATTGAGA TGCCGTTGCC GCTGCCATCG GCAGCCGAGC AGGCCCGAAT CGGTGAGGTG ATCCAGGACG TCAATGACCT CATCCACGCC TTGCGGCGGA TGATCGCCAA GAAGCAGGCG ATCAGGCAGG GCCTGCGGCA ACAGTTACTG ACAGGCAGGA CGCGCCTTCC CGGATACAGC GGATCGTGGC GCGAGGTGTC GCTTGGCAGA TACGTCAGCT ACGTCAACAC GGTTGCACTG TCGCGGGCGC AGCTTGATGG CGAGTCGCCC GTCCGATACG TGCACTACGG CGACATCCAT GCTCGGGATA GCCCCATGTT GGACGCGGCG CGCGAGGCAC TGCCACGAGC GAGTTCGACG TTGTTGCGGA ACGCCGGCCG GCTCAAGGTG GGGGATCTCG TGTTCGCCGA CGTGTCTGAG GACCCGGACG GAGTGGGCAA GTCGGTCGAG GTGACCTCGG TACCCGATGT GGGAGTGGTT CCCGGTCTTC ACACCATCGC GGCACGGTTC GAGAAGGCGG TGTTGGCGGA CGGTTTCAAG GCGTACCTAC AGTTCGTACC GTCGTTTCGT GAAACTCTGC ACCGCCTTGT TGTGGGTACC AAGGTGCTGG CGACGACGCG TTCGCTCATT TCCAGTATCA CCCTGACGTT GCCGAACGTC GACGAGCAGC GCGCCATCGC GTCGGTCCTC ACGGATGCTG ATCGTGAGAT CGCTGTTCTC CGCGTTCGGC TGGCGAAGGC GAGGGATGTC AAGCAGGGCA TGATGCAGGA GCTGCTCGCA GGCCGTACGC GGTTGCCCGG AACGGGGAGC ACGGCATGA
|
Protein sequence | MTQPWPVSTV GEQFEVHLGK MLDSARNVGF PKPYVGNRAV QWGWIDLSAV GVAPLTQSDI RRFRLRNGDL LVCEGGEIGR GAIWRDQLSE CYYQKALHRM RPKNGYDVRL MLALLEYWST GGVFPNYVTQ TSIAHLPRDK FIEMPLPLPS AAEQARIGEV IQDVNDLIHA LRRMIAKKQA IRQGLRQQLL TGRTRLPGYS GSWREVSLGR YVSYVNTVAL SRAQLDGESP VRYVHYGDIH ARDSPMLDAA REALPRASST LLRNAGRLKV GDLVFADVSE DPDGVGKSVE VTSVPDVGVV PGLHTIAARF EKAVLADGFK AYLQFVPSFR ETLHRLVVGT KVLATTRSLI SSITLTLPNV DEQRAIASVL TDADREIAVL RVRLAKARDV KQGMMQELLA GRTRLPGTGS TA
|
| |