Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3037 |
Symbol | |
ID | 5707239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3446002 |
End bp | 3447180 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641272482 |
Product | peptidase M7 snapalysin |
Protein accession | YP_001537850 |
Protein GI | 159038597 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5640] Secreted trypsin-like serine protease |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.872515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAGAA GGCAACTGTT GCGCGCGTTC GGCGCCGTGC TGGCGGCGGT GCTGGCAGCG GCCGGGGTGC AGATCGCCAC CGGTGCGCCG GCCACAGCGG CTCGCACGGT CTACTACGAC GCGAGCCGGG CCGGCGAGTT CCGTACGAAC TTCGACCAAG CGGCCCAGAT TTGGAACAGT TCGGTCAGCA ATGTACGGCT GGCTCCACGC AGTCCAGGCA ACGTCACCAT CTACGTCGAC GGCGGCTGGC CGCGGGCGCA GGTCACCGGG CTCGGCTCTG GCCGGATCTG GATGGGATGG ACCGCGGTCA ACCAGGGATA CGACCGTACC CGCATCGCCA GCCATGAGTT CGGCCACATC CTCGGCCTAC CCGATCGGCG TACCGGGCTC TGCTCCGACC TGATGTCGGG CAGCAGTGCG GCGGTCTCGT GCGACAACGC GTACCCCAGC AGCGCCGAGG CGTCCCGGGT CGACTCGCTG TTCGCCGGCA GTCGCACGAC AAGTGTCACC GGTACGTTCA CCTGGGCTGA TGCCGATATC ACGCCGTTCG TGGTCGGCGG CCGGCCGGCG ACCGAGAACT ACCCGTGGCT GGTCTACACC TCTGGCTGCA CCGGTACGTT GATCAAGTCG GACTGGGTCG TCACGGCGCG GCACTGCCCG ACACCGTCGT CGGTCCGTGT GGGTAGCGTC AACCGCACCA GCGGTGGCAC GGTCGTCGGG GTCCGCCGCG CCGTCAGCAA CCCCACAATC GATGTCAAGC TGCTGCAACT GTCCAATGCG GTCTCGTACG CCCCGGCCCC GATCCCGATG ACGTCCGGAG AGGTCGGTAC CGCTACCCGG ATCATCGGCT GGGGTCTGAC CTGTCCGTTC CGGGGCTGCG GTTCGGCGCC GACGGTCGCA CACGAGCTGG ACACGTCGAT CCTGTCGGAC AGTCGCTGCA TCGGCATCAA CGGCCCGTAC GAGATCTGCA CCGACAACAC GAACGGTGAC TCGGGCGCCT GCTACGGCGA CTCGGGGGGC CCGCAGGTTC GTCAGATCGG TGGGGTGTGG TATCTGGTCG GTGCCACCAG CCGGTCGGGC AACAACCACC CGATCTGTGC CACCGGTCCA TCGATTTACG GTGACCTGAC GTCGATCCGT TCCTGGATCG ACACCCGGGT CGGCGGCCTT CCCGCCTGA
|
Protein sequence | MVRRQLLRAF GAVLAAVLAA AGVQIATGAP ATAARTVYYD ASRAGEFRTN FDQAAQIWNS SVSNVRLAPR SPGNVTIYVD GGWPRAQVTG LGSGRIWMGW TAVNQGYDRT RIASHEFGHI LGLPDRRTGL CSDLMSGSSA AVSCDNAYPS SAEASRVDSL FAGSRTTSVT GTFTWADADI TPFVVGGRPA TENYPWLVYT SGCTGTLIKS DWVVTARHCP TPSSVRVGSV NRTSGGTVVG VRRAVSNPTI DVKLLQLSNA VSYAPAPIPM TSGEVGTATR IIGWGLTCPF RGCGSAPTVA HELDTSILSD SRCIGINGPY EICTDNTNGD SGACYGDSGG PQVRQIGGVW YLVGATSRSG NNHPICATGP SIYGDLTSIR SWIDTRVGGL PA
|
| |