Gene Information |
Locus tag | Sare_1138 |
Symbol | |
ID | 5704282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1288848 |
End bp | 1289714 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270653 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001536037 |
Protein GI | 159036784 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
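The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are both inclusive, 1-based positions and that the CDS ends with a single stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1288848
END_BP = 1289714


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming both coordinates are inclusive."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS: codon count minus one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 867 288
```

Under these assumptions the result agrees with the listed Gene Length (867 bp) and Protein Length (288 aa).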
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.473362 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000326844 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGTGC AGTCCGGCGG GTTGGGCGAG CCGCGTGGCC CCTGGTTCAT CTCGCCCGAG TTGGGCCCGG ACGGGCGGCC CCGACTGGAC GAGCCCGGAT CGGTTCGGGT GGGCTCGCCG CGACGCTGGC ACAACCGGGT ACTGGCCGGG CTGGCGGTGG TGGCCCTGTC CAGCGTCTCC GGCGCTGCGG CGGGTGGCCT GGTGGCAAGC CAGGACGGGA CGCCCGGGGC AGCTCCCGCC TCGGCCGCAC CGGTGCCGGC GGAGCTGGTG ACCGCTGCCG AGCAGACCGT CCCGGGAGTG GTGTCGGTGC TGGCCGCTGG TGCCGATGGC GCGTCCGCGA CAGGTTCCGG CTTCGCCGTC GACGACCAGC AGCACATCAT CACCAACGAC CACATCCTGG CGAAGGGCCG CAGTGACTCG GTGATGGTGG AGTTGCCGGA CGGGCGACGG TTCGCCGCCG AGGTTGCGGG CCGGGAGCCT CGTAGCGACC TCGCGGTGTT GCGGGTGCCG CCGTCCGCGG GCTTGGCGGC GTTGCCGCTG GCGAAGCCGG GGACGACCCG GGTCGGCGAG CCGGTGCTGG CCGTGGGGTC GCCCCTCGGC CTTGCCGGCA CCGTGACCGC CGGCATCGTC AGCGCCGTGG ACCGGCAGGT CCGCCTCGGT GACAACCGGC ACACGGCGGT GCAGACGGAC GCCTCGATCA ACCCCGGTAA CTCGGGCGGG CCACTGGTGA ACGCCCGGGG TGAGGTGGTT GGGGTGAACA CGGCGATCGC CACGATCGAC GGGAACGGCT CGATCGGCAT CGGCTTCGCG ATCCCCATCG AGCAGGTCCA ACAGACCGCG GACACGATCA TCGGGAAGGG CGGCTGA
|
Protein sequence | MAVQSGGLGE PRGPWFISPE LGPDGRPRLD EPGSVRVGSP RRWHNRVLAG LAVVALSSVS GAAAGGLVAS QDGTPGAAPA SAAPVPAELV TAAEQTVPGV VSVLAAGADG ASATGSGFAV DDQQHIITND HILAKGRSDS VMVELPDGRR FAAEVAGREP RSDLAVLRVP PSAGLAALPL AKPGTTRVGE PVLAVGSPLG LAGTVTAGIV SAVDRQVRLG DNRHTAVQTD ASINPGNSGG PLVNARGEVV GVNTAIATID GNGSIGIGFA IPIEQVQQTA DTIIGKGG
|
| |
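The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes the GC fraction, and translates a CDS codon by codon. It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard code, and it does not handle alternative start codons (not needed here, since this gene starts with ATG). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the Gene sequence field, copied from the record.
prefix = clean_sequence("ATGGCAGTGC AGTCCGGCGG GTTGGGCGAG")
print(translate(prefix))  # prints: MAVQSGGLGE
print(round(gc_fraction(prefix), 2))
```

Passing the full Gene sequence field through `clean_sequence` and `translate` gives a string that can be compared against the Protein sequence field, and `gc_fraction` gives a value to compare against the listed GC content.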