Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4756 |
Symbol | |
ID | 5705347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5385246 |
End bp | 5386424 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641274154 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001539500 |
Protein GI | 159040247 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00204971 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00219208 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCCGCCG TGGATCTCGT ACTGCTGTTA CTCATGCTCG TGTTCGCGAT CAGCGGATAC CGCCAGGGCT TCGTCACCGG TGTGCTGTCG TTCACGGGGT TCTTCCTCGG AGCGCTGGCT GGTCTGCAGA TCGGTCCCCT GCTCGCGCAA CAGTTCCTCG ACGGCGGTAC CCGGGTGTTG ATCTCACTGG TGACGGTCTT CGGGCTGGCG GTGATGGGGC AGGCGCTCGC CGGCTGGCTC GGTTCCCACC TGCGACGCAC GATCACCAGC GACGTCGGGC GACAGGCCGA CGACATCGGC GGCGCATTCG TTTCACTGAC CGCCGTGCTT CTGGTCGCCT GGCTGGTCGC AGTCCCCCTG GGCTCGTCGT CCCTGCCCTG GCTGGCCGCC GCGGTCCGCA ACAGCGCGCT GCTCACCGTG GTGGACCGGG TGCTGCCCGA CCAGGCGCAG GAGTTGTCCA ACGCGCTGCG GGAGACCGTC GACACCAACG GCTTCCCGGA CGTCTTCGGT GACCTGGCAC CCACCCGTGC CCGGCAGGTC GCTCCACCCG ACCCGGCCCT CGCCGGCTCA CAGGTGGTGG CCGACAGCCG ACGCGCGGTG GTCAAGGTGC TCGGCTCCGC CCCGGACTGC TCCCGTCGCA TCGAGGGCTC CGGCTTCGTC TACGCCGACG ACCGGGTGAT GACCAACGCG CACGTGGTGG CCGGAACCCG CTCCACCGTC GTCGAACTGA ACGGCGATCG GTACGACGCT CGAGTGGTGG TGTACGACCC GGACCGGGAC CTGGCGGTCC TGTACGTTCC CGGCTTGCCC GGCCCGTCCA TGCGCTTCGC CGCCGGCAAT GCGAGTAGCG GCACCGACGC GATCGTGCTC GGCTTCCCGC TCGACGGCCC GTACAACGCG CAGTCGGCGC GGGTCCGGGA TGTCGACCAG ATCACCGGGC CCGACATCTA CTCCAGCGGG AACGTGACCC GTGAGGTGTA CACCATCCGG GCCCTGGTGC AGAGCGGCAA CTCCGGCGGC CCGCTGGTGT CGACGAACGG CCTGGTACTC GGGGTGATCT TCGCGGCGGC GGCCGACGAC CCGAACACCG GCTTCGCGGT GACCGCAGCC GAGGCCCGCC CGGTCGCCCT GGCCGGAGCC GCACGCATCC GCGGCGTCGG CACCGGCGAG TGCACCTGA
|
Protein sequence | MSAVDLVLLL LMLVFAISGY RQGFVTGVLS FTGFFLGALA GLQIGPLLAQ QFLDGGTRVL ISLVTVFGLA VMGQALAGWL GSHLRRTITS DVGRQADDIG GAFVSLTAVL LVAWLVAVPL GSSSLPWLAA AVRNSALLTV VDRVLPDQAQ ELSNALRETV DTNGFPDVFG DLAPTRARQV APPDPALAGS QVVADSRRAV VKVLGSAPDC SRRIEGSGFV YADDRVMTNA HVVAGTRSTV VELNGDRYDA RVVVYDPDRD LAVLYVPGLP GPSMRFAAGN ASSGTDAIVL GFPLDGPYNA QSARVRDVDQ ITGPDIYSSG NVTREVYTIR ALVQSGNSGG PLVSTNGLVL GVIFAAAADD PNTGFAVTAA EARPVALAGA ARIRGVGTGE CT
|
| |