Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1471 |
Symbol | |
ID | 5704437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1700994 |
End bp | 1702196 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270979 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001536360 |
Protein GI | 159037107 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.284694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000211937 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTTCGG GTCACGGGCG GTCCGGGGCG GGGGAGGACT TCGGCGCTAC TCCCGCGGAC GATACCCGAG ATCGCCCCGG ACCGTGGCCG CCCGCCGTCG ATCGGCCCCC GGTGCAACCG CCGCCGATCG CTCCGGCGCC GCCCGCCGGT GTCGCTCACG GTACGCCTGA ACCCGCCCGA CCCACGGTTG CACCGCCCCG AACCACGTGG TCTCCGCCAC CCCAGCCAGG CCCGTCGGAG GACGCCCCCG CTGCCCCGGT CGCCGCAGCC GTCCCCGGCG TGCCGACGTA CGGCGGCCCC GGTACGTCCG GTGCCGCCTA TCCGCCGCTT CCCGGGTCGG GTGGCCCGCC GCCGGTCGGG CCGCCACCTG GCGGCCCGGG CGGGATCTGG CCCGGAGCAG CCCATCCGGT TCCCGTTCGG GCGCCCCGAC CGCTCGGCCT GCTCGGGTGG GTGGCCGTGC TGGCCATCGG GGCGCTACTC GTGGTGACCG GCGTCCAGGC GTACCAGATC CACCGGCTCA CTGATCGGCT CGCCGACACC GACCGGCGGC TGGTTGCCGG GCAGGAGGAC AGCCAGGCCC GGCTCGACGG GCTGGAGGCG CGGGCGAAGA CCCTGGAGTC AGAGGTCGGC GCCGCGTTCG ACCCGGAAGT CATCGCCGCC GCCGCACTGC CCAGCGTGTT TCGGGTGCGC GCCGGGCGGT TCACCGGCAG CGCGTTCGCG GTCGGCGAGT CGACCGCCGG TGGTACCAAT CTGTTCACCA ACTTCCACGT GGTCGAAGGG GTCTGGGACG ACGGGGACCG GGAGGTCTTC CTGGAGCGCA CCGATCAGCG CTTCCCGGCC ACCATCGTCG AGGTCGACAA GGACAACGAC ATCGCCCAGC TCCGCACGAC AGGCAAGTTC ACCGGCCTCA CCGCCGCACC TGGCGAGGTC AAGCCCGGGC AGCAGATCGT CGTCGTCGGT GCACCCCTGG GCTTGGAGGA CAGCGTCACC ACCGGTGTGG TGAGCGCGTT TCGCGAGGCC AAGGGTGGTG ACCCGGCAGC GATCCAGTTC GATGCTCCGA TCAACCCCGG CAACTCGGGT GGTCCGGTGG TCAACGGCGA GCGGCAGGTG GTCGGCATCG CCACCGCCAA GGCGCGTGAC GCCGAGGGGA TCGGCCTCGC CGTCCCGATC GGCACGGCCT GCGAGGCCTT CGACATCTGC TGA
|
Protein sequence | MTSGHGRSGA GEDFGATPAD DTRDRPGPWP PAVDRPPVQP PPIAPAPPAG VAHGTPEPAR PTVAPPRTTW SPPPQPGPSE DAPAAPVAAA VPGVPTYGGP GTSGAAYPPL PGSGGPPPVG PPPGGPGGIW PGAAHPVPVR APRPLGLLGW VAVLAIGALL VVTGVQAYQI HRLTDRLADT DRRLVAGQED SQARLDGLEA RAKTLESEVG AAFDPEVIAA AALPSVFRVR AGRFTGSAFA VGESTAGGTN LFTNFHVVEG VWDDGDREVF LERTDQRFPA TIVEVDKDND IAQLRTTGKF TGLTAAPGEV KPGQQIVVVG APLGLEDSVT TGVVSAFREA KGGDPAAIQF DAPINPGNSG GPVVNGERQV VGIATAKARD AEGIGLAVPI GTACEAFDIC
|
| |