Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NSE_0166 |
Symbol | |
ID | 3932058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | + |
Start bp | 133504 |
End bp | 134925 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637900322 |
Product | periplasmic serine protease |
Protein accession | YP_506063 |
Protein GI | 88608224 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.358427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGAA ATAGGTTTTA TGAGATATTT TTCCTAGCTG TACTTTTGAA CTGTTCTTTT GCTGCTGGGG CTGTGCCATC TGAGGGATTT TCTGATGTTG TCTCAAGGCT TGCTCCTGCG GTTGTGAATA TTTCCAGCGA ATACAGGTTA AGGATGGATA ACCAGGGTCT GTGTGGTAAC CCGGCTATTT TGGAAGAGTT TTCTGAATTT TGTGAGCGCC TTAAGCCGTT TTTTCGCAAC AAGAATCCGG GTAAGAAGTA TGGGACATCT TTGGGTTCTG GCTTTTTGAT TTCCGATGAT GGGCTTATAG TTACGAATTA TCATGTCATT GCGAATGCTG ATAAAATCAG GGTCGTTCTG AGTCAGTGTA GTGAAGCGTG CCAACAGTAT GAGGCTACGG TGATTGGGTA TGATAAAAAA ACCGACCTTG CTGCGCTAAA AATTTCTGGA GTAAGTGGTC TTCCATATTT GCGTTTTGGT GATTCGTCTA AAATGAGGCC CGGGGACTGG GTGATAGCAG TTGGTAATCC TTTTGGTTTA GGGGGCTCTG TCAGTGCTGG AATTGTTTCA GCGATCAGTA GAGAGATCGG TCTATCTCAG AACAGCGATT TTATACAGAC GGACGTTGTG CTCAATTCTG GTAATTCTGG AGGACCGCTT TGTAATGCAA AGGGTGAAGT AATTGGTGTA AATACGGCTG CTGTGTATTC TAATGGTGGA AGTGCAGGGA TTGGTTTTGC CGTGCCATCG AATGTTGCGA AGCCAGTGAT AGAAGCTCTA GCTAAGGGTA AGCAGATTCA GCGTGGATGG ATAGGGATTG TCATCCAAGA GATCACAAAC GAAACAAAAG ACTCACTTGG TGGAGACTTA TCCGGTGTTC TGGTGGCGAG TGTTGAAAAA GATGGACCTG CGTATAAAGC TGGGATGAGG GTTGGAGACG TTATTACAGC TGTGAACGGG GAGAAAATTA GCGGTTCAAG GAGATTAGTA AGGGAAGTTT CCGGGCGGAG AATAGGTGAT ACAATAGAAC TGTCTGTAGT CAGGGATGCT CTTAAAAATA AGGAGACGGT GTCTTTGAAG GTAAAAATTG AAAAAACACC GCAAAGGTAT GCAGACGATG GGGCATCGCA GTTAGAGGTC ATAGGATTAG TGGTTTCCAA TCTGACTGAT ACGATTAGAA ATTCGTTTGG GCTTGGTGCT AGCATTGAAG GGGTGGTGGT CTTGGCCGTT GATCCTGACA AAGAGAGTTT TCTCAAAGCT GGGGATATAA TTATTGGGGT TGGTACCAAC AGGCAGATTT CTACTGTCCA GGAGTTTAAA CAGCACATAG ACGAAGCAAA GAAAAAGGGG CAAAGGTCGC TCCTAATGCT GATAAATCGT GGTAAGCAGA CGATCTTTGC GGCTGTAGGC TTGGATAACT AG
|
Protein sequence | MRRNRFYEIF FLAVLLNCSF AAGAVPSEGF SDVVSRLAPA VVNISSEYRL RMDNQGLCGN PAILEEFSEF CERLKPFFRN KNPGKKYGTS LGSGFLISDD GLIVTNYHVI ANADKIRVVL SQCSEACQQY EATVIGYDKK TDLAALKISG VSGLPYLRFG DSSKMRPGDW VIAVGNPFGL GGSVSAGIVS AISREIGLSQ NSDFIQTDVV LNSGNSGGPL CNAKGEVIGV NTAAVYSNGG SAGIGFAVPS NVAKPVIEAL AKGKQIQRGW IGIVIQEITN ETKDSLGGDL SGVLVASVEK DGPAYKAGMR VGDVITAVNG EKISGSRRLV REVSGRRIGD TIELSVVRDA LKNKETVSLK VKIEKTPQRY ADDGASQLEV IGLVVSNLTD TIRNSFGLGA SIEGVVVLAV DPDKESFLKA GDIIIGVGTN RQISTVQEFK QHIDEAKKKG QRSLLMLINR GKQTIFAAVG LDN
|
| |