Gene Information |
Locus tag | Pnec_0410 |
Symbol | |
ID | 6183070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. necessarius STIR1 |
Kingdom | Bacteria |
Replicon accession | NC_010531 |
Strand | + |
Start bp | 364499 |
End bp | 365923 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641671103 |
Product | protease Do |
Protein accession | YP_001797302 |
Protein GI | 171463189 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
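
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, not statements from the record, although they do reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length_bp = gene_length(364499, 365923)
print(length_bp)                  # 1425
print(protein_length(length_bp))  # 474
```

Under these assumptions, the listed Gene Length (1425 bp) and Protein Length (474 aa) agree with the coordinates.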
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.134213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATTT TTAGTCTTGG GCAATTTGCA TTTATTCCAT CTGTCTCCGC GCAAAATCCT CGGGTAACGA TCCCCGATTT TGTCGATTTG GTTGAGCGCG CTAGTCCCGC TGTTGTGAAT ATTCGCACCA CCGAAAAAGT CGTAACGCAA CAGACTCAAG GCGGCATTCC TGGAATGCCT GAAGATCAAG CGGAATTTTT CCGTCGCTTC TTTGGGATGC CTATTCCTGG AATTCCAAAT GGTCCCAAAC AAACGCAACC AAATCCAGGC AAGCCACAAG AAGCAGATCG CGGTGTGGGT TCTGGTTTCA TTATCGAATC GAATGGTTTG ATTTTGACAA ACGCGCACGT GGTCGAGGGT GCTACAACTA TTTATGTGAC CTTGACTGAT AAGCGTGAGT TCAAGGCGAA GTTACTGGGT ATAGACAAGC GTACTGACGT AGCGGTTGTC AAAATTGAAG CACGTGATTT GCCGAAGCTA CCTTTGGGGG ATTCTTCTAA GGTACGTGTT GGTGAGTGGG TTCTGGCGAT TGGATCACCG TTTGGTCTTG AGAATACAGT CACCGCAGGG ATTGTGTCTG CTAAGAGTCG TGATACTGGC GACTACTTAC CATTTATCCA AACTGACGTT GCGGTTAACC CAGGAAACTC TGGTGGCCCG CTCTTAAATA CTGCCGGTCA AGTTATTGGT ATTAATTCAC AAATTTTTAG TCGCTCTGGC GGCTACATGG GAATTTCATT CGCCATTCCA ATTGACGAGG CGATGCGTGT TGCAGATCAG TTGCGTACCA ATGGCAAGAT GACGCGTGGT CGTATTGGTG TTGCTTTGGG TGAGATGATC AAGGAAGTGG CTGAGAGTTT AGGTTTGGGT AAACCTCGGG GTGCTTACGT ACGCAATGTT GAGCCAGGCG GTCCGGCGGC GGCTGGTGGA ATTGAGGCCG GCGATGTGAT TCTGAGTTTT AATGGCCGCG ATATTTCTAA GTCAGCTGAT TTACCAAGAG TGGTTGGTGA AACCAAGCCT GGAACTTCAG TGCTGGTACA GGTTTGGCGT AAGGGTGGTA CACGTGATTT GACTGTGACC GTAAGTGATA CAGAATCGAC TCAGGCTGCA AATAAGAAGC CAGATGCCCC AGCAGCAAAT GGCAATAGTG CAAATGCCCT TGGGGTTGCT GTAGGTGAGC TATCAGATGC CAAAAAGAAG GATTTGAATA TCAAAGGTGG GGTTGAGGTC ACTGGTTTAG GGGATGGTCC CCTGGCTAAG GCTGGAATTC GGCCTGGTGA TGTCATTATT CGGGTTGCAG ATGCCGATAT TACAGGTGTT AAGCAGTTTA AATCTTTGGT AAAGGGCTTA GATGCCAACA AGGCCGTTCC GGTCTTTATA TGCCGCGCTG ACAGCACTTT GGTAGTCCCC GTAAGATCCA AATAA
|
Protein sequence | MTIFSLGQFA FIPSVSAQNP RVTIPDFVDL VERASPAVVN IRTTEKVVTQ QTQGGIPGMP EDQAEFFRRF FGMPIPGIPN GPKQTQPNPG KPQEADRGVG SGFIIESNGL ILTNAHVVEG ATTIYVTLTD KREFKAKLLG IDKRTDVAVV KIEARDLPKL PLGDSSKVRV GEWVLAIGSP FGLENTVTAG IVSAKSRDTG DYLPFIQTDV AVNPGNSGGP LLNTAGQVIG INSQIFSRSG GYMGISFAIP IDEAMRVADQ LRTNGKMTRG RIGVALGEMI KEVAESLGLG KPRGAYVRNV EPGGPAAAGG IEAGDVILSF NGRDISKSAD LPRVVGETKP GTSVLVQVWR KGGTRDLTVT VSDTESTQAA NKKPDAPAAN GNSANALGVA VGELSDAKKK DLNIKGGVEV TGLGDGPLAK AGIRPGDVII RVADADITGV KQFKSLVKGL DANKAVPVFI CRADSTLVVP VRSK
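
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a hedged illustration rather than the method the database itself uses: it assumes the sequences are given 5' to 3' on the coding strand, removes the spaces that separate the 10-character blocks, and uses the standard amino acid assignments (translation table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGACTATTT TTAGTCTTGG GCAATTTGCA"))  # MTIFSLGQFA
# Last 15 bases of the gene sequence, including the TAA stop codon.
print(translate("GTAAGATCCA AATAA"))                  # VRSK
```

Passing the full gene sequence to `translate` and `gc_content` would let the listed protein sequence and GC content be checked the same way.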