Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_2934 |
Symbol | |
ID | 3674289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | + |
Start bp | 3170987 |
End bp | 3172111 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637714499 |
Product | trypsin-like serine protease |
Protein accession | YP_319536 |
Protein GI | 75677115 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACTA TACGTCCATT CGCAGCGTTG CGGCCCATTC TTCTCATCTT GATCATTGTC GGGACGTTGC TGCCCTGGTG TCCAGCCGCG GCTCAAATCC CTGATCTGGG TAGTGGGCGG GTCCCGACAC TTGCTCCGCT GGTACGCGAA GTCACGCCGG CGGTTGTGAA TATCTCCGTT CGTGGCCGCG TGAAGGAGGA CAATCCTCTT TATCGCGATC CTTTCTTTCG CGATTTTTTT GACTTGCCGC GACAACTGGA GAGAGAGGTC CAGGCCACGG GATCGGGCGT CATCGTCGAT GCGCAACGCG GATATATCCT CACCGCCAAC CATGTCGTGG CGCAAATATC GACGGCTCAA ATCACGACAA AAGACGGCAG AAGGTTCTCG GCCGGGCTGA TCGGGCGCGA TCCAGGAACT GATATCGCGG TGCTCCAGAT CAAGCGCGGA AACAATCTCA AGGCTATTCG GTTGGGCGAC AGCGACAAAC TGGAGGTCGG CGACTTCGTG ATAGCTGTCG GCAACCCCTT CGGCCTGGGT CAGACGGTGA CCTCCGGCCT TGTGAGCGCG CTGGGTCGCA CCGGCCTTGG CAAGCATGGC TACGAGGACT TCATTCAAAC CGACGCGCCA ATCAATCCCG GCAACTCTGG CGGAGCATTG ATCAATTTGA AGGGCGAACT CGTCGGAATC AACACCGCGA TCATTTCACC GGGCGGCGGC AACATCGGAA TCGGCTTTGC CGTACCGATC AATATGGCAC GGCAGGTCAT GGAGCAGATC GTCGAATACG GCGTCGTGCG GCGCGGCCGT ATCGGCGTAT CGGTACAGGA TCTGAGCACG GTCAGTTCGG AACCACAAGC TACCGGAAGG AGCGAGGGCG CTCTGATCGC CAGTGTCCTC CGAGGATCTT CGGCCGAGGC GGCTGGCATT CGCAGAGGCG ATATCATCGT CGCTGCGGAC GGCGTTCCCA TTCGTAGTGC GGCGCAACTT CGCAATAAGA TCGGACTGGC CCGTATTGGC GAGCGCGTGC AGCTGACATT CGACCGCAGG GGCACTTTAC ACACAGTCGT CGTCGAGATC TCCGCTACAA GGAGTGCTCC GGCGGCGTCT TCACTGAGAA GATGA
|
Protein sequence | MITIRPFAAL RPILLILIIV GTLLPWCPAA AQIPDLGSGR VPTLAPLVRE VTPAVVNISV RGRVKEDNPL YRDPFFRDFF DLPRQLEREV QATGSGVIVD AQRGYILTAN HVVAQISTAQ ITTKDGRRFS AGLIGRDPGT DIAVLQIKRG NNLKAIRLGD SDKLEVGDFV IAVGNPFGLG QTVTSGLVSA LGRTGLGKHG YEDFIQTDAP INPGNSGGAL INLKGELVGI NTAIISPGGG NIGIGFAVPI NMARQVMEQI VEYGVVRRGR IGVSVQDLST VSSEPQATGR SEGALIASVL RGSSAEAAGI RRGDIIVAAD GVPIRSAAQL RNKIGLARIG ERVQLTFDRR GTLHTVVVEI SATRSAPAAS SLRR
|
| |