Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_0744 |
Symbol | |
ID | 3674359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | - |
Start bp | 829993 |
End bp | 830964 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637712296 |
Product | trypsin-like serine protease |
Protein accession | YP_317362 |
Protein GI | 75674941 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCAT TGACGCAATG GAAGGTGCCA AGCTCCGCTC AGCCGCGTCC GGAGGACTAT CGTTTCGATC TGGACCGCAC GCTGTCCGCC GTCGTCGGGC TGCATTCCTT CGTTCCGCCC GATGCCCTCA CGGCGGACAC CCTCGGCACC GAGCGGGCCG GGAACGGCGT CGTGATCGAC GACGGCCTCG TTCTGACGAT TGGCTACGTC ATCACCGAGG CGGAAACCGT ATGGCTGCAT CTCGCGGACG GCCGCGTGGT TCAGGCGGAC ACGGTCGGCG TCGATCGGGA AAGCGGCTTC GGCCTGGTGC AGGCACTCGG TGATCTCGAT CTCGAACCGC TGCCGCTGGG TTCGTCGGCC GCCGCCAGCG TCAATGACCC GGTCGTGGTC GCCGGAGTCG GCGGACGCTC CCGTTCGCTC GCCGCCCGGA TCATCTCAAG GGAGGAGTTC GCCGGCTACT GGGAATACCT CATCGATCAA GCCATCTTCA CCCGCCCCGC GCACCCCAAC TGGGGCGGCA CCGGCTTGAT CTCGGCCTCG GGCGATCTGA TCGGAATCGG CTCGCTGCAA CTCGAGCGGG AGGAAGGCGG CCGCAGCGAA CATCTCAACA TGAACGTGCC GATCGATCTG CTGAAGCCCG TCCTCGACGA CCTGCGGAAG TTTGGAAAGG TCAACAGGCC CGCCCGCCCC TGGCTCGGGA TCTACGTCGC CGAGGTCGAA AACAGGGTGA CCGCCGTCGG CATCGTACCG AAAGGACCGG CAGATCGCGC CGAACTGCGC GCCGGCGACG CGATCCTCGC GCTGAAGGGA GAAAAAGTAA CGGACGCGGC CCAGCTCTAT CGCAAATTGT GGGCGCTGGG CCCCGCGGGC GTCGACGTGC CGTTGACGCT TCATCGCGAA GGAGACACAT TCGACGTGGT GGTGACATCC ACCGACCGCG CGCGGATGCT GAAGAAGCCG AGACTCCACT GA
|
Protein sequence | MASLTQWKVP SSAQPRPEDY RFDLDRTLSA VVGLHSFVPP DALTADTLGT ERAGNGVVID DGLVLTIGYV ITEAETVWLH LADGRVVQAD TVGVDRESGF GLVQALGDLD LEPLPLGSSA AASVNDPVVV AGVGGRSRSL AARIISREEF AGYWEYLIDQ AIFTRPAHPN WGGTGLISAS GDLIGIGSLQ LEREEGGRSE HLNMNVPIDL LKPVLDDLRK FGKVNRPARP WLGIYVAEVE NRVTAVGIVP KGPADRAELR AGDAILALKG EKVTDAAQLY RKLWALGPAG VDVPLTLHRE GDTFDVVVTS TDRARMLKKP RLH
|
| |