Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_1195 |
Symbol | |
ID | 3674423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | + |
Start bp | 1301419 |
End bp | 1302969 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637712745 |
Product | peptidase S1C, Do |
Protein accession | YP_317809 |
Protein GI | 75675388 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC TGCCGTCTTC CCAGTTAAAG CCCATTCGTT CTGGGGTCAC CGCGCGCAGG TTCGCGTTGA TGGCGTCTGT CGTCGCGGGT GTTGGCGTAG CCGCTTACGG GCTAAGCCCT TCCGCAGACA AGGCCGAGTT CTTCATAAGT CCCGCTTATG CGCAGGTCAA CACCGACGTG AGCAAGGTCG CGCAGCCGAT CGGCTTTGCC GACATCGTCG AGCGGGTGAA GCCGTCGGTA ATTTCGGTCA AGATCCGTAT GAAGGACAGG GCGGTGGCCA GCGGGGACGA TTCGCCGTTC CAGCCGGGCT CGCCCATGGA GCGGTTTTTC CGCCGCTTTG GCGGACCGGA TGGCCTACCG CCAGGAATGC GCGGCCGGCG CGGTCCTGGC CTCGTGACAG GGCAGGGCTC CGGCTTCTTC ATCTCGGCTG ATGGGTATGC TGTGACCAAC AACCATGTCG TCGACGGCGC GGACAAGGTC GAAGTCACGA CCGACGACGG CAAGACCTAC ACCGCCAAGG TGATCGGGGC TGATCCGCGC ACGGACATTG CCCTGATCAA GGTCGAGGGC GGCTCCAGTT TCCCGTTTGC GAAACTGTCG GATGACAAGG TGCGGATCGG CGATTGGGTT CTCGCGGTCG GCAATCCGTT CGGTCTTGGC GGTTCGGTGA CCGCCGGCAT CGTGTCGGCC AGCGGCCGTG ACATTGGCAC CGGCCCCTAT GACGACTTCA TCCAGATTGA CGCCCCGGTC AACAAAGGTA ATTCCGGCGG CCCGGCTTTC AACATGAAGG GCGACGTCAT CGGCGTGAAT ACCGCGATCT ATTCGCCGTC CGGCGGCAGC GTCGGCATTG CGTTCTCGAT TCCCGCGAAT ACGGTGAAGA GTGTCGTTGC TCAGCTCAAG GACAAAGGGT CGGTGAGCCG CGGATGGATC GGCGTCCAGA TCCAGCCGGT GACTTCCGAC ATCGCCGACA GCCTCGGTCT CAAGAAAGCC GAAGGCGCGC TGGTGGCTGA GCCGCAGGCC AACAGTCCGG CGGCCAAGGC GGGTATAAAG TCCGGCGACG TGATCACGGC CGTCAATGGT GATCCGGTCA AGGACGCGCG CGAACTGGCG CGGACCATCG GTAGCTTGGC GCCGGGAACG TCGATCAAGC TCAACGTCCT GCGCAAGGGC ACGGACAAGG TCGTCAACCT GACGCTTGGC ACTCTGCCCG ACACCGTCGA AGCCCGAGCT GACCACTATG GCGATCGTGG CTCATCGCGC GACAACGATA TTCCCAGGCT TGGTTTGACC GTTGCGCCGG CAGGATCGGT TGCGGGTGCC GGCGGGGAAG GGGTCGTCGT GGTTGACGTG GATCCGAAAA GCCCCGCGGC CGAGCGCGGG TTCAGGGAAG GCGACGTGAT TTTGGAGGTC GCGGGAAAGA GCGTGTCCAG CTCCGCCGAC ATACGCGAGG CAATCAAGGC GGCGCGCGCG GAAGACAAGA ACAGTGTTTT GATGCGGGTT CGCACCGGCA GCACGGCGCG TTTCGTCGCA ATGCCAATTG CGAAGGGTTA A
|
Protein sequence | MTDLPSSQLK PIRSGVTARR FALMASVVAG VGVAAYGLSP SADKAEFFIS PAYAQVNTDV SKVAQPIGFA DIVERVKPSV ISVKIRMKDR AVASGDDSPF QPGSPMERFF RRFGGPDGLP PGMRGRRGPG LVTGQGSGFF ISADGYAVTN NHVVDGADKV EVTTDDGKTY TAKVIGADPR TDIALIKVEG GSSFPFAKLS DDKVRIGDWV LAVGNPFGLG GSVTAGIVSA SGRDIGTGPY DDFIQIDAPV NKGNSGGPAF NMKGDVIGVN TAIYSPSGGS VGIAFSIPAN TVKSVVAQLK DKGSVSRGWI GVQIQPVTSD IADSLGLKKA EGALVAEPQA NSPAAKAGIK SGDVITAVNG DPVKDARELA RTIGSLAPGT SIKLNVLRKG TDKVVNLTLG TLPDTVEARA DHYGDRGSSR DNDIPRLGLT VAPAGSVAGA GGEGVVVVDV DPKSPAAERG FREGDVILEV AGKSVSSSAD IREAIKAARA EDKNSVLMRV RTGSTARFVA MPIAKG
|
| |