Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_2346 |
Symbol | |
ID | 3674492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | - |
Start bp | 2566954 |
End bp | 2568444 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637713910 |
Product | peptidase S1C, Do |
Protein accession | YP_318952 |
Protein GI | 75676531 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.466246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGAG CGATGCAAGC ATTGAGTCTT CGCCGGCGTT TCCTTCTGAG CGCGTTTTGC CTCGGCGTCG CGAGCGCCGT GGTATCCGCG CCGGCGCTGG CCCGCGGTCC CGATGGCATC GCCGACGTGG CGGAAAAGGT GATCGACGCG GTGGTCAATA TCTCGACGAC CCAAAAGGTG GAAGCCAAGA GCGGCGAGGG CCGCCGCATG ATTCCGCAGC TTCCGCCGGG ATCACCGTTC GAGGAGTTCT TCGACGACTT CTTCAAGAAT CGCCGCGGCG GCAAGAGTGA CGGCCTGCAG ACGCACAAGA CGAACTCGCT TGGCTCCGGA TTCATCGTCG ATGCCGCCGG AATTGTGGTC ACCAATAATC ACGTCATCGC CGATTCCGAT GAGATCAACG TCATTTTGAA TGACGGCACC AAGGTCAAGG CCGAGACCGT CGGGGTCGAC AAGAAAACTG ACCTTGCGGT GCTGAAGTTC AAGCCGCCGC ATCCGCTGAT GGCCGTTAAA TTCGGCGATT CCGACAAGTT GCGGCTCGGC GAATGGGTGA TCGCGATCGG CAACCCTTTC AGTCTGGGCG GCTCCGTTTC CGCCGGAATC GTCTCGGCGC GCAACCGCGA CATCAATAAC GGCCCTTACG ACAGCTACAT CCAGACCGAT GCCGCCATCA ATCGCGGCAA CTCCGGCGGC CCGTTGTTCA ACCTTGAAGG CGAGGTCGTG GGAGTGAATA CGCTGATCAT CTCGCCATCC GGCGGCTCGA TCGGTATCGG TTTCGCGGTG CCGTCGAAGA CGGTAGCGGG CGTCGTCGAT CAGTTGCGGA AATTCGGCGA GCTGCGTCGC GGCTGGCTCG GCGTTCGCAT TCAGCAGGTG ACCGACGAAA TCGCCGACAG CCTCAATATC AAGCCGGCTC GCGGAGCGCT GGTTGCCGGT GTGGAGGATA AGGGGCCGGC CAAGCCTGCC GGCATCGAAC CCGGCGATGT CGTCATCGCT TTCGACGGCA AGGACATCAA GGAGCCTAAG GATCTGTCGC GCATGGTCGC GGATACGGCG GTGGGCAAGA CCGTCAGCGT CGTGGTCATC CGCAAGGGCA AGGAAGAGAC CAAGAAGGTG ACGCTGGGAC GTCTCGACGA CAGCGACAAG GCCATCCCCG TTTCGATCAA GACCAAGCCG GAAGATGACG ACAAGCCCGT CACGCAAAAG GCGCTCGGCC TCGATCTCGC CACGCTAAGC AAGGATCTGC GCGCGCGCTA CAAGATCAAG GACAGCGTCA AGGGCGTCAT CATCACCGAG GTGGAAGCAG ATTCCGACGC CGCCGAGAAG CGGCTCAGCG CCGGCGACGT GATCGTCGAG GTGGCGCAGG AAGCTGTGAA CAGTGCGTCC GATATCCGCA AGCGCATCGA TCAGCTGAAA AAGGACGGCA AGAAAGCTGT GCTGCTGATG GTGTCGAACG GCGCCGGCGA ACTGCGTTTC GTGGCGCTGA GCCTGAAGTA G
|
Protein sequence | MTRAMQALSL RRRFLLSAFC LGVASAVVSA PALARGPDGI ADVAEKVIDA VVNISTTQKV EAKSGEGRRM IPQLPPGSPF EEFFDDFFKN RRGGKSDGLQ THKTNSLGSG FIVDAAGIVV TNNHVIADSD EINVILNDGT KVKAETVGVD KKTDLAVLKF KPPHPLMAVK FGDSDKLRLG EWVIAIGNPF SLGGSVSAGI VSARNRDINN GPYDSYIQTD AAINRGNSGG PLFNLEGEVV GVNTLIISPS GGSIGIGFAV PSKTVAGVVD QLRKFGELRR GWLGVRIQQV TDEIADSLNI KPARGALVAG VEDKGPAKPA GIEPGDVVIA FDGKDIKEPK DLSRMVADTA VGKTVSVVVI RKGKEETKKV TLGRLDDSDK AIPVSIKTKP EDDDKPVTQK ALGLDLATLS KDLRARYKIK DSVKGVIITE VEADSDAAEK RLSAGDVIVE VAQEAVNSAS DIRKRIDQLK KDGKKAVLLM VSNGAGELRF VALSLK
|
| |