Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_1441 |
Symbol | |
ID | 4032842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 1624754 |
End bp | 1626325 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637969915 |
Product | peptidase S1C, Do |
Protein accession | YP_576723 |
Protein GI | 92116994 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACA GGCCCACTGA TCTCTCGCCG CCTTCTCCGC CCAAGCCCGC CCGCCGTTCC GGGCTCACCG CGCGCAAGTT CGCGCTGATG GCGTCGGTCG CCGCCGGCCT CGGCCTGGCC GCATACGGGT TCAGTCCCCC GACGGACAAA GCCGATCTCT TCACCACCCC GGCCCATGCG CAGGTTAATA ACGACGTCAG CAAGGTCGCG CAGCCGATCG GCTTTGCCGA TATCGTCGAG CGTGTGAAGC CGTCGGTGAT TTCGGTCAAG GTCCGTATGA AGGACAAGGT GGTCGCCGGC GACGACGATT CGCCGTTCCA GCCGGGTTCG CCGATGGAAC GCTTCTTCCG TCGCTTTGGC GGACCGGATG GTCTGCCGCC GGGAATGCAT GGCCGGCGTG GCCATGGCAT CGTGACGGGA CAGGGCTCCG GCTTCTTCAT TTCGGCCGAT GGCTACGCCG TAACCAACAA CCACGTCGTC GACGACGCCG ACAAGGTCGA AGTCACGACC GATGGCGGCA AGACCTACAC CGCCAAGGTG ATCGGAACCG ATCCGCGCAC CGACATTGCC CTGATCAAGG TCGAGGGCGG CTCGAATTTC CCGTTTGCGA AGCTATCGGA CGACAAGGCG CGGATCGGCG ACTGGGTGCT CGCGGTCGGC AACCCGTTCG GCCTCGGCGG CACCGTGACC TCCGGCATCG TGTCGGCCAT CGGCCGCGAC ATCGGCAACG GTCCCTATGA CGATTTCATC CAGATCGACG CGCCCGTCAA CAAGGGTAAC TCCGGCGGTC CGGCCTTCAA CATGAAGGGC GACGTGATCG GCGTGAACAC CGCGATCTAT TCGCCGTCCG GCGGCAGCGT CGGCATCGCA TTTTCGATTC CCGCCAACAC CGTGAAGAAC GTCGTCGCGC AACTCAAGGA CAAGGGGTCG GTGAGCCGCG GATGGATCGG CGTCCAGATC CAGCCGGTCA CCGCTGACAT CGCCGATAGC CTCGGCCTCA AGAAAACCGA AGGCGCGCTG GTGGCCGAGC CGCAGGCCGA CAGCCCGGCG GCCAAGGCCG GCATCCAGTC CGGCGACGTG ATCACGCAAG TCAACGGTGC TACCGTCAAG GATGCGCGCG AACTGGCGCG GACCATCGGT GGCCTTGCGC CGGGGACCTC GATCAAGCTG AACGTCCTGC ACAAGGGGAC GGACAAGGTC ATCAACCTGA CGCTCGGCAT CCTGCCGAAC ACCGCGGAAG CAAAGGCCGA TCTATACGGC GATGATCACG GGCCGTCGCA TGGCGACGAC GTTCCCAGGC TTGGCATGAC GGTTGCTCCG GCGGGGTCGG TTGCCGGCGC CGGCACGGAG GGCGTGGTCG TCGTCGATGT CGATCCGAAG AGCGCGGCCG CGGATCGCGG CTTCAAGGAA GGCGACGTGA TCCTCGAGGT CGGAGGAAAG GACGTCTCCA ATTCGGCCGA CGTGCGCGAG GCGATCAAAG CGGCGCGCGC GGACCACAAG AACAGTATTT TGATGCGGGT TCGCACCGGC GGCACGGCGC GTTTCGTCGC GGTTCCGATC GCGAAGGGCT AG
|
Protein sequence | MTDRPTDLSP PSPPKPARRS GLTARKFALM ASVAAGLGLA AYGFSPPTDK ADLFTTPAHA QVNNDVSKVA QPIGFADIVE RVKPSVISVK VRMKDKVVAG DDDSPFQPGS PMERFFRRFG GPDGLPPGMH GRRGHGIVTG QGSGFFISAD GYAVTNNHVV DDADKVEVTT DGGKTYTAKV IGTDPRTDIA LIKVEGGSNF PFAKLSDDKA RIGDWVLAVG NPFGLGGTVT SGIVSAIGRD IGNGPYDDFI QIDAPVNKGN SGGPAFNMKG DVIGVNTAIY SPSGGSVGIA FSIPANTVKN VVAQLKDKGS VSRGWIGVQI QPVTADIADS LGLKKTEGAL VAEPQADSPA AKAGIQSGDV ITQVNGATVK DARELARTIG GLAPGTSIKL NVLHKGTDKV INLTLGILPN TAEAKADLYG DDHGPSHGDD VPRLGMTVAP AGSVAGAGTE GVVVVDVDPK SAAADRGFKE GDVILEVGGK DVSNSADVRE AIKAARADHK NSILMRVRTG GTARFVAVPI AKG
|
| |