Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_1572 |
Symbol | |
ID | 4033146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 1775221 |
End bp | 1776627 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637970047 |
Product | peptidase S1C, Do |
Protein accession | YP_576853 |
Protein GI | 92117124 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0853453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCGAA TCAGTGCTTT GCTTGCGGCG GCTCTGTGTG TCGTCGTCGT ATCCGGCAGC GTCAACTCTG CAACGGCGCA GGATCGCCGG GTGCCGGCCT CCCAGGCCGA GCTTCGGTTG TCCTATGCGC CGATCGTGCA GCGGGTGGCG CCGGCCGTGG TCAACGTCTA TGCCGCCAAG GTGATCCAGC ACCGCAACCC GCTTCTCGAC GACCCGATCT TTCGCCGCTT CTTCGGCGTT CCCGGCCAGC AGCCGGAACA GATGCAGCGT TCGCTCGGCT CGGGGGTGAT GGTCGATCCG TCCGGGCTTG TCGTGACTAA CAATCACGTG ATCGAGGGCG CCGATCAGGT CAAGGTCGCC TTGTCGGACA AGCGCGAGTT CGAGGCCGAG ATCGTGCTCA AGGACACGCG GACCGATCTC GCGGTCCTGC GGCTGAAGGG TGTGCATGAG AAGTTTCCGA CGCTCGATTT TGCCAATTCG GATGAGTTGC AGGTCGGAGA TGTCGTCATC GCGATCGGAA ACCCGTTCGG CGTCGGGCAG ACGGTGACGC ACGGCATCAT TTCGGCGCTG GCGCGCACAC AGGTCGGTAT CACCGACTAT CAGTTCTTCA TCCAGACCGA TGCGGCGATC AATCCGGGCA ACTCCGGCGG CGCGCTGGTC GATATGACGG GCCATCTTGC AGGCATCAAC ACGGCGATCT TCTCGCGCTC GGGAGGATCG CAGGGCATCG GCTTCGCGAT TCCGGCCAAC ATGGTGCGGG TCGTCGTGGC GTCCGCCAAG AGCGGCGGCA AGGCCGTGAA GCGGCCCTGG CTCGGCGCGC GACTGCAGGC GGTGACGCCG GAAATTGCCG AAACCATCGG CCTCAAGGCG CCCGCCGGCG CGCTGGTCGC CAGCGTTGCG CCGGGAAGTC CGGCGGAGCG GGCCGGCCTC AAGCTCTCCG ATTTGATCCT GTCGATCGAC GGACAGCCTG TCGAGGACCC CAATGCGTTC GACTATCGCT TCGCGACGCG CCCGCTCGGT GGCACGTCGC AAGTGGAAGT TCAACGCAAT GGCAGGCTGA TGAAACTCGC TGTTCCGCTG GAGACGGCGC CGGATACAGG TCGAGACGAG ATCGTGATTA CCGCGCAATC GCCGTTCCAG GGCGCAAAGG TTGCGAACAT TTCGCCGGCG CTTGCCGACG AGCTGCATCT CGATTCCAGC GCCGAAGGCG TCGTGGTGAC TGCGCTGACC GATGATGGAA TGGCCGCGAA TGTCGGCTTC AGGAAAGGCG ACATCATCCT GTCGGTCAAC AACAAGAAGA TCGTCAAGAC GGCCGATCTC GAACGGGCCG TTAGCGAAGC CAGGCGGCTC TGGCGCATCA CCCTGATGCG CGGCGGCCAG CAGATTCAAG TCATGCTCGG CGGATGA
|
Protein sequence | MSRISALLAA ALCVVVVSGS VNSATAQDRR VPASQAELRL SYAPIVQRVA PAVVNVYAAK VIQHRNPLLD DPIFRRFFGV PGQQPEQMQR SLGSGVMVDP SGLVVTNNHV IEGADQVKVA LSDKREFEAE IVLKDTRTDL AVLRLKGVHE KFPTLDFANS DELQVGDVVI AIGNPFGVGQ TVTHGIISAL ARTQVGITDY QFFIQTDAAI NPGNSGGALV DMTGHLAGIN TAIFSRSGGS QGIGFAIPAN MVRVVVASAK SGGKAVKRPW LGARLQAVTP EIAETIGLKA PAGALVASVA PGSPAERAGL KLSDLILSID GQPVEDPNAF DYRFATRPLG GTSQVEVQRN GRLMKLAVPL ETAPDTGRDE IVITAQSPFQ GAKVANISPA LADELHLDSS AEGVVVTALT DDGMAANVGF RKGDIILSVN NKKIVKTADL ERAVSEARRL WRITLMRGGQ QIQVMLGG
|
| |