Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0807 |
Symbol | |
ID | 3785851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 924964 |
End bp | 926160 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810893 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_411506 |
Protein GI | 82701940 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family [TIGR02038] periplasmic serine pepetdase DegS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAAGC TCTGGCTAGT TTTCGCGCAA GCTACCACGA TAGTTCTGGC AGCCCTGTTT GTCGTTTCGA CTTTACGCCC GGGTCTATTA CCCTGGCAAT CCGGAGATGG CGGTGTCGTA ACCATAAAGG AAGCTCCGGT TGACAAGTCA CACAAGGCAG AGGTCGAGCC TGCCCCTGGG AGCTTCAGTA GCGCGGCGAA AAAGGCCATG CCCTCCGTGG TGAATGTATT CACCACCAAG GAAATCAAGG CTGCCCCCCA TCCCTTCATG GAGGATCCTT TCTTCCGGCG TTTCTTCGGA GATCGTTTCG AATCCCCTCA GAGCCGCCGC GCCGCCAGCC TGGGATCCGG GGTGATCGTG AGTCCGCAGG GATATATCCT TACGAACCAT CATGTCATTG AAGCGGCGGA TGAAATCGAG ATTGCGCTGG CGGACGGGAG AAAAACGAAA GCGCGGGTCA TCGGCTCCGA TCCCGAAACC GATCTTGCAG TGGTAAGAGT GGATATGGAA GGACTTCCGG CCATCACTTT CGGATACTCC GACAATGCCC TGGTCGGCGA TATTGTCCTT GCGATCGGTA ATCCTTTCGG TGTGGGCCAG ACGGTAACGA TGGGGATTAT CAGCGCACTC GGACGAACCC ATCTGGGTAT CAACACCTTC GAAAATTTCA TTCAGACCGA TGCTGCCATC AATCCGGGAA ATTCTGGCGG TGCACTGGTG GATGCGTCGG GTAACCTCAT CGGCATCAAT ACCGCCATAG TCTCCAGAAC GGGAGGATCA CTTGGGATAG GCTTTGCCAT TACAGCAGGG GTAGCCAAGC AGATCATGGA GCAGATTATC CGGACAGGGG GCGTGACCCG TGGCTGGATC GGCGTGGAAG TACAGGATAT GACGCCGGAA CTTGCGGAGT CGTTCAAGCG CTCGACTACC AGCGGGGCGT TGATTGCAGG TGTGCTCAAG GGAGGACCTG CGGATCGTGC CGGAGTGAAG CCGGGTGACA TTATTGTAGG AGTGGGAGGA AAAGAGGTAA CGGATTCATC CGGCATGCTC AATCTGGTAG CGGCATTACC TCCCGGAAAC ATGGCGACCA TTACAGTCAT GCGTAACCAG AACAAGAAGG CAATCGAGAT CAATGTTGGA AAACGTCCCA AGCCTCAGCC TCAGGAGCAG TTTCAGGAGC CCGAGGAGCT GGAATAA
|
Protein sequence | MHKLWLVFAQ ATTIVLAALF VVSTLRPGLL PWQSGDGGVV TIKEAPVDKS HKAEVEPAPG SFSSAAKKAM PSVVNVFTTK EIKAAPHPFM EDPFFRRFFG DRFESPQSRR AASLGSGVIV SPQGYILTNH HVIEAADEIE IALADGRKTK ARVIGSDPET DLAVVRVDME GLPAITFGYS DNALVGDIVL AIGNPFGVGQ TVTMGIISAL GRTHLGINTF ENFIQTDAAI NPGNSGGALV DASGNLIGIN TAIVSRTGGS LGIGFAITAG VAKQIMEQII RTGGVTRGWI GVEVQDMTPE LAESFKRSTT SGALIAGVLK GGPADRAGVK PGDIIVGVGG KEVTDSSGML NLVAALPPGN MATITVMRNQ NKKAIEINVG KRPKPQPQEQ FQEPEELE
|
| |