Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0402 |
Symbol | |
ID | 3706573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 445303 |
End bp | 446892 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637736914 |
Product | peptidase M48, Ste24p |
Protein accession | YP_342458 |
Protein GI | 77163933 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAAGC CTAGGTTTTC AGAGATCATC CTAGGCTTTT ATGAGCGGCG AGTAAGCTTC GCTTTATGGC TACTAGCTGC CGCACTGCTT AGCGCCTGTG CGATTAATCC GGTAACAGGC GAGCGGGAGC TGTCGCTGGT CTCCGAAACC CAGGAAATTC AGATGGGGGA GGAGAATTAT TTGACGATGC GGCAGATGCA GGGAGGGGAT TATACGGCTG ACCCCGCACT CACAGCCTAT GTGAGCCAGG TGGGTCAGCG TCTGGCGGCC GTGAGTGATC GTTCGCTGCC CTATGAGTTC TCGGTTATTA ACGATTCCAC CCCTAATGCG TGGGCCCTGC CGGGTGGTAA AATTGCTCTC AACCGGGGTT TATTGACGGA GCTAAACAAT GAAGCTGAGC TGGCGGCGGT GCTAGGCCAT GAGATCGTCC ATGCCGCAGC CGGTCATAGC GCCCAAGGCA TGGAGCGTGA TTTATTATTG AAGGGGGCGG TGTTAGGTTC AGTGCTGGCA ACGGGAGTGA GTGAATACAC GCCCCTGGTG TTGGGCGGGG CGCAAGCGGC GGCGCAACTG GTCAATCGGA AATATAGCCG CGATGCGGAG CGGGAAGCGG ATCTTTATGG CATGCGCTAT ATGTCCCGCG CCGGCTATGA TCCTTGGGCA GCGGTGAGCT TGCAGGAAAC TTTCGTTCGT CTCTCCGAGG GACAGCAGGA AAATTGGCTT TCGGGGTTAT TGGCAAGCCA TCCTCCTTCC CTGGAACGGG TGGAGGCCAA TGAGATGACT GCCCGTACTT TGCCTGCCGG AGGTGAGTTA GGGGCGGAAC GCTACCAAGC CAAGCTTGCT CCCCTGAGGC AGGTAGAAAC GGCCTATGCT GCTTATGATC AAGGCCGTAA AGCACTTCAA GAGGGCAACC TGGAGCAGGC GCTGAGTTTA GCGGAGCGGG CCATTACTGA AGAGCCCCGG GAAGCCCTGT TCTATGGCTT GCGCGGTGAT GTCTATCTGG CAAGGAAACG CTATCAGGAA GCTCTGGCCG ATTATAATCG GGCTATTAAG CGCAATGATC ATTTTTTTTA TTTTTATAAT CAGCGGGGTT TGGTAAACAA AGCATTAGGC CATTCAGAAA AGGCGCGTCA AGATCTGCAG CAGAGTATAG CCCTTTTACC CACGGAAAGC GCTAATAAAG CCTTAGGTGA TTTAGCCTTG ACGCAAGGAG ATCGGCAAGG CGCCATGACC TACTACCAAA AGGCAGCCGC CTCCCAAACC CCTTTGGGCC TTGAAGCAAG ACGCGCCTTG GTTTACCTGG ATTTGCCTAA TAATCCCCAA AAATATCTAG CGGCTCAGGT CAAGTTAAAT CGGCGTGGCT ATCTCGTTGT CCGGGTGACT AATCAGGCAC CGCTCCCCGT CCGCGATATT GGCATTGAAG TGCGTTATCT GGATTCCCAG GGGCACGTCC AGTCCCATAA GCAAGCGTTT CAAGGAATTC TTGCCGCAGG TCAGACCGCG CGCCTCAAGA CGAATCTGGG GCCGCTTTCA GATCCACGTG CGCTCGAACG GATCGAGGCT AAGGTGATCC AGGCCCGAAT TGCCAATTAA
|
Protein sequence | MNKPRFSEII LGFYERRVSF ALWLLAAALL SACAINPVTG ERELSLVSET QEIQMGEENY LTMRQMQGGD YTADPALTAY VSQVGQRLAA VSDRSLPYEF SVINDSTPNA WALPGGKIAL NRGLLTELNN EAELAAVLGH EIVHAAAGHS AQGMERDLLL KGAVLGSVLA TGVSEYTPLV LGGAQAAAQL VNRKYSRDAE READLYGMRY MSRAGYDPWA AVSLQETFVR LSEGQQENWL SGLLASHPPS LERVEANEMT ARTLPAGGEL GAERYQAKLA PLRQVETAYA AYDQGRKALQ EGNLEQALSL AERAITEEPR EALFYGLRGD VYLARKRYQE ALADYNRAIK RNDHFFYFYN QRGLVNKALG HSEKARQDLQ QSIALLPTES ANKALGDLAL TQGDRQGAMT YYQKAAASQT PLGLEARRAL VYLDLPNNPQ KYLAAQVKLN RRGYLVVRVT NQAPLPVRDI GIEVRYLDSQ GHVQSHKQAF QGILAAGQTA RLKTNLGPLS DPRALERIEA KVIQARIAN
|
| |