Gene Information |
Locus tag | Noc_2139 |
Symbol | |
ID | 3705331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2466230 |
End bp | 2467669 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637738615 |
Product | peptidase S1C, Do |
Protein accession | YP_344129 |
Protein GI | 77165604 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0865536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAGAT ATCGGATGAT TTTGTTATTG CTAGCGGCGC TTTTTGCATT GCCAGCACAG GCCCTTGAGG AAGGTGGCGT TGAGAGTCTG CGCCAGACGG GTAAGGCCTT TGCCGCCGTG GCCCGTGAAG TATCACCATC AGTGGTTTTC ATCCAGGTGG AGAGTATCCG TGAGGAACGT GGTTCGCGGG CGTTTCCTCC ACCCTTTGGC GATGAGTTTT TCAGGCGCTT TTTTGGCGAG CCCTTTTCGC CCCGGGGCGA GGCGCCAAAA GGCCAGCGCC GTGCCATCGG CCAGGGATCA GGCTTCATTT TTTCATCGAA AAAGGGCTTG CTGTCGGACA AGACTTATAT TCTTACTAAC AGCCATGTAG TGGAAGACGC TGACAAGATT CGGGTCCAAT TTCAAGATGA TCGCGAGTTC GAGGGTGAAA TTGTTGGGAC CGATCCCAAA TCGGATATCG CGGTTATTGA GATTACCGTT GGCGGTCTTC CAGCCCTGGA ATGGGGAGAT TCTTCCAAGC TTCAAGTAGG GGAATGGGTA ATAGCCATGG GTAACCCCTT TGGCTTGAGC CATACCCTGA CCGTGGGCGT GGTGAGCGCC ACGGGCCGCA CCAGCTTAGG CATCAGCGAT TATGAGGATT TTATTCAAAC GGATGCCGCT ATCAATCCTG GCAATTCCGG TGGTCCCCTG GTAAATCTGA ACGGCGAAGT GGTGGGCGTC AATACCGCTA TCTTCAGCCG GAGCGGGGGC TATATGGGCA TAGGATTCGC CATTCCCAGC AAATTAGCCA AGGCTATTGC CAATCAACTC ATTGAAACGG GCGAGGTGAC CCGGGGTTAT CTGGGAATTG TCATTCAGCC CCTGACGGCG GAGCTGGCGG AATCCTTCAA TATGGAACAG TCCCAGGGTA TTCTCGTTGC CCAGGTATCG GAGGACTCGC CTGCAAAAAA GGCGGGGCTC AAGCAAGGGG ATGTGATTGT CGGTTACCAG GATAAGCCGG TTAAAGATAT TGGCGGTTTC CGTAATCGAG TTGCGCTCAC CGCGCCCGGT AGCCGTGAGA CCCTTACTAT TATCCGCGAT GGTAAGCGGC AGAAGGTAAA AATTACTATT GGTGAACTCA GCCAAGAGCA GACCCTTGCC GAAGGGCCTA CCCAGAGCGC GGAAGAATTG GGGTTGGCGG TTCAGACTTT GACTCCGGAG CTGGCAAGAC AATTTGATGC CAAGGCGGGT GAAGGTGTGG TTGTTACTGG GGTTGAGCGT GGCTCCATCG CCGCCATGGC GGGGATCAGG GTGGGTAACG TGATTCTCCA GATCAACCGC AAGCCGATCC ATAGTGCGAA GGAATTCAAT CGTGCCATGC AGGAAAGCCA GGATGATAAG CGTGCTTTGT TGCTGATTCG TTCAGGTAAT ATGCAACATT ACGTGGTACT GCGCTGGTAG
|
Protein sequence | MHRYRMILLL LAALFALPAQ ALEEGGVESL RQTGKAFAAV AREVSPSVVF IQVESIREER GSRAFPPPFG DEFFRRFFGE PFSPRGEAPK GQRRAIGQGS GFIFSSKKGL LSDKTYILTN SHVVEDADKI RVQFQDDREF EGEIVGTDPK SDIAVIEITV GGLPALEWGD SSKLQVGEWV IAMGNPFGLS HTLTVGVVSA TGRTSLGISD YEDFIQTDAA INPGNSGGPL VNLNGEVVGV NTAIFSRSGG YMGIGFAIPS KLAKAIANQL IETGEVTRGY LGIVIQPLTA ELAESFNMEQ SQGILVAQVS EDSPAKKAGL KQGDVIVGYQ DKPVKDIGGF RNRVALTAPG SRETLTIIRD GKRQKVKITI GELSQEQTLA EGPTQSAEEL GLAVQTLTPE LARQFDAKAG EGVVVTGVER GSIAAMAGIR VGNVILQINR KPIHSAKEFN RAMQESQDDK RALLLIRSGN MQHYVVLRW
|
| |
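The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence above ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2466230
END_BP = 2467669
GENE_LENGTH_BP = 1440
PROTEIN_LENGTH_AA = 479


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS.

    Assumes the CDS length includes exactly one terminal stop codon,
    which is not translated into an amino acid.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the values stated in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the stated Gene Length (1440 bp) and Protein Length (479 aa) agree with the coordinates. Because the record marks the gene as not spliced, no intron handling is needed.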