Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0041 |
Symbol | |
ID | 3705974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 39123 |
End bp | 40130 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637736565 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_342113 |
Protein GI | 77163588 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.676343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCGTAC TTGGAATAGA AACTTCCTGC GATGAAACAG GGGTAGCCGT TTTTGACAGT AAACAGGGAC TATTGGCCGA TATCCTCCAT AGTCAGGCAA AACTACATGC AGGCTACGGT GGGGTCGTTC CAGAATTAGC CTCTCGAGAT CACATTCGCA AATTACTGCC TCTGCTTCGC AAGGTGTTAA ACCAGGCTGG CCTTAACAAG GAGGACATTG ATGGCGTTGC CTATACGGGA GGTCCAGGGT TAATCGGCGC TTTATTAGTC GGTGCTGCCG TGGGTCGTAG CTTGGCCTGG GCTTTGGGCA AACCGGCAAT CGCTGTCCAT CACATGGAAG GACATTTACT CTCCCCTCTA TTAGAACCCA ATCCCCCCGG TTTCCCCTTT TGCGCCCTCC TTATTTCCGG TGGCCATACC ATGCTGGTTA CGGTCAACCG TATCGGCGCT TATCGGATAC TCGGAGAGTC CCTGGATGAC GCGGTTGGGG AAGCTTTTGA TAAGACGGCG AAGCTGCTTC AACTCGGGTA TCCAGGGGGA CCCGCCCTGG CTAAACTTGC TGAACAGGGA AATCCTGACC GTTTTTATTT CCCGCGCCCT ATGCTCGACC GCCCAGGGTT AAACTTTAGC TTTAGTGGGT TAAAAACCTA TGCTCTCAAT ACCCTGCATA AAAATGGCTC TGAAGCTGCC GCCGATATTG CCCGGGCCTT TCAAGATGCC GTCGTAGACA CCCTTACTGT CAAATGTCGG CGGGCCCTGC AACAAACAGG ACTCAACCGC CTGGTCGTTG CCGGCGGGGT TAGCGCCAAC CAAGCACTTC GACAGCGACT TAAAGCCATG GGAACCCAAG AAAATGTCCA AGTTTATTAC CCGCGATTAG CTTTTTGTAC TGATAACGGG GCAATGATCG CTTTTGCAGG CTGTCAGCGC CTGCTAGCAG GAGAACAAAC CAGCTCCGGA TTTAGCGTTC GGGCAAGATG GCCCTTAGAT ACGCTGCCGG AAATTTAA
|
Protein sequence | MRVLGIETSC DETGVAVFDS KQGLLADILH SQAKLHAGYG GVVPELASRD HIRKLLPLLR KVLNQAGLNK EDIDGVAYTG GPGLIGALLV GAAVGRSLAW ALGKPAIAVH HMEGHLLSPL LEPNPPGFPF CALLISGGHT MLVTVNRIGA YRILGESLDD AVGEAFDKTA KLLQLGYPGG PALAKLAEQG NPDRFYFPRP MLDRPGLNFS FSGLKTYALN TLHKNGSEAA ADIARAFQDA VVDTLTVKCR RALQQTGLNR LVVAGGVSAN QALRQRLKAM GTQENVQVYY PRLAFCTDNG AMIAFAGCQR LLAGEQTSSG FSVRARWPLD TLPEI
|
| |