Gene Information |
Locus tag | Noc_0939 |
Symbol | |
ID | 3707330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1038059 |
End bp | 1038988 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637737448 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_342981 |
Protein GI | 77164456 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
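
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a consistency check, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1038059
end_bp = 1038988
gene_length_bp = 930
protein_length_aa = 309

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # prints: 930 310 309
```

Under these assumptions the span matches the listed gene length, and the codon count minus one matches the listed protein length.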
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.013426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATT TTAAACGTGG CGATTCTAAT TACGTACGGT CCGCCTATAG GCGGCCAGGT TCTGAACGGC CCGAATATGG GCGTTCTAAA AATCGTGGGC AGATTTGGTC TGTCATCATC GTGCTTTTGC TGATCGTCGT GTTAGCGGGA GGCGCGGTAT GGATGTATCT CAATGGTAAA GGCGAAGAAG AAGCCGTGGC AAGCAAAGAA TCGGAAGTTG CTAGCCCGCA AATGGCTGAG TCAAACACCA CCGAGTTGGA ACCTTCCCAG CAAGGAGGCA CTGAAGAAGA ATTTGCATCT ACAGCTCCCA GTGAAGAAGA TGAGTTTGCA TCCTCAACTC CCAGCGAGGA AGAAGATGAG TTTGCTTCGA TTTTAGAGGA ATTAGGGGTA GGGAGTGAAC CTAACGAAGA AGAAGCTTCC ACTCCCAGTA CAGTAGAGCC GCAAGCAGAG GGATCAGCCG TTGAAGGTAA TACTGTAATA GGGACTGCGC CGGAGGAGCC TCAGTTCTTC GATGAAGAAG AAGATCAATT TTCTACCGAG GAATTTACCA TACCTGAACA AGAGCAACCA GAAGAAGCGG TTGGAGAACA AGCTGAGCAA GCTACTGGGC AGTTTAGAGG AACTGAGCGG CAAGTAGAGG AGACCCAAAA GAGGGGAGAA GCTAATAATA AGATGAAGGA AGCGGAGGAG GAAGATTTTG CTGCGGTAAC TCCATCATTT GAAGAAGCTC CAACCAGATC CCAGGCTGAA GGATCATTTC CACAGACGGT AACTGTTCAG TCAGGCGATT CTCTTTCCGT TATCGCTGAC CGTGTTTACG GTGATGCGGG TAAGTGGCGT TTAATTTATG AAGCTAACCA GGATCAATTG GAAAATCCTG ACCAATTGTT GGTAGGGATG AAGTTAACCG TTCCTGATCC AAATGATTAA
Protein sequence | MADFKRGDSN YVRSAYRRPG SERPEYGRSK NRGQIWSVII VLLLIVVLAG GAVWMYLNGK GEEEAVASKE SEVASPQMAE SNTTELEPSQ QGGTEEEFAS TAPSEEDEFA SSTPSEEEDE FASILEELGV GSEPNEEEAS TPSTVEPQAE GSAVEGNTVI GTAPEEPQFF DEEEDQFSTE EFTIPEQEQP EEAVGEQAEQ ATGQFRGTER QVEETQKRGE ANNKMKEAEE EDFAAVTPSF EEAPTRSQAE GSFPQTVTVQ SGDSLSVIAD RVYGDAGKWR LIYEANQDQL ENPDQLLVGM KLTVPDPND
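
As a spot check that the two sequences correspond, the first 30 nucleotides of the gene sequence can be translated and compared with the first 10 residues of the protein sequence. This is a minimal sketch and not a full translation: the codon map holds only the nine codons that occur in this prefix, and it assumes the standard amino acid assignments, which translation table 11 shares for these codons. The names `CODONS` and `translate` are hypothetical helpers.

```python
# Partial codon map: only the codons present in the 30-nt prefix below.
CODONS = {
    "ATG": "M", "GCC": "A", "GAT": "D", "TTT": "F", "AAA": "K",
    "CGT": "R", "GGC": "G", "TCT": "S", "AAT": "N",
}

# First three 10-nt blocks of the gene sequence, with grouping spaces removed.
gene_prefix = "ATGGCCGATT TTAAACGTGG CGATTCTAAT".replace(" ", "")
# First 10-residue block of the protein sequence.
protein_prefix = "MADFKRGDSN"
# Last 10-nt block of the gene sequence; its final codon should be a stop.
gene_last_block = "AAATGATTAA"


def translate(dna):
    """Translate complete codons in dna using the partial CODONS map."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


translated = translate(gene_prefix)
print(translated == protein_prefix)   # prints: True
print(gene_last_block[-3:])           # prints: TAA
```

A full check would need the complete codon table; the sketch only confirms that the reading frame and the opening residues agree, and that the sequence ends in a TAA stop codon.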