Gene Information |
Locus tag | Noc_1342 |
Symbol | |
ID | 3706145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1490807 |
End bp | 1491976 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637737838 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_343367 |
Protein GI | 77164842 |
COG category | [P] Inorganic ion transport and metabolism; [S] Function unknown |
COG ID | [COG0428] Predicted divalent heavy-metal cations transporter; [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
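The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon, neither of which the record states explicitly. The function names are hypothetical helpers, not part of any database API.

```python
# Minimal sketch: cross-check the length fields of the Noc_1342 record.
# Assumption: coordinates are 1-based and inclusive.
# Assumption: the last codon of the CDS is a stop codon and is not
# counted in the protein length.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1490807, 1491976)
print(length_bp)                  # 1170, matching "Gene Length"
print(protein_length(length_bp))  # 389, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.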
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.779699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATGGGGG CGGCAACTCT CCTATCTTTG TTAAAAATGC AAGACTTTCT AATAGTTTTC GGCCTTGCCT TGTTGCCCGC ATTGGGGAAT TTTGTCGGCG GGCTGTGGGC GGAATTTCTT CGAACCTCAG AACGCGCCCT CAACCGGGCG CTCCACGCCG CGGCCGGTAT CGTCCTTGCT ATCGTCGCCA TTGAACTGAT GCCCGAGGCG CTGAAAAGTA TCTCCCCCTG GATGATTGCC TTGGCCTTTG CCCTGGGCGG CTTCGCCTAT ATGGCCCTGG AAGCGGCGAT TGAGTATTTG CAGAAGAAAA AAGGAAAGAA TAGCTCTGGA AGCACGGCCA TGTGGATGCT CTATGGGGCG GTGGCTACGG ATTTGTTCAG TGATGGCCTC ATGATTGGTG CCGGTTCGGC TGTTTCACCC AGTATGGCGC TTATTTTGGC GCTGGGACAG GTGCTGGCCG ATGTTCCGGA AGGGTATGCC GCGATCGCCA ATTTCAAGGA TAAAAACATC CCCCGTAGAC GGCGGTTTTG GCTTTCCGCT TCTTTCGCTT TGCCAGCGCT AACCGCTGCC ACTCTGGCTT ATTTCCTGCT TCGTGACCAG AATGAAACCC TAAAAATGGC GGGATTAGTT TTTACGGCGG GACTGCTTAC GGTAGCCGCT GTGGAGGACA TGGTTTCCGA GGCTCATGAA ATCGCGCAAG ATACGCGCTG GTCGGACTTT TCCTTCATTG GTGGCTTTGT CTTATTTATC CTTGTTTCCG CCGGTTTCAA AAGCTACCTG ATAGAAGAGC CTGAATCTGC TGTAGCGGCC AAAGCAGGGG CGGAGGCGCT ACCGGCGCTT TCGGTTTCCG AAACAGCCAA GTCTGGGGAA AAAGAGTCTC TCGTAACGCG CCTCCAGGAA CGTTCCCGCG AAAAAGACGC CGACATGATT ACTAGGCTTC CTCCTACGTC GGCAACCAAA AAATCACAGG AAAAGCGTGC GTTAACGTCT AGGCTGCCGA AAAGGTTGGT TGTCCAGCAC GGTGATACCT TGTCGCAGAT CGCGGCGCGT CTCTATGGCG ATCCTGCTCA ATGGCGACTC CTGTATGCGG CCAATCGGGA CAGACTTGAT AATCCTGATT TACTCAGAGC AGGAATGGAG CTTGTTGTTC CCCTTGATTC GGAAAAATAG
Protein sequence | MMGAATLLSL LKMQDFLIVF GLALLPALGN FVGGLWAEFL RTSERALNRA LHAAAGIVLA IVAIELMPEA LKSISPWMIA LAFALGGFAY MALEAAIEYL QKKKGKNSSG STAMWMLYGA VATDLFSDGL MIGAGSAVSP SMALILALGQ VLADVPEGYA AIANFKDKNI PRRRRFWLSA SFALPALTAA TLAYFLLRDQ NETLKMAGLV FTAGLLTVAA VEDMVSEAHE IAQDTRWSDF SFIGGFVLFI LVSAGFKSYL IEEPESAVAA KAGAEALPAL SVSETAKSGE KESLVTRLQE RSREKDADMI TRLPPTSATK KSQEKRALTS RLPKRLVVQH GDTLSQIAAR LYGDPAQWRL LYAANRDRLD NPDLLRAGME LVVPLDSEK
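The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11 (the table named in the record), which are the same as those of the standard code. Handling of alternative start codons is omitted because this gene begins with ATG. Only the first and last three 10-base blocks of the gene sequence are used here; the helper names are hypothetical, and `gc_fraction` shows how a GC content figure could be computed from the full sequence.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G). The amino-acid
# string is the one used by translation table 11; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGATGGGGG CGGCAACTCT CCTATCTTTG"
last_blocks = "CTTGTTGTTC CCCTTGATTC GGAAAAATAG"

print(translate(first_blocks))  # MMGAATLLSL, the start of the protein
print(translate(last_blocks))   # LVVPLDSEK, the end of the protein
```

The last 30 bases are in frame because the 1140 bases before them are a multiple of three. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the whole protein sequence and the GC content field to be checked the same way.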