Gene Information |
Locus tag | Noc_1087 |
Symbol | |
ID | 3707076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1192577 |
End bp | 1193626 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637737589 |
Product | peptidase M42 |
Protein accession | YP_343122 |
Protein GI | 77164597 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
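The coordinate and length fields above are related by simple arithmetic. A minimal sketch of that consistency check follows, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp taken from the record for Noc_1087.
length_bp = gene_length(1192577, 1193626)
print(length_bp, protein_length(length_bp))
```

For this record the check gives 1050 bp and 349 aa, matching the Gene Length and Protein Length fields.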
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCGA TGAGTTATGA ACGGCTATTT GCGCAGATAG AAGATCTGGT CTTGCGCCAT TCTCCCAGTG GGGCCGAGCA AGAAATAGAT GGATGGTTGT TAGAACGTTT CACGGCGTTA GGGCAGACTG TCTGGCAAGA TGCCGCGGGC AACATTGTTG TGAAAGTTAG GGGGAAAAAT CCCGGGGCAA TCGCGATTAC CGCCCATAAG GATGAAATTG GCGCAATCGT GAAAACTAGT AATAGCCAAG GGCAAGTGGA GGTGCGGAAG CTAGGCGGCG CTTTCCCCTG GGTCTACGGA GAAGGGGTAG TGGATTTGTT AGGCGACCAT GCCTGCATTA GCGGTATTCT TTCGTTTGGT TCCCGCCATG TTTCCCATGA GAGCCCGCAG AAAGTCCAGC AAGATCAGGC GCCACTGCAA TGGCAGCATG CCTGGGTGGA AACCAAATGC ACGCAAAAAG AGCTGGAGGC CGCCGGAGTG CGTTCAGGTA CGCGGATGGT GGTGGGAAAA CACCGCAAGC GCCCCTTCCG CCTTAAGGAT TACATTGCCA GTTATACGTT GGATAACAAG GCTTCAGTGG CGATTTTGCT GGCATTGGCG CAGAAGCTTC ATAAACCCTT AGTAGATGTT TATTTGGTGG CCTCTGCCAA GGAAGAAGTG GGAGCAATAG GAGCGCTTTA CTTTAGCCGG TGTCATTCTC TGGATGCTTT GATTGCTTTG GAGATATGTC CCTTGGCGCC TGAATATTTC ATCAAAGAAG GCACCGCTCC TGTGCTGCTT TCCCAGGACG GTTATGGTCT TTACGATGAG GGACTCAATG GACTTATCAG GCAAGCCGCG GCCCGGCGAG AAATCCCCTT GCAACTGGCG GTGATTAGCG GTTTCGGCAG TGATGGTTCC ATTGCCATGA AATATGGCCA CGTGCCACGG GCCGCTTGTT TAGGTTTCCC TACCCAAAAC ACACATGGTT ATGAGATCGC CCATTTAGGC GCCATTGCCC ATTGTATTGA AATACTTTAC GCCTACTGCA CCCAGACGGA GGAGGGTTAG
|
Protein sequence | MDSMSYERLF AQIEDLVLRH SPSGAEQEID GWLLERFTAL GQTVWQDAAG NIVVKVRGKN PGAIAITAHK DEIGAIVKTS NSQGQVEVRK LGGAFPWVYG EGVVDLLGDH ACISGILSFG SRHVSHESPQ KVQQDQAPLQ WQHAWVETKC TQKELEAAGV RSGTRMVVGK HRKRPFRLKD YIASYTLDNK ASVAILLALA QKLHKPLVDV YLVASAKEEV GAIGALYFSR CHSLDALIAL EICPLAPEYF IKEGTAPVLL SQDGYGLYDE GLNGLIRQAA ARREIPLQLA VISGFGSDGS IAMKYGHVPR AACLGFPTQN THGYEIAHLG AIAHCIEILY AYCTQTEEG
|
| |
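The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch, not the database's own method: it assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which start codons are allowed, which this sketch does not handle). All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI order: first, second and third base each
# cycle through T, C, A, G, with the third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGATTCGA TGAGTTATGA ACGGCTATTT"))
```

Run on the first 30 bases of the gene sequence, this prints MDSMSYERLF, which is the first block of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and Protein sequence fields.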