Gene Information |
Locus tag | Noc_0208 |
Symbol | |
ID | 3706243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 230782 |
End bp | 232821 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637736724 |
Product | oligopeptidase A |
Protein accession | YP_342268 |
Protein GI | 77163743 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
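The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon. The function names are illustrative and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(230782, 232821)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2040 679
```

Under these assumptions the result agrees with the Gene Length (2040 bp) and Protein Length (679 aa) rows.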
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATC CACTTCTTGA ATTTGCGGGC CTACCTCCAT TTTCGAAAAT TCAGCCAGCC CATGTGGAGC CTGCGATTGA TTGTCTGCTG GCAGAAGGGC GGGCCTTGAT TGAACAACTG CTTACTCGCC ATACTGTCTA CACTTGGGAT AATCTAGCTC AGCCGCTAGA AGACTTGCGG GAGCATCTTG ACCGAGTGTG GTCTCCTGTG TCCCATATGA ATGCGGTGGT TAATAGCGAT GGATTACGGC GGGCTTACAA TGCCTGTTTG CCTAAACTTA GCGAGTTTGC CACTGAACTC GGGCAAAACG AAAACTTGTA CCGGGCCTTT CAATCGATTG CTGAAGGCGA CGAGTATCCG AGCCTGAATG TTCCCCAGAA AAAAATTATT GTCAATGCCC TGCGCGATTT TCGGCTTTCA GGAGTGACTC TGCCATCGGA AAAGAAAGCT CGCTTTAAAG CCATCCAGCA GCAGTTAGCC AGTTTAAACG CCAAATTTGA GGAAAATCTG CTGGATTCGA CCCAAGCTTG GCGTAAGCAC CTTGCGGATG AGACAATCCT GGCGGGACTA CCCGAGGGAG CACGGGCCCA AGCGCGCCAA GCAGCGGAAC AAGCGGGCCT GGAGGGTTGG TTGTTGACCT TGGAAGCACC TTCCTATGTT GCGGTCACGA CCTACGCCGA TGACCGGGAA CTACGGGAAG AAATTTACAC CGCCTTTGTC ACTCGCGGTT CGGATCAGGG CCCCCATGGA GGGCGCTGGG ATAATACTCA AGTCATGGAG GAAATCTTGG CCTTGCGCCA TGAAGCGGCA CAGTTGCTTG GTTTTGCCAA TCATGCTGAA TGCTCCCTAG CTACCAAAAT GGCCGGAAAT CCGCAGCAAG TATTAGATTT TCTTAATGAT CTAGCCGTCC GCTCCAAACG AGTGGCCGAG CAGGATTTAG CTGAGGTCCG ATCCTTTGCC CAGGCACACT ATGGTATTGA GGATCTCCAG GCGTGGGATG TGGCGTATTA TGGAGAAAAA TTGCGGCAGC ACAAATATGC CATTTCCCAG GAGGAACTCA AGCCTTATTT CCCCGTCTGG CGGGTGCTGG AAGGGCTATT TACTATCGTG AATCGACTCT ATGGCCTGGA AATTCAGGAA CGAAAGGATG TGGATACTTG GCATCCCCAG GTGCGTTTTT TCGATATTTT CGATGACAGT GGCGAACTGC GCGGGCAGTT CTATCTGGAT CTCTATGCGC GCAGCAACAA GCGAGGGGGA GCATGGATGG CCGATTGTCT GTCCCGCAAG CGCCAGGGAA GTCAATTACA AATTCCAGTG GCCTATTTGA CTTGTAACTT GACACCACCG GTAGATGATA AGCCAGCCCT GTTCACCCAT AATGAGGTGA TCACGTTGTT TCACGAGTTT GGCCACGGTT TACATCACTT GCTCACCAAA ATTGATTATC CCAGCGTGGC GGGAATTAGC GGTGTGTTCT GGGATGCAGT GGAATTGCCT AGCCAGTTTA TGGAGAATTG GTGCTGGCAA CAAGAAGCGT TGGCCCTCAT TGCCTGCCAT TTTGAAACCC ACGAACCTCT CCCGGAAAAA TTATTTGAGC GCATGCTAGC GGCCAAGAAT TTCCTCTCGG GGATGATGAT AGTGCGCCAG CTTGAGTTTG CCTTGTTCGA TTTTCGCCTG CACTTGGAGT ATGACCCCGC GAAGGGCGCC CGGGTTGATG AGCTGCTGCA GGAGGCGCGA GAGCAGGTTG CCGTTGTCAA GCCGCCGTCC TTTAACCGTT TCGCCCATAG CTTCAGTCAT 
ATCTTCGCCG GTGGTTATGC CGCTGGTTAC TATAGTTACA AGTGGGCCGA GGTGCTATCA GCAGATGCCT TTTCCCGGTT TGAGGAAGAA GGTATTTTTG ATCGGCAGGC AGGCAGGGCT TTCATGAGCA GCATCTTGGA GCAGGGAGGC AGCCGTGATC CCCTGGAACT ATTTATCGAA TTTCGGGGCC GAGAGCCCGT CATTGACGCC TTGCTCCGCC ACAGTGGCAT TGCCGCCTGA
|
Protein sequence | MSNPLLEFAG LPPFSKIQPA HVEPAIDCLL AEGRALIEQL LTRHTVYTWD NLAQPLEDLR EHLDRVWSPV SHMNAVVNSD GLRRAYNACL PKLSEFATEL GQNENLYRAF QSIAEGDEYP SLNVPQKKII VNALRDFRLS GVTLPSEKKA RFKAIQQQLA SLNAKFEENL LDSTQAWRKH LADETILAGL PEGARAQARQ AAEQAGLEGW LLTLEAPSYV AVTTYADDRE LREEIYTAFV TRGSDQGPHG GRWDNTQVME EILALRHEAA QLLGFANHAE CSLATKMAGN PQQVLDFLND LAVRSKRVAE QDLAEVRSFA QAHYGIEDLQ AWDVAYYGEK LRQHKYAISQ EELKPYFPVW RVLEGLFTIV NRLYGLEIQE RKDVDTWHPQ VRFFDIFDDS GELRGQFYLD LYARSNKRGG AWMADCLSRK RQGSQLQIPV AYLTCNLTPP VDDKPALFTH NEVITLFHEF GHGLHHLLTK IDYPSVAGIS GVFWDAVELP SQFMENWCWQ QEALALIACH FETHEPLPEK LFERMLAAKN FLSGMMIVRQ LEFALFDFRL HLEYDPAKGA RVDELLQEAR EQVAVVKPPS FNRFAHSFSH IFAGGYAAGY YSYKWAEVLS ADAFSRFEEE GIFDRQAGRA FMSSILEQGG SRDPLELFIE FRGREPVIDA LLRHSGIAA
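The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how one might strip that grouping and compute a GC percentage follows. The helper names are hypothetical, and only the first three blocks of the gene sequence are used, so the printed figure describes that fragment and not the 52% reported for the whole gene.

```python
def strip_blocks(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence in this record.
fragment = "ATGAGTAATC CACTTCTTGA ATTTGCGGGC"
print(len(strip_blocks(fragment)))     # 30
print(round(gc_percent(fragment), 1))  # 43.3
```

Passing the full gene sequence to the same functions would give a whole-gene value to compare with the GC content row.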