Gene Information |
Locus tag | Noc_0712 |
Symbol | |
ID | 3706978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 770459 |
End bp | 771403 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637737215 |
Product | peptidase S33, proline iminopeptidase 1 |
Protein accession | YP_342756 |
Protein GI | 77164231 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
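The coordinate and length fields above are linked by simple arithmetic. The sketch below checks that relationship. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Noc_0712.
length = gene_length(770459, 771403)
print(length)                  # 945
print(protein_length(length))  # 314
```

Both results match the Gene Length and Protein Length fields reported for this locus.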
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0121198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGACTC TCTATCCCGA CATCAAGCCT TATGTACGCC ATACCCTAAC GGTGGACCCG CCCCATGAAC TCTATGTCGA GGAATGCGGC CACCCGGGAG GACTCCCCAT CCTCTTTCTC CACGGAGGGC CAGGTAGTGG CTGCCAACCC CATCACCGCT GCTTTTTTGA TCCCGACATT TATCGGGTAA TTTTATTCGA TCAGCGTGGT TGCGGCAGAT CCCAACCCCA TGGCGAGTTG GAGAAAAATA CCACTACAGC GCTACTTGCG GATATGGAAT TTATCCGCAA CCACTTAGAG ATTGAGCGCT GGCTTATTTT TGGTGGCTCC TGGGGCGCCG CCCTCGGGCT ACTCTACGGA GAAACTCATC CAAGCCGGGT TTTAGGGCTC ATTTTACGCG GCATCTTCCT GGGCCGGGAG CAAGACACCC GCTGGTTCCT GCAAGAGGGC GCGCCGCGAA TTTTTCCCGA TGCTTGGGCG GCCTTGGTAG AGGATATTCC CGCCGAGGAG AGAAATAACC TCATAGAATT CTTCCACCAC CGTCTTAAGG GTCCCGACGA GCTGGCCCAG ATGGCGGCGG CTAAGGCCCT ACATGCCTGG GAGTCCAGTT GTATGCGCCT TGTCAACAGC GAGGCACCTT CCCAATCAGG CCGCACCACA CTGCTAGCCC ACGCCCGTTT GCTTATTCAC TACGCCAGAC ATCATTACTT TATTCAACCC AATCAGATAC TCGATCATGC CCATCAATTA AAAAATATTC CTGGAATCAT CGTCCATGGC CGCTATGATG TCATTTGCCC TGCCGGCAAT GCCTGGGAGC TGCATCAAGC CTGGCCTTCA TCCGAGCTGC AAATCGTGCC CCTAGCCGGC CATGGAGCAA CCGAGCCAGC CATCGCGGAC GCGCTAATTC GGGCAACGAA CCTCATGGCA AGGCGGGTAG GGTAA
|
Protein sequence | MLTLYPDIKP YVRHTLTVDP PHELYVEECG HPGGLPILFL HGGPGSGCQP HHRCFFDPDI YRVILFDQRG CGRSQPHGEL EKNTTTALLA DMEFIRNHLE IERWLIFGGS WGAALGLLYG ETHPSRVLGL ILRGIFLGRE QDTRWFLQEG APRIFPDAWA ALVEDIPAEE RNNLIEFFHH RLKGPDELAQ MAAAKALHAW ESSCMRLVNS EAPSQSGRTT LLAHARLLIH YARHHYFIQP NQILDHAHQL KNIPGIIVHG RYDVICPAGN AWELHQAWPS SELQIVPLAG HGATEPAIAD ALIRATNLMA RRVG
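The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the locus lies on the minus strand. The sketch below shows how the protein sequence could be rederived from it. It assumes the codon-to-residue assignments of translation table 11 are the same as those of the standard code (the two tables differ in permitted start codons), and it uses only the standard library. The `translate` helper is a hypothetical name. Codons containing characters other than A, C, G, T raise a `KeyError`.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCTGACTC TCTATCCCGA CATCAAGCCT"))  # MLTLYPDIKP
print(translate("AGGCGGGTAG GGTAA"))                  # RRVG*
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence, followed by the stop codon.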