Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2295 |
Symbol | |
ID | 3704501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2647658 |
End bp | 2648986 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637738774 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_344283 |
Protein GI | 77165758 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.853343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAACA TTAAAACGTT AATCCATGCC CGCTGGGTTA TTCCCGTTAT TCCAGAAGGA CAAATACTGG AAAACCATAG CCTTGCCATC TACCAAGGGC GTATTGTGGA TTGTCTCCCA AGAATAGAAG CGGAAACCCG CTACCGAGAC GCTCACCAAA TTGAGCTTAC CCAGCATGCA CTCATACCCG GTCTTATTAA TGCCCATATC CATAGTCCCA TGTCTCTCCT ACGCGGTCTC GCTGACGACC TCCCCCTTAT GGAGTGGCTT GAAAAGCATA TCTGGCCAGC CGAAACCAAA TGGGTAAGCG AAACCTTTGT GCGTGATGGC GCCCTTCTAG CGATTGCTGA AATGCTCCGT GGAGGCATTA CTTGCTTTAA CGATATGTAT TTTTTCCCCG AAGTCGTAGC GCAGGCGGCA GTCGAAGCTA ACATGCGCGC TGTTATTGGC ATGATTGTCA TTGATTTTCC CAGCAGATGG GCAAAAACCC CCGAGGATTA TCTCCGCAAA GGGTTAGAGT TAAATGATAA CTATCAGAAT CATCCTCTTA TCAAAACGGC CTTTGCTCCC CATGCCCCTT ATACTGTCAG CGATGAATCC TTAACGCAAG TCGCTATTTT ATCTAAGGAA CTAAATATTC CAGTCCATAT GCATATCCAT GAGACGGTGG AGGAAATTAA TAGGAGTATA ACCCAATATG GCATGCGACC CTTAGGGCGG TTACAGCGCC TAGGACTACT TTCATCCCGG CTCTTAGCCG TCCATATGAC TCAACTTACC GATCAAGAGT TTCAGACGAT AACGAAGCAT GGAATTCATA TCGTTCATTG TCCAGAGTCT AATCTCAAAC TTGCCAGCGG CTTTTGTCCA GTAGCTAAAC TATATCAAGC GGGAATTAAT ATTGCTTTAG GTACCGATAG TGCAGCAAGC AATAACGATT TGGATATGTT TGTGGAAATG CGCCTTACCG CTTTGTTGGC TAAAGCATTA GCTAGCGACG CTAGCGCCAT TCCCGCAAAG CAGGCGCTTC GCATGGCTAC CTTAAATGGT GCCCAAGCGT TAGGCTTGGA GCAAGAAATT GGATCCCTGG AAATCGGTAA GATAGCTGAC ATAGTCGCCG TCGATTTGGG TGGGCTAGAA ACACAGCCGC TCTATGATCC CATTTCCCAA TTAGTCTATA CGGCAGGCCG CGATAAGGTC AGCGATGTTT GGATAGCCGG CCAACAGGTC CTTAAGAGGC GCCAATTTAC AACTTTGGAT GAACGATTGC TTCTATCCCG AACCCAAGCC TGGGCGGAAA GAATAAAAGA ATCAAGGAAA TTCACATGA
|
Protein sequence | MQNIKTLIHA RWVIPVIPEG QILENHSLAI YQGRIVDCLP RIEAETRYRD AHQIELTQHA LIPGLINAHI HSPMSLLRGL ADDLPLMEWL EKHIWPAETK WVSETFVRDG ALLAIAEMLR GGITCFNDMY FFPEVVAQAA VEANMRAVIG MIVIDFPSRW AKTPEDYLRK GLELNDNYQN HPLIKTAFAP HAPYTVSDES LTQVAILSKE LNIPVHMHIH ETVEEINRSI TQYGMRPLGR LQRLGLLSSR LLAVHMTQLT DQEFQTITKH GIHIVHCPES NLKLASGFCP VAKLYQAGIN IALGTDSAAS NNDLDMFVEM RLTALLAKAL ASDASAIPAK QALRMATLNG AQALGLEQEI GSLEIGKIAD IVAVDLGGLE TQPLYDPISQ LVYTAGRDKV SDVWIAGQQV LKRRQFTTLD ERLLLSRTQA WAERIKESRK FT
|
| |