Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1205 |
Symbol | |
ID | 3706704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1312360 |
End bp | 1315119 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637737707 |
Product | hypothetical protein |
Protein accession | YP_343236 |
Protein GI | 77164711 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.326173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTCTC CCAACCAGCA CCCGGAGCAG GTAGCCCGCA ACCGGATTGA CCTTCAGCTC AAGGCAGCCG GCTGGGTGGT GCAGGATAGT CAGGCCCTTG ACTTTAATGC CGGTCCGGGC CTTGCGGTGC GTGAGTATCG GACCGATGCC GGCCCGGCGG ACTATGTGTT GTTCGTGAAC CGGCAGGCGG TCGGGGTCAT TGAGGCCAAG CCCGAGGCCT GGGGACAGCG GATCACCACC GTGGAGGAAC AGTCCACGGG GTATGCCAAC GCCACCCTGA AATGGGTCAA CAACTCAGCG CCGCTGCCCT TTGTTTACGA AAGCACGGGC ATCATCACCC GCTTTACCGA TGGCCGCGAT CCCAAGCCCC GCTCCCGGGA GGTCTTCAAC TTTCACCGCC CGGAGAGCCT GGCCGAATGG CTGACTCAGT CTGCCTCCCT CCGCGCCCGG TTGCACGCCC TGCCGCCGCT AGCGCGCGTT GGCCTGCGGG ATTGCCAGAT CGGGGCCATT GAAAACCTGG AAGCCTCCTT CAAGGCGGAC CGGCCCCGCG CCCTGGTGCA GATGGCCACC GGCTCCGGCA AGACCTATAC CGCCATTACC GCCGGGTACC GGCTGCTCAA ACACGCCGAC GCCAAGCGCA TTTTATTTCT GGTGGACACC AAAAATCTGG GGGAGCAGGC CGAGCAAGAA TTCATGGCCT TTCTGCCCAA CGACGATAAC CGCAAGTTCA CCGAGCTCTA CAATGTCCAG CGCCTCAAGT CGTCTTTCGT GGCCCAGGAC AGCCAGGTCT GCATCAGCAC CATCCAGCGT ATGTACGCCC TGCTCAAGGA TGAGCCTTTG GATGAAGCCG CCGAGGAGAA CCACCCCGCC GAGCGCCGGC TAAAACCCAA GCAATCGCTG CCGGTGGTCT ACAATGGCAA ACTCCCGCCG GAGTTTTTCG ATTTCATCAT CATTGATGAG TGCCATCGCT CCATCTACAA CCTGTGGCGG CAGGTCATCG AGTATTTCGA TGCCTTCCTG ATTGGCCTGA CCGCCACCCC GGATAATCGC ACCTACGGCT TTTTCCGCAA GAACGTGGTC AGTGAGTACG GCCACGAGCA GGCCGTGGCC GACGGCGTCA ATGTGGGCAA TGAAGTCTAT GTGATTGAGA CTGAGCGGAC CCGGCAGGGC GGCACCCTCA AGGCCCATCA GCAGGTGGAA AAACGCGAGC GCCTCACCCG CAAGCGGCGC TGGGAAACCC AGGACGAAGA GCAGGCTTAT TCCGCCAAGC AACTGGACCG GGACATCGTC AACCCGGATC AAATCCGCAC CGTGATTCGC GCCTTTAAGG GAAAGCTGCC TGAGATCTTC CCCGGTCGCA ACGAGGTGCC CAAGACCCTT ATTTTCGCCA AAACCGACAG CCATGCCGAT GACATTATCC AGACGGTGCG GGAGGAGTTT GGCGAGGGTA ATCCCTTCTG CAAGAAGGTG ACCTATCAGG CCAAGGAAGA CCCCAAATCG GTCCTGGCCC AGTTCCGCAA CGATTATTAC CCCCGGATTG CCGTCACCGT AGACATGATT GCCACCGGCA CCGATGTAAA GCCCCTGGAA TGCCTGCTGT TTATGCGAGA TGTCAAAAGC CGCAACTACT TTGAGCAGAT GAAAGGGCGC GGCACCCGTA CCCTGGACGC CGATAGCCTG AGGAAGGTCA CCCCCTCGGC CACCGCCGCC AAGACCCACT ATGTCATTGT GGACGCCATC GGCGTCACCC AATCCCTGAA AACCGCCAGC CAGCCCCTTA TCACCAAGCC CTCGGTGTCC CTTAAGGATT TGGCCATGGG CGTGATGATG GGCGCGCGGG ACGAGGATAC GGTCTCTTCC CTTGCTGGCC GCCTGGCCCG CCTCGATAAA CAGCTCAACG ACAAGGACAA GGCCCGCATC CGGGAAGCCG CTGGCGGCAT GACTTTAACC GATATGGTCG GCGCCCTCGT CCAAGCCATC GACCCGGACC GGATTGAGGA GAAAACCCGG GAGTTCTCCG GCGCTGGCGA GCCGGGCCAC AGTGAGCGGG AAAAGGCCCG GGACCAGTTG GTAGGCCAAG CAGCCCAGGT TTTCACCGGC CCCTTGATTG CGCTTATCGA GGGTATCCGT CGGGATAAGG AGCAAACCCT CGACCATGAC AATCTGGATA CCCTGCTCCG TGCCGGGTGG GCGGGGGATA GCACCGAGAA CGCCAAGGCG CTGGCCCAGG AATTTGCGCG GTACCTAAGC GAGCACCGCG ACGACATTGA GGCCCTCACT CTCTACTTCC AGACACCCGC CCGCCGCGCC GAGGTGACCT ACGCCATGAT TAAAGCACTC CTAGAGCGAC TCAAGCAGGA CCGCCCCAAG CTTGCCCCCC TGCGGGTCTG GCAGGCTTAT GCGCATCTGG ATGACTATCA GGGGGAGCAC CCCATCAGCG AGTTGACCGC CCTGGTGGCG CTAATTCGCC GGGTCTGCGG GCTGGACCCG ACCCTCTCCA CCTACGCGGC CACCGTGCGC CGCAACTTCC AGCACTGGAT CATGCAACAT CACAGCGGCG CGGGGGAGAA ATTCAACGAG GCGCAGATGG CCTGGCTGCG GATGATTCGC GATCACATCA TCAGCTCTTT CCACATGGAG CATGACGATC TGGAGATGGC GCCCTTTGAT GCCCAGGGCG GGATGGGACG GATGTATCAG TTGTTTGGGG ATAGGATGGA TGAGGTGATT GGGGAATTGA ATCGGGAGTT GGTGGCTTAG
|
Protein sequence | MTSPNQHPEQ VARNRIDLQL KAAGWVVQDS QALDFNAGPG LAVREYRTDA GPADYVLFVN RQAVGVIEAK PEAWGQRITT VEEQSTGYAN ATLKWVNNSA PLPFVYESTG IITRFTDGRD PKPRSREVFN FHRPESLAEW LTQSASLRAR LHALPPLARV GLRDCQIGAI ENLEASFKAD RPRALVQMAT GSGKTYTAIT AGYRLLKHAD AKRILFLVDT KNLGEQAEQE FMAFLPNDDN RKFTELYNVQ RLKSSFVAQD SQVCISTIQR MYALLKDEPL DEAAEENHPA ERRLKPKQSL PVVYNGKLPP EFFDFIIIDE CHRSIYNLWR QVIEYFDAFL IGLTATPDNR TYGFFRKNVV SEYGHEQAVA DGVNVGNEVY VIETERTRQG GTLKAHQQVE KRERLTRKRR WETQDEEQAY SAKQLDRDIV NPDQIRTVIR AFKGKLPEIF PGRNEVPKTL IFAKTDSHAD DIIQTVREEF GEGNPFCKKV TYQAKEDPKS VLAQFRNDYY PRIAVTVDMI ATGTDVKPLE CLLFMRDVKS RNYFEQMKGR GTRTLDADSL RKVTPSATAA KTHYVIVDAI GVTQSLKTAS QPLITKPSVS LKDLAMGVMM GARDEDTVSS LAGRLARLDK QLNDKDKARI REAAGGMTLT DMVGALVQAI DPDRIEEKTR EFSGAGEPGH SEREKARDQL VGQAAQVFTG PLIALIEGIR RDKEQTLDHD NLDTLLRAGW AGDSTENAKA LAQEFARYLS EHRDDIEALT LYFQTPARRA EVTYAMIKAL LERLKQDRPK LAPLRVWQAY AHLDDYQGEH PISELTALVA LIRRVCGLDP TLSTYAATVR RNFQHWIMQH HSGAGEKFNE AQMAWLRMIR DHIISSFHME HDDLEMAPFD AQGGMGRMYQ LFGDRMDEVI GELNRELVA
|
| |