Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2927 |
Symbol | |
ID | 3705356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 3309584 |
End bp | 3312616 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637739404 |
Product | Type III restriction enzyme, res subunit |
Protein accession | YP_344902 |
Protein GI | 77166377 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00341327 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCC AATTTGAAGA CGATCTGGAT TACCAAAAAG CCGCGATAGA CTCGGTTGTC AGTCTGTTTA AAGGACAGGA AATATCCCGC TCTGAATTCA CCGTGACATT CCAGCCGGAG TCTAGCCTCA ACTTATCTTT GGGTTTGGAG GAGAATCAGC TCGGTATCGG CAACAGGCTC TTGTTGGTTG ATGAAGAAAT TGAAGAAAAC CTGCGTAAGG TGCAACTTCA GAATGGCCTG CGGCCTACCA AGAAGCTAGC CAGCGGAGAC TTTACCGTTG AAATGGAGAC CGGAACGGGC AAGACCTACG TGTATTTGCG CACGATCTTT GAACTGAACA AGAACTACGG CTTCACCAAG TTCGCGATTG TCGTGCCGTC CATTGCGATT AAGGAAGGTA CTTATAAGAC GCTGCAAATC ACGCAAGAGC ATTTTGAAGG CTTGTACCCG AAGGCCAAGG GCTACGAATA CTTCCTCTAT GATTCCTCCA AGCTGGGTCA GGTGCGCAAC TTCGCTACCA GTTCCAATAT CCAGATCATG GTGACGACGG TCGGCGCGAT CAATAAAAAG GATGTCAACA ATCTATACAA GGAAAACGAG AATACAGGTG GTGAAAAGCC GATTGACCTT GTTCGCGCCA CGAACCCGAT CATTATTGTC GATGAGCCAC AAAGCGTTGA CGGTGGCCTG ACAGGCAAAG GCAAGGAAGC GCTCACGGCG ATGAATCCGT TGTGCACCCT GCGTTATTCC GCGACCCATG TGGACAAGCA TCACATGACC TTCCGCTTGG ATGCCGTGGA CGCCTATGAG CGAGGCTTGG TTAAGCAGAT TGAGGTGGCG TCACTCCAGA TCGACAGTGG ACACAACAAA CCCTATATCC GGCTGGACTC CACGGATAAC CGGAAAGGAT CAATTACCGC CAAAGTGGAA GTGGATGTTC AGCGCGGCAA GAATGTCCGG CGAGAATTCC TAACCGTGGA GGACGGCGAC GATCTGGAGC AGATCACAGG CCGCTCCATT TATGAAAACA TGCAGATGGG TACCATTACC TGTGGCAAAG ATAATGAGTC CATTGAGGTC AAGGGCGACG GCTTCGATCA ACGGCTTCGA CCAGGCGAGG CCATTGGTGG CGTTGATCCC GACCAAATCA AGCGCTTGAT GATCCGGCGA ACCATCAAGG AGCACTTCGA TAAGGAGCTG ATGTTCGCTG CGAACAAGAA GCCAATCAAA GTGCTCAGCC TATTTTTCAT TGATAGTGTC ACGCACTACC GCCAGTACGA CGAGGACGGT AACGCTGTTA AGGGCAAGTA CGCTCGGATG TTCGAGGAGG AATACCGCAA GCTGGCCAAG TCAGCGGATT ATCAAAGCCT GTTCAGGGAA ATTGATCTGG AGGCTGGGGC GGACGAGGTG CATAACGGCT ATTTTTCTAT TGATAAGAAA GGCCGATCTG TCGACACCGC CGAGAATAAT CAGGCGAACC GCGACAATGC AGAGCGGGCA TACAACCTGA TTATGAAGGA GAAGGAAAAG CTACTGAGCT TTGATACCAA GCTGAAATTC ATTTTTTCTC ACTCCGCTCT GAAGGAAGGT TGGGACAATC CCAACGTTTT CCAGATTTGC ACGCTCCGCG ACATGGGAAG CGAACGGGAG CGACGCCAAA CCCTTGGCCG TGGCCTGCGT CTCTGCGTCA ATCAGAATGG CGAACGGCTT CGCGGCAACG ACGTGAACAC GCTGACCGTA ATTGCCACGG AAAACTACGA AAAATTTGCC GATAACCTGC AAAAAGAGAT CGAACAGGAT ACGGGTATTC GCTTCGGCAT TGTCGAACCA CACCAATTCG CGACCATTCA AACCCTCAAC GAACAAGGCG AGGTTGCACC TTTGGGGGTG GAACAGTCAG AGAAGATTTG GCAGTTCCTC AAAGATCAGC AATTTGTGGA TGTAAAAGGT AAGGTGCAGG ACAGTCTTCG CGCCGCGTTG AAAAACGGAA ACTTTGAGCT GCCAGAGGAA ATCAAGCAGG AGCTTGTCAA GAATCACGGA GAGGAACAGA CTGATATAAT CACATCGGAT ATTCAGGCGG TTCTACGCAA GCTGGCTGGC AAGCTGGATA TCAAGAACGC CGATGATCGG AAAATCATTC GAACGCGGGA AGCTGTTCTT GAATCCGATG AGTTCCGGTT GCTCTGGGAG CGGATCAAGT ACAAGACCAC GTACCGTGTG GAGTTCGACA ACCTGAAGCT CCTGAACGAT TGCGCCGACG CCATAAGAAG CTGCCCGCCG ATTACCAAAA CCCGCGCCCA ATTCAGAAAG GCGGATATTG CTATCGGTAA AGGTGGCGTA GGGGTTCAGG AAACCAGCGC TTCCGGCTAT ACCACAATCC ACGAAAATGA TATCGAGTTG CCCGATATTA TTACCGACTT GCAGGACAAG ACCCAGCTCA CTCGAAAAAG TATCGTCCAG ATACTCAGGG AGAGCCGAAG GCTTCAGGAC TTCCTGCGCA ATCCCCAGCA ATTCATCGAC TATTGTTCCG AGGCGATTAA CCGTACTAAG CGGCTAGCGC TTGTTGACGG CATCAAATAC ACTAAGATCG GTGACGATCA CGGCTATGCC CAGGCACTTT TTAAGCAGGA AGAGTTGAAA GGCTATCTGA AGAATACGCT GGAAGTACAA AAATCCGTAT ACACCCATGT CGTGTATGAC TCGGGAAGCG TGGAGAAGTC CTTCGCCGAA GACTTGGAGA AGAACGAAAG AGTTAAAGTT TACGCCAAAT TGCCCCCGTG GTTCAAAGTG CCAACGCCGC TCGGTTCCTA TAACCCCGAT TGGGCTGTCG TTGTGGAAGA TGGCGGTGAG GAAAAGCTCT ACTTTGTGGT TGAGACCAAA GGCAGCGCGT GGTGGGATGA TTTACGCCAC CTTGAAGGAG CGAAGATCAA GTGCGGAGAA CGGCACTTTG AAGAAATTGC AAAAGACACA GAAAACCCCG TCCGCTATAT CAAGGCAATG GATGTTGCAG GGATGATGGG TCATGTGGAA TAG
|
Protein sequence | MKLQFEDDLD YQKAAIDSVV SLFKGQEISR SEFTVTFQPE SSLNLSLGLE ENQLGIGNRL LLVDEEIEEN LRKVQLQNGL RPTKKLASGD FTVEMETGTG KTYVYLRTIF ELNKNYGFTK FAIVVPSIAI KEGTYKTLQI TQEHFEGLYP KAKGYEYFLY DSSKLGQVRN FATSSNIQIM VTTVGAINKK DVNNLYKENE NTGGEKPIDL VRATNPIIIV DEPQSVDGGL TGKGKEALTA MNPLCTLRYS ATHVDKHHMT FRLDAVDAYE RGLVKQIEVA SLQIDSGHNK PYIRLDSTDN RKGSITAKVE VDVQRGKNVR REFLTVEDGD DLEQITGRSI YENMQMGTIT CGKDNESIEV KGDGFDQRLR PGEAIGGVDP DQIKRLMIRR TIKEHFDKEL MFAANKKPIK VLSLFFIDSV THYRQYDEDG NAVKGKYARM FEEEYRKLAK SADYQSLFRE IDLEAGADEV HNGYFSIDKK GRSVDTAENN QANRDNAERA YNLIMKEKEK LLSFDTKLKF IFSHSALKEG WDNPNVFQIC TLRDMGSERE RRQTLGRGLR LCVNQNGERL RGNDVNTLTV IATENYEKFA DNLQKEIEQD TGIRFGIVEP HQFATIQTLN EQGEVAPLGV EQSEKIWQFL KDQQFVDVKG KVQDSLRAAL KNGNFELPEE IKQELVKNHG EEQTDIITSD IQAVLRKLAG KLDIKNADDR KIIRTREAVL ESDEFRLLWE RIKYKTTYRV EFDNLKLLND CADAIRSCPP ITKTRAQFRK ADIAIGKGGV GVQETSASGY TTIHENDIEL PDIITDLQDK TQLTRKSIVQ ILRESRRLQD FLRNPQQFID YCSEAINRTK RLALVDGIKY TKIGDDHGYA QALFKQEELK GYLKNTLEVQ KSVYTHVVYD SGSVEKSFAE DLEKNERVKV YAKLPPWFKV PTPLGSYNPD WAVVVEDGGE EKLYFVVETK GSAWWDDLRH LEGAKIKCGE RHFEEIAKDT ENPVRYIKAM DVAGMMGHVE
|
| |