Gene Information |
Locus tag | Noc_0063 |
Symbol | |
ID | 3705901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 64658 |
End bp | 67546 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637736588 |
Product | Type III restriction enzyme, res subunit |
Protein accession | YP_342135 |
Protein GI | 77163610 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
| |
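The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene sequence ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the terminal stop codon encodes no residue."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Noc_0063.
length = gene_length_bp(64658, 67546)
print(length)                     # 2889
print(protein_length_aa(length))  # 962
```

Both results agree with the Gene Length (2889 bp) and Protein Length (962 aa) fields listed in the record.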
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTGGC AATACAGCAC CGTTCATAAC AGCGCCTGTA AGGTCATCGA AGAACAGACC TTATGGGGGC AGGCCGTGTG CCGTATCTGG TTGCCGAACC AGGACGCGGT GGTGCGCGTG CCCCGCTCCG CCTTACGGCC GCTGAGTGCC GACCTGCAAC CGGAGATCGA GGCTGGACGC ATTGCCTATG TGGCCGCCGC AGCCAAGGTG GCCGAGGTGC TCGAAGGCTC CACCAGCGCC ACTGACGGTC ATGTATTGCT GGCTCCCATG GAGTCCAATG TTATCCCGCT GCCGCACCAG ATCCACGCCT TGTCCCGGGC TATCTCCGGC GACCGTGTGC GCTACCTGTT GGCCGACGAG GTGGGCCTCG GCAAGACCAT CGAGGCCGGT CTAGTCATGC GCGAGCTCAA GCTGCGCGGA CTGGTTCGGC GAATCCTGGT CGTCTCTCCC AAAGGCATCG CTACCCAGTG GGTGGCGGAA ATGCAGACCC ACTTCAATGA GCAGTTCCAG CTCGTGCTGG GTGATGACAT CAGCACCTTG CAGCGCCTGG CCCCAGGGGC GGACCACCGG AACTCAGCCT GGTCGATGTT CGATCAGGTC ATCGTTCCCC TGGATTCGGT CAAGCCCATG GACAAGCGGC GGGGTTGGAC CGCCGGGCGC GTTGCCGAAT ACAATCGCAG CCGGTTCGAG GATCTGATCA CTGCTGGTTG GGATCTGGTA GTGGTGGACG AAGCGCACCG CCTGGGTGGC AGTACCGATC AGGTCGCCCG CTATAAGCTC GGCAAGGGCC TGGCGGAGGC CGCGCCCTAT GTACTGCTCC TTTCGGCTAC GCCCCACCAG GGGAAGACCG ATGCTTTCCA TCGCTTGATG AACCTGCTGG ATGAGGACGC CTTCCCGGAT ATGGACAGCG TTTCCCGCGA CCGGGTGGCT CCTTACGTCA TCCGCACGGA AAAACGGAAG GCCATCGATG CCGATGGCAA GCCCCTCTTC AAAGCCCGGC GCACGCAGAT GGCCCCGGTA GTCTGGGAGA GCCGTCATCA CCTGCAGCAG CTCCTCTATG AGGCGGTGAC CGACTATGTG CGCGAGGGCT ACAACCAGGC TCTGCGCGAG AAGAAGCGCC ACATCGGCTT TCTGATGATC CTGATGCAGC GCCTGGTGGT CTCTAGCACC CGAGCAATCC GCACCACTTT GGAGCGTCGG CTCGCGGCAC TTAAGGAAGG CGAGCAGCAA GCCAGCCTGC GCCTAGCGGA GCTGGAAAAC AGTGCGGGGG GATCGGAAAA CACAGACGAT GAAATAACCG AGCTCTACGA CATGGACGGC CAGGAGTTGC TCGATGAACT GCTGAAATCC CATGTGTTGG CTCTACAGAG CGAAGGCAGT CATGTGGAGA CTTTGCTAGA TGCGGCGGTT CGCTGCGAGC AGGCGGGGCC GGACGCCAAG GCCGAGGCGT TGATCGAGTG GATCTACGAG TTGCAAGCCG AGGAAAACGA GCCGGATCTG AAAGTGTTGA TCTTCACTGA GTTCGTACCG ACCCAGGAGA TGCTGAAGGA GTTTCTGGAA GCCCGGGGAA TCTCGGTGGT CACCCTGAAC GGCTCCATGG ATATGGAGGT ACGTGGGGCA GCCCAGGATA CCTTCCGTAA ATCGCACCGC GTGCTGCTTT CCACCGATGC GGGCGGTGAG GGTCTAAACT TGCAGTTCGC CCATGTCATC ATCAACTACG ACATCCCCTG GAACCCGATG CGGTTGGAGC AGCGAATCGG CCGCGTGGAC CGTATCGGCC AGCCCAAGAT GGTGCGAGCG 
ATCAACTTCG TGTTTGAAGA TTCGGTCGAG TTTCGCGTTC GCGAAGTGTT GGAACAGAAG CTCTCGGTGA TCTTTGACGA GTTCGGCATC GATAAGACTG GTGACGTGCT TGACTCAGCT CAGGCCGGCG AGTTGTTCGA GGATGTGTTC GCGCAGGCGT TCGCCAACCC TGATGGTATT GAAACTTCCG TCGATCAGAC GGTGACTCGG ATTCGCGATG AGATTCAGCA GGTGCGGGAG TCCTCCGCCA TCTATGGCAT TTCCGAAGAA CTGAATGTGC AGGCGGCTGA GCAGCTGCGC TCCCATCCGC TGCCCCACTG GGTGGAGCGG ATGACGGTGG GCTATCTCAA TTCCCACGGC GGCACAGCCA GCCGTAAACG CTCCTGGTGG GATCTAAATT GGCCGGACGG TCAGGAGCAT CGCAAGGCCG TGTTCAATGC CCGGGAAGCG GACCGGCTGA CCGATGCAAC CCTGCTCAAT CTTGAAAACA GCCGTGTCCG TGGGCTGGCC TTGAACCTGC CGCAGATCGC GGCGGGCCAG CCATTGCCTT GCGTAAGCGT GAGCGGTCTG CCAACCAGCA TCTCCGGTCT CTGGGGACTC TTTGAGATCC GCCTTCAGGC CGGAATGCAC CAGAAGACAC AACTCCTGCG CATCCCCATG GTGCGGCGCG GTTATGTCAG CGTGTTCCTG AGCGAGGAAG GCAAACTGTT TCTGCCCACG GCCCGGCATA TCTGGGATGC GCTGCAGACA GCGGAAGCCC AGGTGCAAGC CACCCTCGGG CGAGATGAAT CCATCACCGC CCATGAGCGT TTGCGGATTG CTGCCGAACA GGCCGGACAG GAGCTGTTTG ACGCCTTGCA GCAGGTACAT CTTGCCGCTG TGGCTTACGA GGAGGAACGC GGAATTGTCT CCTTTGCCTC GCGCCGCAAG GCCATCGAAC GGGTTGGATT GCCGGAGGTG CGGCAATTCA GGCTGGCCCG TTGCGACGCA GAAGAATCCG AATGGCGACA TGAACTGCAA TCGGCGCGGC AGATCGTGCC GGAAATCCGG TCGCTGCTGA TGCTGCGGAT TATCAAAAGA GGCGCTTAA
|
Protein sequence | MPWQYSTVHN SACKVIEEQT LWGQAVCRIW LPNQDAVVRV PRSALRPLSA DLQPEIEAGR IAYVAAAAKV AEVLEGSTSA TDGHVLLAPM ESNVIPLPHQ IHALSRAISG DRVRYLLADE VGLGKTIEAG LVMRELKLRG LVRRILVVSP KGIATQWVAE MQTHFNEQFQ LVLGDDISTL QRLAPGADHR NSAWSMFDQV IVPLDSVKPM DKRRGWTAGR VAEYNRSRFE DLITAGWDLV VVDEAHRLGG STDQVARYKL GKGLAEAAPY VLLLSATPHQ GKTDAFHRLM NLLDEDAFPD MDSVSRDRVA PYVIRTEKRK AIDADGKPLF KARRTQMAPV VWESRHHLQQ LLYEAVTDYV REGYNQALRE KKRHIGFLMI LMQRLVVSST RAIRTTLERR LAALKEGEQQ ASLRLAELEN SAGGSENTDD EITELYDMDG QELLDELLKS HVLALQSEGS HVETLLDAAV RCEQAGPDAK AEALIEWIYE LQAEENEPDL KVLIFTEFVP TQEMLKEFLE ARGISVVTLN GSMDMEVRGA AQDTFRKSHR VLLSTDAGGE GLNLQFAHVI INYDIPWNPM RLEQRIGRVD RIGQPKMVRA INFVFEDSVE FRVREVLEQK LSVIFDEFGI DKTGDVLDSA QAGELFEDVF AQAFANPDGI ETSVDQTVTR IRDEIQQVRE SSAIYGISEE LNVQAAEQLR SHPLPHWVER MTVGYLNSHG GTASRKRSWW DLNWPDGQEH RKAVFNAREA DRLTDATLLN LENSRVRGLA LNLPQIAAGQ PLPCVSVSGL PTSISGLWGL FEIRLQAGMH QKTQLLRIPM VRRGYVSVFL SEEGKLFLPT ARHIWDALQT AEAQVQATLG RDESITAHER LRIAAEQAGQ ELFDALQQVH LAAVAYEEER GIVSFASRRK AIERVGLPEV RQFRLARCDA EESEWRHELQ SARQIVPEIR SLLMLRIIKR GA
|
| |