Gene Information |
Locus tag | Noc_0704 |
Symbol | |
ID | 3706952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 756326 |
End bp | 759601 |
Gene Length | 3276 bp |
Protein Length | 1091 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637737207 |
Product | ATP-dependent dsDNA exonuclease (SbcC) |
Protein accession | YP_342748 |
Protein GI | 77164223 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
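The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 756326
END_BP = 759601


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 3276, matching "Gene Length"
print(expected_protein_length(length_bp))  # 1091, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.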
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.859038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAC GGCAGGTACG CTTTAAGAAT CTGAACTCAC TGGTCGGCGA GTGGGAAATC GACTTGACGC ACCCAGCCTT CGTGTCTGAT GGCATCTTCG CCATTACAGG CCCTACTGGC GCGGGCAAGA CGACTATTCT CGATGCCATT TGTATGGCTC TTTACGGGCG GACGCCTCGC CTAAACAAGG TCACTAAGCG TGGCAACGAG ATTATGTCCC GCCAGACCGG CGAATGCTTT GCAGAGGTGA CCTTCGAAAC CCAGACTGGG CGTTACCGCT GTCACTGGAG CCAGCACCGG GCACGCAAGA AGCCTGATGG CGAGCTCCAG GCTCCGAGAC ACGAAATTGC CAATGCCGAT TCCGGTGAGA TTTTCGAATC CAAAATCAGA GGGGTCGCGG ACCAGATCGA GTCGGCTACC GGTATGGATT TCCACCGTTT TACCCGCTCC ATGTTACTGG CCCAGGGCGG CTTTGCCGTG TTCCTCCAAG CGGTGCAGGA TGAGCGGGCG CCGATCCTTG AGCAGATCAC GGGCACGGAG ATTTACAGCC AGATTTCCAT CCGTGTTCAT GAGCGCCAAC GGGAAGAGCG GGAAAAACTG AACCTGCTTC AGGCTGAAAC GGAAGGCATC GTGATGCTTG AGCCGGAACA GGAACAAGAG ATTGGGCAGA CGCTTGAGAT AAAGCGGAAG GAAGAGGCAG ACCTTACCGC CAAGTTCGCC GACACTGGGC AGGCCATGGC CTGGCTCACC ACCATCGATG GTCTGAAGAA GGAAATCGTC AACCTGGCCG ATGAGGTGCG CAAGCTGCAA AACGATATCG AGGCGTTCAG ACCGGATCGT GAAAAGCTCA ACCGGGCTTT GAGTGCTGCC TCACTGGACG GCGCATACGC AACGCTCACA GCCATCCGCA AACAGCAGGT GGAGGACAGA GAAGCCTTGA AAGCTGAGGG AGAAGCGCTT CCTGGATTGG AATCCTCCGC CAAGGAGCAG GCCGAGTCAC TAAAATCGGC TGAGCAACAA ACCGCTCGGG TCAAAGAAGA GCTAAAAGTT GCCGCACCTA CCTTGCAGAA GGTTCGCTCC CTGGATCAGG AGCTCGCCAA TCTCAAAAAA ACTGCGGCAG AAGATAAACA GGATTGCCAA CAGGATCTTG AAAAGATTGA TACAGACAAA CAAGCCCGGC TTGAGGAGCA GGAAAAACGT TCCACGGCTC ACGGGAATCT GGAACTTGTT GACAGCTACC TCAAGGAGCA TGCACAGGAT GAATGGCTGA TCAGCGGCCT GGCTGGTGTG GAAGAACAGG TGAGCAGCCT ACTCTCCAGG CAAAATGAAA TCCATCAAAA AGAGATTGAC CAGGATAAGG CCGCGAAAGC CTTGGAACAG GCGACAAAGT CACTCGACGA TTGTCAGAAG CAATCTGACC TTCGGAAGCA GGCGCTGGAG GACTCATCCA AACAGCTTCA GCAGGGCAAA GATGCTTTGA GCCAGCTACT GGGGAACCGC TTATTGCGAG AATACCGCAC CGAGAAGGAA ACCCTGCTGC GTGAAATGGC CTTCCTGGCG AAAATAGCGG AGCTTGAAGA TCACCGGGCA AAACTGGAAG ATGGCAAGCC CTGTCCACTT TGCGGCGCAA CCGAGCATCC CTTCGCGGCA GGCAATGTCC CTGTTGCCGA TGAATCCGAA CAGAAGATCG ACGCGTTGAC CAGGCTGATC AGCGAAGTCG AGGATCAGGA AACCGCCATC AAGGAACACG AAAAAGCTGA AAGCTTGGCC CATAAGGACC TGACGGAGGC TGAAAAACAG 
GAGTCAGCAG CAGCTAATGG CAGGAAGGTT GCCGAAAAAG CCCTTGCCGA AGTGACGGAC AGCTTGGAAA AACTCCGGGC TGATTTTGCT GAACGCAGGC AGGCCGTTGC TGCCAAACTT CTGCCCCTTG GTATCACGGA CATCCCTGAA ACGGATATTT CATTACTACC CGAAATCCTC AGAGCACGAC TGAAGGCGTG GCAGGCCCAG GTCAAGAAAA AGGCGGATAT TGAGAAACAG ATTACCGACC TCGACAGCGA GGTGAAACGG CTGGATGCGG TCATTGAAAC CCAAAGCACC GCTCTGGCCG AAAAGCTGAA GCGCCTGGAG AGCTTAAAGA AGGAACTCGC CACCGTGAGT GATGAGCGAA ATGCACTGTA CGGCGGCAAG AATCCCGACG ATGAGGAGCG CTGCTTGAAC AAGGCGGTTG CTGATGCGGA AGGCGTCGAA AGGTGGGTCA GAGAGCAGCA CAATGAACTC CAGCAACAAT GGAAAACCGG GAAGGCCCTT GTCGAATCGT TGAAGAAAGG CATTGACCAA CGAGAGCCGG AACTGAGTAG GCTGGAAACA GAATTCTTCG CAGCACTTGT GTCCGTGGAT TTTTCAAATG AAGAACAGTA TCTGGCAGCC CTATTGTCTT CAGAGCGGAG GGCTGAGCTG GTGACTACGG CCAAGGATCT GGATGATTGC CAAACGGACC TCAAGGCTAG GCAAAAAGAT CGGGAAACGC ACTTGGCTAC GGAAATGGCC AAAAAGGTTA CTGACCAATC TATTGAGGAA CTGGAGTCGC AATCCAAGGA GTATGAAAAC ACACTGAAAG AGCTGCGAGA TATCATTGCC AGTCTTAAGC ATAAGCTCAG TGAGAATATG GCTGCCAAAG AGCGGCTAAA GGAGAAGCAA GGGGCTATCG AAGCCCAGAA AAAAGAATGT CGCAGGTGGA AGAACCTGCA TGAATTAATC GGCTCCGCAG ATGGTAAGAA GTACCGCAAT TTTGCCCAGG GGTTGACCTT TGAAGTGATG GTTGGCCATG CCAACCGGCA ACTGCGGAAA ATGACTGACC GTTACTTGCT AGTCCGTGAC GAGGCTCAGC CCCTGGAGCT CAACGTGGTT GACAATTACC AGGCTGGGGA GATTCGGTCC ACGAAGAACC TTTCCGGCGG TGAAAGCTTT ATCGTCAGCC TGTCCCTGGC GCTGGGTTTG TCCCATATGG CCAGCAAGAA TGTCCGGGTG GACTCGCTGT TCCTGGATGA AGGCTTCGGC ACCCTGGACG AAGAAGCCCT CGACACCGCC TTAGAAGCCC TTGCGGGCCT GCAGCAGGAT GGCAAGCTGA TCGGGATCAT TTCACACGTA CCTGCCTTGA AAGAACGGAT TAGCTCCCAA ATCCAGGTAA CACCTCAAAC CGGTGGCAGG AGCAAGATAT CGGGGCCTGG ATGCGGTGGG TTGAGTGCTG CAAAATGGGC CAAAGAAGCG GGTTAA
|
Protein sequence | MRIRQVRFKN LNSLVGEWEI DLTHPAFVSD GIFAITGPTG AGKTTILDAI CMALYGRTPR LNKVTKRGNE IMSRQTGECF AEVTFETQTG RYRCHWSQHR ARKKPDGELQ APRHEIANAD SGEIFESKIR GVADQIESAT GMDFHRFTRS MLLAQGGFAV FLQAVQDERA PILEQITGTE IYSQISIRVH ERQREEREKL NLLQAETEGI VMLEPEQEQE IGQTLEIKRK EEADLTAKFA DTGQAMAWLT TIDGLKKEIV NLADEVRKLQ NDIEAFRPDR EKLNRALSAA SLDGAYATLT AIRKQQVEDR EALKAEGEAL PGLESSAKEQ AESLKSAEQQ TARVKEELKV AAPTLQKVRS LDQELANLKK TAAEDKQDCQ QDLEKIDTDK QARLEEQEKR STAHGNLELV DSYLKEHAQD EWLISGLAGV EEQVSSLLSR QNEIHQKEID QDKAAKALEQ ATKSLDDCQK QSDLRKQALE DSSKQLQQGK DALSQLLGNR LLREYRTEKE TLLREMAFLA KIAELEDHRA KLEDGKPCPL CGATEHPFAA GNVPVADESE QKIDALTRLI SEVEDQETAI KEHEKAESLA HKDLTEAEKQ ESAAANGRKV AEKALAEVTD SLEKLRADFA ERRQAVAAKL LPLGITDIPE TDISLLPEIL RARLKAWQAQ VKKKADIEKQ ITDLDSEVKR LDAVIETQST ALAEKLKRLE SLKKELATVS DERNALYGGK NPDDEERCLN KAVADAEGVE RWVREQHNEL QQQWKTGKAL VESLKKGIDQ REPELSRLET EFFAALVSVD FSNEEQYLAA LLSSERRAEL VTTAKDLDDC QTDLKARQKD RETHLATEMA KKVTDQSIEE LESQSKEYEN TLKELRDIIA SLKHKLSENM AAKERLKEKQ GAIEAQKKEC RRWKNLHELI GSADGKKYRN FAQGLTFEVM VGHANRQLRK MTDRYLLVRD EAQPLELNVV DNYQAGEIRS TKNLSGGESF IVSLSLALGL SHMASKNVRV DSLFLDEGFG TLDEEALDTA LEALAGLQQD GKLIGIISHV PALKERISSQ IQVTPQTGGR SKISGPGCGG LSAAKWAKEA G
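The relationship between the gene and protein sequences can be illustrated on the first 30 bases of the gene. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons); the `translate` helper is hypothetical and handles only unambiguous uppercase A/C/G/T input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_30_bases = "ATGAGAATAC GGCAGGTACG CTTTAAGAAT"
print(translate(first_30_bases))  # MRIRQVRFKN, the first block of the protein sequence
```

The same helper applied to the final six bases of the gene, `GGTTAA`, yields `G` followed by a stop codon, consistent with the last residue of the protein sequence.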