Gene Noc_0704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0704 
Symbol 
ID3706952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp756326 
End bp759601 
Gene Length3276 bp 
Protein Length1091 aa 
Translation table11 
GC content54% 
IMG OID637737207 
ProductATP-dependent dsDNA exonuclease (SbcC) 
Protein accessionYP_342748 
Protein GI77164223 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.859038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATAC GGCAGGTACG CTTTAAGAAT CTGAACTCAC TGGTCGGCGA GTGGGAAATC 
GACTTGACGC ACCCAGCCTT CGTGTCTGAT GGCATCTTCG CCATTACAGG CCCTACTGGC
GCGGGCAAGA CGACTATTCT CGATGCCATT TGTATGGCTC TTTACGGGCG GACGCCTCGC
CTAAACAAGG TCACTAAGCG TGGCAACGAG ATTATGTCCC GCCAGACCGG CGAATGCTTT
GCAGAGGTGA CCTTCGAAAC CCAGACTGGG CGTTACCGCT GTCACTGGAG CCAGCACCGG
GCACGCAAGA AGCCTGATGG CGAGCTCCAG GCTCCGAGAC ACGAAATTGC CAATGCCGAT
TCCGGTGAGA TTTTCGAATC CAAAATCAGA GGGGTCGCGG ACCAGATCGA GTCGGCTACC
GGTATGGATT TCCACCGTTT TACCCGCTCC ATGTTACTGG CCCAGGGCGG CTTTGCCGTG
TTCCTCCAAG CGGTGCAGGA TGAGCGGGCG CCGATCCTTG AGCAGATCAC GGGCACGGAG
ATTTACAGCC AGATTTCCAT CCGTGTTCAT GAGCGCCAAC GGGAAGAGCG GGAAAAACTG
AACCTGCTTC AGGCTGAAAC GGAAGGCATC GTGATGCTTG AGCCGGAACA GGAACAAGAG
ATTGGGCAGA CGCTTGAGAT AAAGCGGAAG GAAGAGGCAG ACCTTACCGC CAAGTTCGCC
GACACTGGGC AGGCCATGGC CTGGCTCACC ACCATCGATG GTCTGAAGAA GGAAATCGTC
AACCTGGCCG ATGAGGTGCG CAAGCTGCAA AACGATATCG AGGCGTTCAG ACCGGATCGT
GAAAAGCTCA ACCGGGCTTT GAGTGCTGCC TCACTGGACG GCGCATACGC AACGCTCACA
GCCATCCGCA AACAGCAGGT GGAGGACAGA GAAGCCTTGA AAGCTGAGGG AGAAGCGCTT
CCTGGATTGG AATCCTCCGC CAAGGAGCAG GCCGAGTCAC TAAAATCGGC TGAGCAACAA
ACCGCTCGGG TCAAAGAAGA GCTAAAAGTT GCCGCACCTA CCTTGCAGAA GGTTCGCTCC
CTGGATCAGG AGCTCGCCAA TCTCAAAAAA ACTGCGGCAG AAGATAAACA GGATTGCCAA
CAGGATCTTG AAAAGATTGA TACAGACAAA CAAGCCCGGC TTGAGGAGCA GGAAAAACGT
TCCACGGCTC ACGGGAATCT GGAACTTGTT GACAGCTACC TCAAGGAGCA TGCACAGGAT
GAATGGCTGA TCAGCGGCCT GGCTGGTGTG GAAGAACAGG TGAGCAGCCT ACTCTCCAGG
CAAAATGAAA TCCATCAAAA AGAGATTGAC CAGGATAAGG CCGCGAAAGC CTTGGAACAG
GCGACAAAGT CACTCGACGA TTGTCAGAAG CAATCTGACC TTCGGAAGCA GGCGCTGGAG
GACTCATCCA AACAGCTTCA GCAGGGCAAA GATGCTTTGA GCCAGCTACT GGGGAACCGC
TTATTGCGAG AATACCGCAC CGAGAAGGAA ACCCTGCTGC GTGAAATGGC CTTCCTGGCG
AAAATAGCGG AGCTTGAAGA TCACCGGGCA AAACTGGAAG ATGGCAAGCC CTGTCCACTT
TGCGGCGCAA CCGAGCATCC CTTCGCGGCA GGCAATGTCC CTGTTGCCGA TGAATCCGAA
CAGAAGATCG ACGCGTTGAC CAGGCTGATC AGCGAAGTCG AGGATCAGGA AACCGCCATC
AAGGAACACG AAAAAGCTGA AAGCTTGGCC CATAAGGACC TGACGGAGGC TGAAAAACAG
GAGTCAGCAG CAGCTAATGG CAGGAAGGTT GCCGAAAAAG CCCTTGCCGA AGTGACGGAC
AGCTTGGAAA AACTCCGGGC TGATTTTGCT GAACGCAGGC AGGCCGTTGC TGCCAAACTT
CTGCCCCTTG GTATCACGGA CATCCCTGAA ACGGATATTT CATTACTACC CGAAATCCTC
AGAGCACGAC TGAAGGCGTG GCAGGCCCAG GTCAAGAAAA AGGCGGATAT TGAGAAACAG
ATTACCGACC TCGACAGCGA GGTGAAACGG CTGGATGCGG TCATTGAAAC CCAAAGCACC
GCTCTGGCCG AAAAGCTGAA GCGCCTGGAG AGCTTAAAGA AGGAACTCGC CACCGTGAGT
GATGAGCGAA ATGCACTGTA CGGCGGCAAG AATCCCGACG ATGAGGAGCG CTGCTTGAAC
AAGGCGGTTG CTGATGCGGA AGGCGTCGAA AGGTGGGTCA GAGAGCAGCA CAATGAACTC
CAGCAACAAT GGAAAACCGG GAAGGCCCTT GTCGAATCGT TGAAGAAAGG CATTGACCAA
CGAGAGCCGG AACTGAGTAG GCTGGAAACA GAATTCTTCG CAGCACTTGT GTCCGTGGAT
TTTTCAAATG AAGAACAGTA TCTGGCAGCC CTATTGTCTT CAGAGCGGAG GGCTGAGCTG
GTGACTACGG CCAAGGATCT GGATGATTGC CAAACGGACC TCAAGGCTAG GCAAAAAGAT
CGGGAAACGC ACTTGGCTAC GGAAATGGCC AAAAAGGTTA CTGACCAATC TATTGAGGAA
CTGGAGTCGC AATCCAAGGA GTATGAAAAC ACACTGAAAG AGCTGCGAGA TATCATTGCC
AGTCTTAAGC ATAAGCTCAG TGAGAATATG GCTGCCAAAG AGCGGCTAAA GGAGAAGCAA
GGGGCTATCG AAGCCCAGAA AAAAGAATGT CGCAGGTGGA AGAACCTGCA TGAATTAATC
GGCTCCGCAG ATGGTAAGAA GTACCGCAAT TTTGCCCAGG GGTTGACCTT TGAAGTGATG
GTTGGCCATG CCAACCGGCA ACTGCGGAAA ATGACTGACC GTTACTTGCT AGTCCGTGAC
GAGGCTCAGC CCCTGGAGCT CAACGTGGTT GACAATTACC AGGCTGGGGA GATTCGGTCC
ACGAAGAACC TTTCCGGCGG TGAAAGCTTT ATCGTCAGCC TGTCCCTGGC GCTGGGTTTG
TCCCATATGG CCAGCAAGAA TGTCCGGGTG GACTCGCTGT TCCTGGATGA AGGCTTCGGC
ACCCTGGACG AAGAAGCCCT CGACACCGCC TTAGAAGCCC TTGCGGGCCT GCAGCAGGAT
GGCAAGCTGA TCGGGATCAT TTCACACGTA CCTGCCTTGA AAGAACGGAT TAGCTCCCAA
ATCCAGGTAA CACCTCAAAC CGGTGGCAGG AGCAAGATAT CGGGGCCTGG ATGCGGTGGG
TTGAGTGCTG CAAAATGGGC CAAAGAAGCG GGTTAA
 
Protein sequence
MRIRQVRFKN LNSLVGEWEI DLTHPAFVSD GIFAITGPTG AGKTTILDAI CMALYGRTPR 
LNKVTKRGNE IMSRQTGECF AEVTFETQTG RYRCHWSQHR ARKKPDGELQ APRHEIANAD
SGEIFESKIR GVADQIESAT GMDFHRFTRS MLLAQGGFAV FLQAVQDERA PILEQITGTE
IYSQISIRVH ERQREEREKL NLLQAETEGI VMLEPEQEQE IGQTLEIKRK EEADLTAKFA
DTGQAMAWLT TIDGLKKEIV NLADEVRKLQ NDIEAFRPDR EKLNRALSAA SLDGAYATLT
AIRKQQVEDR EALKAEGEAL PGLESSAKEQ AESLKSAEQQ TARVKEELKV AAPTLQKVRS
LDQELANLKK TAAEDKQDCQ QDLEKIDTDK QARLEEQEKR STAHGNLELV DSYLKEHAQD
EWLISGLAGV EEQVSSLLSR QNEIHQKEID QDKAAKALEQ ATKSLDDCQK QSDLRKQALE
DSSKQLQQGK DALSQLLGNR LLREYRTEKE TLLREMAFLA KIAELEDHRA KLEDGKPCPL
CGATEHPFAA GNVPVADESE QKIDALTRLI SEVEDQETAI KEHEKAESLA HKDLTEAEKQ
ESAAANGRKV AEKALAEVTD SLEKLRADFA ERRQAVAAKL LPLGITDIPE TDISLLPEIL
RARLKAWQAQ VKKKADIEKQ ITDLDSEVKR LDAVIETQST ALAEKLKRLE SLKKELATVS
DERNALYGGK NPDDEERCLN KAVADAEGVE RWVREQHNEL QQQWKTGKAL VESLKKGIDQ
REPELSRLET EFFAALVSVD FSNEEQYLAA LLSSERRAEL VTTAKDLDDC QTDLKARQKD
RETHLATEMA KKVTDQSIEE LESQSKEYEN TLKELRDIIA SLKHKLSENM AAKERLKEKQ
GAIEAQKKEC RRWKNLHELI GSADGKKYRN FAQGLTFEVM VGHANRQLRK MTDRYLLVRD
EAQPLELNVV DNYQAGEIRS TKNLSGGESF IVSLSLALGL SHMASKNVRV DSLFLDEGFG
TLDEEALDTA LEALAGLQQD GKLIGIISHV PALKERISSQ IQVTPQTGGR SKISGPGCGG
LSAAKWAKEA G