Gene Information |
Locus tag | Nmul_A0204 |
Symbol | |
ID | 3785877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 215482 |
End bp | 218748 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637810275 |
Product | ATP-dependent dsDNA exonuclease (SbcC)-like |
Protein accession | YP_410904 |
Protein GI | 82701338 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
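
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(215482, 218748)
print(length)                     # 3267
print(protein_length_aa(length))  # 1088
```

Both results match the Gene Length (3267 bp) and Protein Length (1088 aa) fields of this record.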
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.18794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAT TGCAGGTACG ATTCAAGAAT CTGAACTCGC TGATGGGTGA ATGGGAAATC GACCTGACGC ATCCCGCCTT TACCTCCGAC GGTATTTTTG CCATCACCGG TCCGACCGGT GCCGGGAAGA CAACCATCCT CGATGCCATC TGCCTCGCTC TCTACGGGCG GACCCCTCGG TTGAGCAAGG TCACCAAAAG CGGAAACGAG ATAATGTCCC GTCAGATCGG CGAATGTTTT GCTGAAGTGA CGTTTGAGAC ACAGGCCGGT CGTTTCCGTT GCCACTGGAG CCAGCACCGG GCACGTAAAA AACCGGATGG AGAACTACAG GCTCCGAAAC ATGAGATTGC TCATGCCGAT TCGGGGAAGA TTTTCGAATC CAAAATCAAG GGAGTTGCTG ACCGGATCGA GGCGGCTACC GGTATGGATT TCGACCAATT CACCCGCTCC ATGTTGCTGG CTCAGGGAGG CTTTGCCGCA TTCCTGCAGG CTTTACCCGA TGAGCGAGCG CCGATCCTGG AGCAGATAAC GGGCACGGAG ATTTATAGCA AAATATCGAT CCGCGTCCAT GAGCGCCAGC GTGAAGAGCG GGAGAAACTG AACCTGCTTC AGGCTGAAAC AGCCGGTATC GTGATTCTGG AGCAGGAACA GGAAAAAGAT ATTCAACAAG AACTCGAGTT AAAACAGAAA GAAGAGTCCG CTCTGACCAG AAAATTAATG GATACGGATA AAGCCATCAC GTGGCTGAAG GCCGTAGATG GACTGAGGAG GGAAATCGAG GATCTGACAG CAGAAGCGAG CAAACTGCAA GCCGACATCG ACATTTTTAA ACCAGAGCGT GACAAACTCA GCCGGGCAAT TCAGGCAGCG TCACTGGAAG GTTCCTACGC GACGTTGACA GCAATCCGAA CACAACATGC GGATGAGCAG AAGGCAGTGA AAACTGAGGA AGCAGTCCTT CCTCAACTGG CATCCTCTGC AAAGGAACAG GCTGAATCCC TGAAATTGGC TGAGCAGCAG TCCACCAGAG CCAAAGCGGA GCTGGAAGCT GCTTTACCGC TCATCAGGAA GGTCCGTTCG CTCGATCAGA CGCTTGCCGA TCAGAAAAAG GCTATTTCGA AAGATGAAGA CAGCTGCAGG AAAGACACGA AACAGATCGA TGCTGACAGA GAGGTCAAGC TGAAAGAACA GGTGAAACGC GCCGATGCTG AAAAGAATCT GAAGGTTGCT GAAGATTATC TCAAGGAGAA CGCTCGGGAT GAGTGGCTGA TAAGCGGTCT GGCCGGTGTT GAGGAACAAC TGGAAAATTT GCTCTCCAGG CAAAAGGAAA TAGCAAAAAA AGAGGTCGAT AAAGAGAAGG CAGTGACGGC TCTGGAACAG GCAACAAGAA AACTAGCTGC CTGCCGGAAG CAATGCGAGG CTCGGAAACA GGAGCTGGAG AACGTCTCAA AGAATCTCCA ACAAGGCAAA GATAAATTAA ACCAGTTATT GGGTGACCGG CTGTTGCGTG AATACCGTGC AGAAAAGGAA ACCCTGCTTC GAGAAATGGT TTTTCTGAAA AAAATAGCGG AACTTGAAGA TCACCGAGCC AGATTGGAAG ATGGCAAACC CTGTCCGCTT TGTGGCGCAA CGGAGCATCC CTTTGCGGAA GGCAACGTCC CTATTCCCGA TGAAATCGAA CGGAAGATTG GGGCGCTCAC TGAACTGATA AGCAAGGCTG AGGATCATGA AACCGCTATC AGACATTTTG AGGAAGACGA AGGCACAGCC CGTAAAAATT TGATGGACGG TGAAAAGCAG 
GAAATAACAG CAGTTAATGA CAAAAAAACT GCCGGGCAAG TCCTGGCCGG GCTGCAGGAA TATCTGGAAA AACTCCAAAC TGATATTACC GGACTGAAGC AGACTGTATC AGCCAAACTC CAACCGCTCG GCATTGAGGA AATTCTGGAT TCCAATGTTT CAGCACTGCT GGAATCCTTG CGAGAACGCG TTAAAGCATG GCAGATACAG ATCAAGAGAA AGACTGATAT CGAGAAACAG ATCGGAGATT TCGATAGCGA GTTGAAAAGG TTGGACGCAG TCATTGAAAC TCAAAGCAAT GCATTGAATG AAAAACGGGA ATGTCTGCAG TCCCTCAAAG AGGATTGCGC CGCTGGATGC AAAGAGCGGA AAGAATTGTA CGGTGATAAA GATCCTGATG ATGAAGAACG GCGTTTGAAC AAGAGTATTT CCGTTTCAGA GGAGGCTGAA AGGAAGGCCA GAGTCCTGCA TCATGATCTT CAGCAGAAAT TGAATGCTGC CAAGACTCAA ATTGAATCTC TGAAAAAGCG CATTGGTCAA AGAGACCCGG AGCTGAAGGA ACTGGAAGCC GGATTTACAG AAGCATTGGG ATTAGCTGGT TTTTCGGGTG AAAAACAGTT TCTTGAAGCC AGGCTCCCGA TTGGGCAAAG GGATGCGCTA TCTGCAAGGG CAAAAAATCT GGACGGTCAC CAAATAGATC TCAAGGCCAG ACAAAAGGAT CGGGAAGCGC GCTTGGCTAC GGAAGCTGCC AGGAAGATTA CCGACAAATC TCTTGAGGAA CTGGAACCGC AATTCAAGGA TTCCGAGGAG TCCCTGAAAC AGCTGCGGGA TACTACTATT CGACTTAAAC ACAAACTGAA CGAGAATGCT GCTGCCAAGG AACGGCTGGA GGCAAAGCAA ACGGCTATCG GGGCTCAAAA GATAGAGTGC CGCCGATGGG AAAATCTACA CGAATTGATT GGCTCGGCGG ACGGAAAGAA ATACCGGAAT TTTGCCCAGG GACTGACCTT CGAGATGATG ATCGGTCATG CCAACCGGCA GTTGCAGAAA ATGACCGATC GCTATCTGCT GGTTCGCGAT GATACTGAGC CTCTGGAACT CAACGTTGTC GACAACTATC AGGCGGGTGA GATTCGGTCC ACGAAAAACC TTTCAGGTGG CGAGAGTTTC ATCGTCAGCT TGTCGCTGGC GTTGGGACTG TCTCACATGG CCAGCAAAAA CGTTCGCGTG GATTCGCTTT TTCTGGATGA AGGCTTCGGC ACGCTGGATG AAGAAGCATT GGATACCGCT CTGGAAACTC TTGCGAGCCT GCAGCAGGAG GGTAAGTTGA TCGGCGTGAT CTCGCATGTC CCAATGCTCA AGGAGCGCAT CAGCACACAG ATTCTGGTAA TTCCTCAAAC CGGTGGAAGG AGCGAGATAT CTGGACCGGG CTGCAGAAGA AACATAACCG GCCAGATTTC CATTTAA
|
Protein sequence | MKILQVRFKN LNSLMGEWEI DLTHPAFTSD GIFAITGPTG AGKTTILDAI CLALYGRTPR LSKVTKSGNE IMSRQIGECF AEVTFETQAG RFRCHWSQHR ARKKPDGELQ APKHEIAHAD SGKIFESKIK GVADRIEAAT GMDFDQFTRS MLLAQGGFAA FLQALPDERA PILEQITGTE IYSKISIRVH ERQREEREKL NLLQAETAGI VILEQEQEKD IQQELELKQK EESALTRKLM DTDKAITWLK AVDGLRREIE DLTAEASKLQ ADIDIFKPER DKLSRAIQAA SLEGSYATLT AIRTQHADEQ KAVKTEEAVL PQLASSAKEQ AESLKLAEQQ STRAKAELEA ALPLIRKVRS LDQTLADQKK AISKDEDSCR KDTKQIDADR EVKLKEQVKR ADAEKNLKVA EDYLKENARD EWLISGLAGV EEQLENLLSR QKEIAKKEVD KEKAVTALEQ ATRKLAACRK QCEARKQELE NVSKNLQQGK DKLNQLLGDR LLREYRAEKE TLLREMVFLK KIAELEDHRA RLEDGKPCPL CGATEHPFAE GNVPIPDEIE RKIGALTELI SKAEDHETAI RHFEEDEGTA RKNLMDGEKQ EITAVNDKKT AGQVLAGLQE YLEKLQTDIT GLKQTVSAKL QPLGIEEILD SNVSALLESL RERVKAWQIQ IKRKTDIEKQ IGDFDSELKR LDAVIETQSN ALNEKRECLQ SLKEDCAAGC KERKELYGDK DPDDEERRLN KSISVSEEAE RKARVLHHDL QQKLNAAKTQ IESLKKRIGQ RDPELKELEA GFTEALGLAG FSGEKQFLEA RLPIGQRDAL SARAKNLDGH QIDLKARQKD REARLATEAA RKITDKSLEE LEPQFKDSEE SLKQLRDTTI RLKHKLNENA AAKERLEAKQ TAIGAQKIEC RRWENLHELI GSADGKKYRN FAQGLTFEMM IGHANRQLQK MTDRYLLVRD DTEPLELNVV DNYQAGEIRS TKNLSGGESF IVSLSLALGL SHMASKNVRV DSLFLDEGFG TLDEEALDTA LETLASLQQE GKLIGVISHV PMLKERISTQ ILVIPQTGGR SEISGPGCRR NITGQISI
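
The sequences above are displayed in space-separated blocks of ten residues. Before computing anything from them (for example the GC content field), the spacing has to be removed. The following is a minimal sketch with hypothetical helper names; it is demonstrated only on the first two blocks of the gene sequence and has not been run on the full sequence here.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown in this record.
prefix = clean_sequence("ATGAAGATAT TGCAGGTACG")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.4
```

Applied to the complete gene sequence, `len(clean_sequence(...))` would be expected to equal the Gene Length field and `gc_fraction(...)` to be close to the reported GC content of 50%.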