Gene Nmul_A0204 details

Gene Information

Locus tag: Nmul_A0204
Symbol:
ID: 3785877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 215482
End bp: 218748
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 50%
IMG OID: 637810275
Product: ATP-dependent dsDNA exonuclease (SbcC)-like
Protein accession: YP_410904
Protein GI: 82701338
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00618] exonuclease SbcC
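
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions, although they are the conventions the numbers agree with. Variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 215482
end_bp = 218748

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 3267 0 1088
```

A remainder of 0 indicates that the gene length is a whole number of codons, as expected for an unspliced CDS.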


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.18794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATAT TGCAGGTACG ATTCAAGAAT CTGAACTCGC TGATGGGTGA ATGGGAAATC 
GACCTGACGC ATCCCGCCTT TACCTCCGAC GGTATTTTTG CCATCACCGG TCCGACCGGT
GCCGGGAAGA CAACCATCCT CGATGCCATC TGCCTCGCTC TCTACGGGCG GACCCCTCGG
TTGAGCAAGG TCACCAAAAG CGGAAACGAG ATAATGTCCC GTCAGATCGG CGAATGTTTT
GCTGAAGTGA CGTTTGAGAC ACAGGCCGGT CGTTTCCGTT GCCACTGGAG CCAGCACCGG
GCACGTAAAA AACCGGATGG AGAACTACAG GCTCCGAAAC ATGAGATTGC TCATGCCGAT
TCGGGGAAGA TTTTCGAATC CAAAATCAAG GGAGTTGCTG ACCGGATCGA GGCGGCTACC
GGTATGGATT TCGACCAATT CACCCGCTCC ATGTTGCTGG CTCAGGGAGG CTTTGCCGCA
TTCCTGCAGG CTTTACCCGA TGAGCGAGCG CCGATCCTGG AGCAGATAAC GGGCACGGAG
ATTTATAGCA AAATATCGAT CCGCGTCCAT GAGCGCCAGC GTGAAGAGCG GGAGAAACTG
AACCTGCTTC AGGCTGAAAC AGCCGGTATC GTGATTCTGG AGCAGGAACA GGAAAAAGAT
ATTCAACAAG AACTCGAGTT AAAACAGAAA GAAGAGTCCG CTCTGACCAG AAAATTAATG
GATACGGATA AAGCCATCAC GTGGCTGAAG GCCGTAGATG GACTGAGGAG GGAAATCGAG
GATCTGACAG CAGAAGCGAG CAAACTGCAA GCCGACATCG ACATTTTTAA ACCAGAGCGT
GACAAACTCA GCCGGGCAAT TCAGGCAGCG TCACTGGAAG GTTCCTACGC GACGTTGACA
GCAATCCGAA CACAACATGC GGATGAGCAG AAGGCAGTGA AAACTGAGGA AGCAGTCCTT
CCTCAACTGG CATCCTCTGC AAAGGAACAG GCTGAATCCC TGAAATTGGC TGAGCAGCAG
TCCACCAGAG CCAAAGCGGA GCTGGAAGCT GCTTTACCGC TCATCAGGAA GGTCCGTTCG
CTCGATCAGA CGCTTGCCGA TCAGAAAAAG GCTATTTCGA AAGATGAAGA CAGCTGCAGG
AAAGACACGA AACAGATCGA TGCTGACAGA GAGGTCAAGC TGAAAGAACA GGTGAAACGC
GCCGATGCTG AAAAGAATCT GAAGGTTGCT GAAGATTATC TCAAGGAGAA CGCTCGGGAT
GAGTGGCTGA TAAGCGGTCT GGCCGGTGTT GAGGAACAAC TGGAAAATTT GCTCTCCAGG
CAAAAGGAAA TAGCAAAAAA AGAGGTCGAT AAAGAGAAGG CAGTGACGGC TCTGGAACAG
GCAACAAGAA AACTAGCTGC CTGCCGGAAG CAATGCGAGG CTCGGAAACA GGAGCTGGAG
AACGTCTCAA AGAATCTCCA ACAAGGCAAA GATAAATTAA ACCAGTTATT GGGTGACCGG
CTGTTGCGTG AATACCGTGC AGAAAAGGAA ACCCTGCTTC GAGAAATGGT TTTTCTGAAA
AAAATAGCGG AACTTGAAGA TCACCGAGCC AGATTGGAAG ATGGCAAACC CTGTCCGCTT
TGTGGCGCAA CGGAGCATCC CTTTGCGGAA GGCAACGTCC CTATTCCCGA TGAAATCGAA
CGGAAGATTG GGGCGCTCAC TGAACTGATA AGCAAGGCTG AGGATCATGA AACCGCTATC
AGACATTTTG AGGAAGACGA AGGCACAGCC CGTAAAAATT TGATGGACGG TGAAAAGCAG
GAAATAACAG CAGTTAATGA CAAAAAAACT GCCGGGCAAG TCCTGGCCGG GCTGCAGGAA
TATCTGGAAA AACTCCAAAC TGATATTACC GGACTGAAGC AGACTGTATC AGCCAAACTC
CAACCGCTCG GCATTGAGGA AATTCTGGAT TCCAATGTTT CAGCACTGCT GGAATCCTTG
CGAGAACGCG TTAAAGCATG GCAGATACAG ATCAAGAGAA AGACTGATAT CGAGAAACAG
ATCGGAGATT TCGATAGCGA GTTGAAAAGG TTGGACGCAG TCATTGAAAC TCAAAGCAAT
GCATTGAATG AAAAACGGGA ATGTCTGCAG TCCCTCAAAG AGGATTGCGC CGCTGGATGC
AAAGAGCGGA AAGAATTGTA CGGTGATAAA GATCCTGATG ATGAAGAACG GCGTTTGAAC
AAGAGTATTT CCGTTTCAGA GGAGGCTGAA AGGAAGGCCA GAGTCCTGCA TCATGATCTT
CAGCAGAAAT TGAATGCTGC CAAGACTCAA ATTGAATCTC TGAAAAAGCG CATTGGTCAA
AGAGACCCGG AGCTGAAGGA ACTGGAAGCC GGATTTACAG AAGCATTGGG ATTAGCTGGT
TTTTCGGGTG AAAAACAGTT TCTTGAAGCC AGGCTCCCGA TTGGGCAAAG GGATGCGCTA
TCTGCAAGGG CAAAAAATCT GGACGGTCAC CAAATAGATC TCAAGGCCAG ACAAAAGGAT
CGGGAAGCGC GCTTGGCTAC GGAAGCTGCC AGGAAGATTA CCGACAAATC TCTTGAGGAA
CTGGAACCGC AATTCAAGGA TTCCGAGGAG TCCCTGAAAC AGCTGCGGGA TACTACTATT
CGACTTAAAC ACAAACTGAA CGAGAATGCT GCTGCCAAGG AACGGCTGGA GGCAAAGCAA
ACGGCTATCG GGGCTCAAAA GATAGAGTGC CGCCGATGGG AAAATCTACA CGAATTGATT
GGCTCGGCGG ACGGAAAGAA ATACCGGAAT TTTGCCCAGG GACTGACCTT CGAGATGATG
ATCGGTCATG CCAACCGGCA GTTGCAGAAA ATGACCGATC GCTATCTGCT GGTTCGCGAT
GATACTGAGC CTCTGGAACT CAACGTTGTC GACAACTATC AGGCGGGTGA GATTCGGTCC
ACGAAAAACC TTTCAGGTGG CGAGAGTTTC ATCGTCAGCT TGTCGCTGGC GTTGGGACTG
TCTCACATGG CCAGCAAAAA CGTTCGCGTG GATTCGCTTT TTCTGGATGA AGGCTTCGGC
ACGCTGGATG AAGAAGCATT GGATACCGCT CTGGAAACTC TTGCGAGCCT GCAGCAGGAG
GGTAAGTTGA TCGGCGTGAT CTCGCATGTC CCAATGCTCA AGGAGCGCAT CAGCACACAG
ATTCTGGTAA TTCCTCAAAC CGGTGGAAGG AGCGAGATAT CTGGACCGGG CTGCAGAAGA
AACATAACCG GCCAGATTTC CATTTAA
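
The gene sequence is printed in space-separated blocks of 10 bases, so the blocks need to be joined before any analysis. As a minimal sketch, the snippet below cleans up the first line of the sequence and computes its GC fraction. The helper names `clean_sequence` and `gc_fraction` are made up for this example; the same functions could be applied to the full sequence to compare against the GC content reported in the gene information.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied verbatim including block spacing.
first_line = "ATGAAGATAT TGCAGGTACG ATTCAAGAAT CTGAACTCGC TGATGGGTGA ATGGGAAATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Note that a single 60-base line is a small sample, so its GC fraction need not match the value for the whole gene.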
 
Protein sequence
MKILQVRFKN LNSLMGEWEI DLTHPAFTSD GIFAITGPTG AGKTTILDAI CLALYGRTPR 
LSKVTKSGNE IMSRQIGECF AEVTFETQAG RFRCHWSQHR ARKKPDGELQ APKHEIAHAD
SGKIFESKIK GVADRIEAAT GMDFDQFTRS MLLAQGGFAA FLQALPDERA PILEQITGTE
IYSKISIRVH ERQREEREKL NLLQAETAGI VILEQEQEKD IQQELELKQK EESALTRKLM
DTDKAITWLK AVDGLRREIE DLTAEASKLQ ADIDIFKPER DKLSRAIQAA SLEGSYATLT
AIRTQHADEQ KAVKTEEAVL PQLASSAKEQ AESLKLAEQQ STRAKAELEA ALPLIRKVRS
LDQTLADQKK AISKDEDSCR KDTKQIDADR EVKLKEQVKR ADAEKNLKVA EDYLKENARD
EWLISGLAGV EEQLENLLSR QKEIAKKEVD KEKAVTALEQ ATRKLAACRK QCEARKQELE
NVSKNLQQGK DKLNQLLGDR LLREYRAEKE TLLREMVFLK KIAELEDHRA RLEDGKPCPL
CGATEHPFAE GNVPIPDEIE RKIGALTELI SKAEDHETAI RHFEEDEGTA RKNLMDGEKQ
EITAVNDKKT AGQVLAGLQE YLEKLQTDIT GLKQTVSAKL QPLGIEEILD SNVSALLESL
RERVKAWQIQ IKRKTDIEKQ IGDFDSELKR LDAVIETQSN ALNEKRECLQ SLKEDCAAGC
KERKELYGDK DPDDEERRLN KSISVSEEAE RKARVLHHDL QQKLNAAKTQ IESLKKRIGQ
RDPELKELEA GFTEALGLAG FSGEKQFLEA RLPIGQRDAL SARAKNLDGH QIDLKARQKD
REARLATEAA RKITDKSLEE LEPQFKDSEE SLKQLRDTTI RLKHKLNENA AAKERLEAKQ
TAIGAQKIEC RRWENLHELI GSADGKKYRN FAQGLTFEMM IGHANRQLQK MTDRYLLVRD
DTEPLELNVV DNYQAGEIRS TKNLSGGESF IVSLSLALGL SHMASKNVRV DSLFLDEGFG
TLDEEALDTA LETLASLQQE GKLIGVISHV PMLKERISTQ ILVIPQTGGR SEISGPGCRR
NITGQISI
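
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a rough sketch, the code below translates the first and last few codons of the gene. It relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons). The `translate` helper is written for this example and is not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene, and its last line (which is in frame).
print(translate("ATGAAGATAT TGCAGGTACG ATTCAAGAAT"))  # MKILQVRFKN
print(translate("AACATAACCG GCCAGATTTC CATTTAA"))     # NITGQISI
```

Both results match the start and the end of the protein sequence listed above, and the final TAA codon is the stop.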