Gene Nmul_A0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0143 
Symbol 
ID3784115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp150565 
End bp152610 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content55% 
IMG OID637810214 
ProductATP-dependent OLD family endonuclease 
Protein accessionYP_410844 
Protein GI82701278 
COG category[L] Replication, recombination and repair 
COG ID[COG3593] Predicted ATP-dependent endonuclease of the OLD family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.128728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATAC ATATATTGCC TGTCAAAGAA AAAGACACAG ACTTTATGAA TAAGCAACCA 
TTGCAAGACA TGCTGCTCCT TCCCCGAATA TCAGACATAC CAGAGATACC AGAGCCTCAG
GATCATGCGG AAGCCACAAG CAACAGCGAA AGTCCCGTTG AAAGTCCTGT TGACAGCCCG
GTTGACGCCC CGGCCCCGGG AGAAGCAATA GACGACAGAA TAATACACCC CATGCACCTT
ACCCGATTTC TGGTCACAAA TTTTCGCTCG GTGGAGAACA GTGGCTGGAT TGAGGTTGAC
AGCGTTACCG CGCTCATCGG CGTGAACGAA TCCGGCAAGA CAAACCTGCT GGTTCCGCTC
TGGAAGCTGA ATCCGGCAGT CGGGGGAGAG ATCGTGCCGG CTTCCGACTA CCCGAAAAAA
CAATTCGGCC CTGTGCGCCA GGCACCGGAA AGCTTCCACT TCATAACGGC GGAATTTGAG
GCAGGAGAGC TTGGAGAAGA GCTGTGCGGC AAGCTGGAGA TCTCCAGCGA AGAGGCCTCC
CTCATCGCTG TGAAACGGTA CTTCAATGGG GATTATTCGA TCTCCTTCCC CCGGCGGGAA
GAGCTTGGCG AGATCAGGAG ATCATCGGGG GAAGAGGCAG AGGATGTTGT GGATACGGTC
ATCCACTCCC TTCCCAAATT TGTCTATTAC TCTGAATTCG GAAATCTCGA TTCTGAAATC
TATCTTCCCC ACGTAGTACA GAATCTTCAA CGCACTGACC TTGGGCCCAG AGAAGCAGCC
AAGTCGCGTA CATTGCGCGT TCTTTTTAAA TTCGTAGGCC TCGAGCCGGG AGAAATTCTT
GAACTCGGGC GGGACTTTCC AAGGCGCAAG GGCCGTCGTC GTGAACCCAC AGTGGAAGAA
ATCCGGGAAA TCGCGATGAA GAAAAGGGAG CGTTCCATCC TGCTGCAATC GGCAGGCGGG
CTTCTGACGG AAAAGTTCAG AAACTGGTGG AAGCAGGGAG ATTACAAATT CCGCTTCGAA
GCGGATGGAA GCCATTTCCG CATATGGGTT TCCGATGACC GCCGCCCGGA GGAAGTTGAG
CTCGAAAGCA GAAGCACGGG GCTGCAGTGG TTTCTGAGTT TTTATCTGGT CTTCCTCGTG
GAAAGCGGGG GTGAGCACCA GAACGCCGTC TTGTTGCTGG ACGAGCCCGG ATTGTCCCTG
CACCCCCTGG CGCAGCGCAA CCTTTTTGCC TTCTTTGACA GTCTTGCAAA AGTCAACAAA
ATCCTCTACA CGACCCATTC ACCATTCCTG CTGGATGCAG AGCACCTGGG ACGCGCGCGC
AAGGTATATG TTTCCGCGAA CGGAACGACG CAAGTGACGT CGGACCTTCG TAGCGTGGAA
AAGGATGTCC GGCAGGCGGG AGCCGCCTAT GTGGTTCACT CGGCCTTGAA CCTGAATATT
GCCGAGAGCA TGGTAACGGG CTGTCAACCG GTCGTCGTGC AGCGCGTCTC GGATCAATAT
TATCTCTCGA CCATCAAGAC ACTCCTCATT GGGGCAAACA AGATAGCCCC GAAGCGCGAA
CTGCTCTTTC CGCCGGCTGG CGCCAGCAAA ACGGTAGGCG TGATCGCAAG CATCATTAGT
GGGCGCGACG AGCCGCTGCC CAAGGTAGTC CTGCGGGATG ACGAAGCAGG AAAGCAGATC
AATGCAGAGT TGAAGTCGGA TTTGTACAGC GACGCGCAGG AGCGTATCTT TTCGACGGAT
AATTACGTTC TCTTCAGCAA CTCCACGACA GAAGATCTGA TACCTGCACC GTTTTTTGCA
CAAGTAATAG ACCGCTGGGA GAGGAATACG GACACGGCCT TTGCGGACGT CCTGGTCAAC
GGCAAGCCCG TGGTGGCGCA GGTGCAGGCC TGGGCCGAAG CACAGGACCT GGTGTTACGG
GATGAATGGA GAGTTGAAGT TGCAAAGCGC GTCAAGGAAC AGGCGCTCAA AAAGGGCATC
GGCGCGTTTG ATGCTGGCAC CATTGCGCGC TGGGCAAAAC TCTTTCAGGA GCTGATCCGG
AACTGA
 
Protein sequence
MTIHILPVKE KDTDFMNKQP LQDMLLLPRI SDIPEIPEPQ DHAEATSNSE SPVESPVDSP 
VDAPAPGEAI DDRIIHPMHL TRFLVTNFRS VENSGWIEVD SVTALIGVNE SGKTNLLVPL
WKLNPAVGGE IVPASDYPKK QFGPVRQAPE SFHFITAEFE AGELGEELCG KLEISSEEAS
LIAVKRYFNG DYSISFPRRE ELGEIRRSSG EEAEDVVDTV IHSLPKFVYY SEFGNLDSEI
YLPHVVQNLQ RTDLGPREAA KSRTLRVLFK FVGLEPGEIL ELGRDFPRRK GRRREPTVEE
IREIAMKKRE RSILLQSAGG LLTEKFRNWW KQGDYKFRFE ADGSHFRIWV SDDRRPEEVE
LESRSTGLQW FLSFYLVFLV ESGGEHQNAV LLLDEPGLSL HPLAQRNLFA FFDSLAKVNK
ILYTTHSPFL LDAEHLGRAR KVYVSANGTT QVTSDLRSVE KDVRQAGAAY VVHSALNLNI
AESMVTGCQP VVVQRVSDQY YLSTIKTLLI GANKIAPKRE LLFPPAGASK TVGVIASIIS
GRDEPLPKVV LRDDEAGKQI NAELKSDLYS DAQERIFSTD NYVLFSNSTT EDLIPAPFFA
QVIDRWERNT DTAFADVLVN GKPVVAQVQA WAEAQDLVLR DEWRVEVAKR VKEQALKKGI
GAFDAGTIAR WAKLFQELIR N