Gene Nmul_A0120 details

Gene Information

Locus tag: Nmul_A0120
Symbol:
ID: 3785768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 126615
End bp: 127904
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 45%
IMG OID: 637810190
Product: ATP-dependent OLD family endonuclease
Protein accession: YP_410821
Protein GI: 82701255
COG category: [L] Replication, recombination and repair
COG ID: [COG3593] Predicted ATP-dependent endonuclease of the OLD family
TIGRFAM ID:
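
The start and end coordinates, gene length, and protein length listed here agree with one another if the coordinates are read as inclusive and 1-based and the gene length is taken to include the stop codon. The sketch below shows that arithmetic. Both conventions are assumptions, and the function names are illustrative rather than part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(126615, 127904)
print(length)                  # 1290
print(protein_length(length))  # 429
```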


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAAAA AAATCGTACT ACATGGTTGG CGTCAGTTCC GGAACGTTGA CATCGATTTC 
CACCCACGTC TGACAGTGCT GACTGGAGCT AATGGAGCAG GTAAGACCAC GTTATTAAAT
TTGGTTAGTA GGCACTTCGG TTGGGACGGG TCATTTATAA GCACACCAGT ACCACGTCGG
TCGAACCCGA GTCTCATGTA TTCGACAGAT TTTTGGGATA TAGACGATAT CCAAGTTGAC
ATTATCCATT CCTTTGAAAG AGAGCAGAAT CGGAAAAAGC AAGCTGCTGC GCAAGGATCG
CAAACAACAA TTGGGAAAAT TATTTACGGA GATGGAACTG AGACACTAAT AACAGTACCA
AATAGCGAAG TCGGATCAAG ATACGATGTA AGTATTCCGG CACGACAACG CATCGATGGC
CTGCATATTC CTTCACACCG CGCACCGTCC ACTTATCAGC AGGTTCAAAA TATTCCGACG
ATTCCTCGCA GAAGACAGGA GGTATTTAAC CAATATCTTA GTCTTGTCCA GAGTAGATAC
CTTGGAAGCT ACACTCAATG GTCTCCACAG TATTACATGA AGGAAACGCT AATCAGTCTA
GCTACTTTTG GATATGGCAA TGCAGTTGTA GATGCAGACC CAGAGTCTGC TAGGCTCTTT
GAGGGGTTTC AGGAAATTCT TCGAAAAATG CTTCCTCCAA AACTCAGGTT TAAACGCTTA
CAAATTCGTG TTCCGGAGGT TATTCTAGAA ACAGAAACGG GAAACTTCTC TATCGATGCT
CTCTCCGGAG GGGCTGCAGC AGTAATAGAT TTAGCGTGGC AAGTATTTAT GTATGAGCCA
TCAGAAAGTG AGTTTGTGGT AACACTGGAT GAGCCTGAGA ATCATCTGCA TCCCGAACTG
CAGCAGAGAG TTTTGGCAGA TCTCCTGACA GCTTTCCCGT CCGTACAATT CGTAGTCGCT
ACCCACAGTC CATTCATTGT TGGATCTGTA CCCCATTCTC ATGTTTATGT ACTCGGATAT
GACGACAGTC GCCGCGTAAA TAGCACTCTG CTAGATACAG TAAACAAAAC TGGGACGGCC
AACGAAATAC TGAGAGACGT GCTTGGACTT GAGTTTACGA TTCCAGTCTG GGTCGAAAAC
AGGTTGGAGA ATTTGATTGA AAAATATTCG AAAAAGGATT TCACGGAAGA CAACCTAATG
ATGCTTCGTC AAGAAATGAC TTCGCTTGGT TTGGGCAAGC ACGTACCTCA AACGATTTCA
ATGCTGGCAC AGAAGAAGGA TGGGCAATGA
 
Protein sequence
MFKKIVLHGW RQFRNVDIDF HPRLTVLTGA NGAGKTTLLN LVSRHFGWDG SFISTPVPRR 
SNPSLMYSTD FWDIDDIQVD IIHSFEREQN RKKQAAAQGS QTTIGKIIYG DGTETLITVP
NSEVGSRYDV SIPARQRIDG LHIPSHRAPS TYQQVQNIPT IPRRRQEVFN QYLSLVQSRY
LGSYTQWSPQ YYMKETLISL ATFGYGNAVV DADPESARLF EGFQEILRKM LPPKLRFKRL
QIRVPEVILE TETGNFSIDA LSGGAAAVID LAWQVFMYEP SESEFVVTLD EPENHLHPEL
QQRVLADLLT AFPSVQFVVA THSPFIVGSV PHSHVYVLGY DDSRRVNSTL LDTVNKTGTA
NEILRDVLGL EFTIPVWVEN RLENLIEKYS KKDFTEDNLM MLRQEMTSLG LGKHVPQTIS
MLAQKKDGQ
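
The protein sequence can be cross-checked against the gene sequence by translating the latter with translation table 11. The sketch below assumes the sequence is pasted as plain text with the block spacing used on this page, and the helper names are illustrative. Table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so the standard codon table is used.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTTTAAAA AAATCGTACT ACATGGTTGG"))  # MFKKIVLHGW
```

Running `translate` on the full gene sequence and comparing the result with the listed protein sequence, and comparing `gc_fraction` with the reported GC content, gives a quick consistency check of the record.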