Gene Nmul_A0452 details

Gene Information

Locus tag: Nmul_A0452
Symbol:
ID: 3785999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 501630
End bp: 502910
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 54%
IMG OID: 637810528
Product: ribonuclease BN, putative
Protein accession: YP_411152
Protein GI: 82701586
COG category: [S] Function unknown
COG ID: [COG1295] Predicted membrane protein
TIGRFAM ID: [TIGR00765] YihY family protein (not ribonuclease BN)
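
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


gene_bp, protein_aa = cds_lengths(501630, 502910)
print(gene_bp, protein_aa)
```

For this record the result agrees with the listed Gene Length (1281 bp) and Protein Length (426 aa).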


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGCTTT TCTCTCAATC CTCACGACCT GTCGCGAAAG TAATGAAATC CATACGTCCC 
GTCGATTTCA TGCACTATGT CCTTGTGCGC TTTTTCCAGC ACAACTGCAC CCAGATTGCA
GGGAGTCTTA CATTCACCAC CTTGCTTTCG TTGGTACCAA TGCTCGCGAT CGGGTTATCG
GTAATAGCGG CATTTCCCGC ATTCGCTGAA TTCTCGGACC GGATAAAGGA ATTCATTCTC
ACCACCATGG TGCCGGAAGC AGCAAACAAG GTCATCTCCG TATACATGCA GCAATTCGCC
GACAATGCGG CCAAGCTGAC TGCCATAGGC ATCGCCTTCC TGGGCGTAAC CGCACTTGCG
CTCATGCTTA CAATAGACGA AGCACTCAAC AGCATCTGGC GAGTATCCCG CCTGCGGCCG
CTCCTGCATC GCCTCCTGAT ATATTGGTCC GTTCTGACGA TTGGCCCTTT ATTGATCGGC
GCGAGCCTGT CGCTTACGTC CTGGCTCATG ACCGCTTCCA GGGGATTTAC TCGTGATATC
CCGGGGGGGG ACATCATGCT TTTGCGGTTG AGTCCGCTCG TGCTGACGAG TATTGCCTTT
TCGGCTTCCT ATCTCATCGT GCCGAATCGT CAGGTGGCAT GGCGGCATGC GATAGCGGGT
GGCGTGGCGG CGGCGATAGG GTTCGAGATA ATGAAGGAAG GCTTTGCGTT CTACATCACC
CGGTTTCCGA CTTATCAGGC AGTATACGGC ACCTTTGCGA CCATTCCCAT TTTCCTGCTA
TGGCTTTATC TTTCATGGTT GATGGTGCTG CTCGGAGCGG TTATTGCCGC ATCGCTTTCA
AGCTGGCGTT TCCGGGAGTG GCGTGACGAC CCGAACGCCA GGGGTAAGCA GTTTTTTGAT
GCATTGCGTT TATTGGGGAT ACTTGGAGAG GCGTTGAAGG CAGGCAAAGT TGAAACCGCT
CTCAGCTTGC AGCAGCAGTT GATGCTAAGT CCCGAAGAAG TGGAGCGGAT ACTGGAGCTC
ATGGTGAAAG CCAATTTCGT GCGACAGGTT CAGGAAGGAG GATGGGTTCA AATACTGGAT
CCCGCCGAGA TCCGCATTGC GGATGTCTAC CGCCTGTTTG CGTTTCGTCC CGAAGCACTA
AGGGGTACGG CAGGGGGGGA TACCCGGCTG GAGCAGCTGC TCGATGATAT TGCTGTGGGG
ATCGATGAGA AAATGAGCCT TCCGCTGTCG CAACTTTTTA CTTCCGCGGA ACCGGAACCA
CCCGCAGAAA TGTCAGCTTA G
 
Protein sequence
MELFSQSSRP VAKVMKSIRP VDFMHYVLVR FFQHNCTQIA GSLTFTTLLS LVPMLAIGLS 
VIAAFPAFAE FSDRIKEFIL TTMVPEAANK VISVYMQQFA DNAAKLTAIG IAFLGVTALA
LMLTIDEALN SIWRVSRLRP LLHRLLIYWS VLTIGPLLIG ASLSLTSWLM TASRGFTRDI
PGGDIMLLRL SPLVLTSIAF SASYLIVPNR QVAWRHAIAG GVAAAIGFEI MKEGFAFYIT
RFPTYQAVYG TFATIPIFLL WLYLSWLMVL LGAVIAASLS SWRFREWRDD PNARGKQFFD
ALRLLGILGE ALKAGKVETA LSLQQQLMLS PEEVERILEL MVKANFVRQV QEGGWVQILD
PAEIRIADVY RLFAFRPEAL RGTAGGDTRL EQLLDDIAVG IDEKMSLPLS QLFTSAEPEP
PAEMSA
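
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch that translates a coding sequence using the amino acid assignments of translation table 11. This is a sketch under stated assumptions, not the pipeline that produced this record: the helper `translate_cds` is hypothetical, and the list of alternative start codons reflects the NCBI definition of table 11 as understood here. The gene begins with GTG, which is read as Met when it is the initiation codon.

```python
from itertools import product

# Codon order T, C, A, G as used in NCBI genetic code tables.
BASES = "TCAG"
# Amino acid assignments for table 11 (identical to the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed set of initiation codons for table 11; each is read as Met at the start.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGGAGCTTT TCTCTCAATC CTCACGACCT GTCGCGAAAG TAATGAAATC CATACGTCCC"
print(translate_cds(first_line))
```

Applied to the first 60 bases of the gene, the sketch yields the first 20 residues of the protein sequence listed above; the final 21 bases, ending in the TAG stop codon, yield the last six residues.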