Gene Nmul_A1979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1979 
Symbol 
ID3785003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2275847 
End bp2277193 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content55% 
IMG OID637812068 
Producthypothetical protein 
Protein accessionYP_412666 
Protein GI82703100 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0621] 2-methylthioadenine synthetase 
TIGRFAM ID[TIGR00089] RNA modification enzyme, MiaB family
[TIGR01125] MiaB-like tRNA modifying enzyme YliG, TIGR01125 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCTC TGAAACAGCA GACTCCCCGT ATCGGCTTTG TCTCCCTCGG TTGCCCGAAG 
GCCTTGGTAG ATTCCGAGCA GATCCTTACC CAGCTTCGTG CCGAAGGTTA TGAAACTTCC
TCTACCTATG AGGATGCGGA CCTTGTAGTC GTCAATACCT GCGGTTTTAT CGATAGCGCA
GTGGAAGAGT CACTCGACGC TATCGGCGAG GCGCTGGCGG AAAACGGCAA GGTGATCGTC
ACCGGTTGCC TTGGAGCAAA AGAAGGCGGG GACGTGGTCA AGCAAGCCCA TCCACAAGTA
CTGGCGGTGA CAGGACCCCA CGCCCTGCCC GAAGTAATGG CTGCGGTCCA CATGCATCTG
CCCCAGCCGC ACGATCCCTA TACCAGTCTC ATTCCTCCAC AAGGCATAAA GCTGACGCCC
CGGCATTACG CTTATCTCAA GATCTCCGAA GGTTGCAATC ACCGCTGTAC CTTCTGCATC
ATCCCTTCGA TGCGCGGCGA CCTCGTCAGT CGCCCCATCC ATCAGGTGAT GGAGGAAGCG
GAAAACCTGG TGAACGCAGG GGTCAGGGAA TTGCTCGTCA TTTCGCAGGA TACGAGCGCT
TATGGGGTAG ACGTCAAATA CCGCACCGGT TTCTGGCAAG GCAGGCCGCT GAAAACGCGG
ATGACCGACT TGGCGCGCTC GCTCGGCGAA TTGGGAGTAT GGGTTCGCCT CCATTATGTT
TACCCTTACC CGCATGTCGA TGAAGTGATT CCGCTCATGG CCGAGGGGAA AATTCTTCCA
TATCTCGATG TACCGTTTCA GCACGCTAGT CCCCGTATCT TGAAGGCAAT GAAACGTCCG
GCCAACTCCG AAAACAATCT TTCCCGCATT CGGCGATGGC GTGAAGTCTG TCCGGATATC
ACCCTGCGCA GTACCTTCAT TGTCGGCTTT CCCGGAGAAA CAGAGGCGGA ATTCGAACAA
CTCCTGGAGT TTCTCGAGGA AGCGCAACTC GATCGTGTTG GCTGCTTTGC CTATTCACCT
GTCGAGGGTG CGGCGGCGAA CGCTCTTCCC GATCCCGTGC CGGAAGAGGT GAAAGAAGAG
CGACGGGCGT GTTTCATGGC AATACAGGAA AAAATCAGTG CTGAACGCCT GGCCCGCAAA
ATCGGCAAAC GCATGATTGT CCTGATAGAC GACGTAAGCA AAAACAAGGC TGTCGCCCGT
AGTACTGCCG ACGCTCCGGA AATCGATGGC CTGGTTTATA TCGGCAAGGC AAAAAACGTA
AAACCGGGTG AATTTATTGA AGTTGAAATT ATCCGCTCCG ACCCCCACGA TCTGCACGCT
CGACAGGTCA GCGACAACCG AACGTAA
 
Protein sequence
MASLKQQTPR IGFVSLGCPK ALVDSEQILT QLRAEGYETS STYEDADLVV VNTCGFIDSA 
VEESLDAIGE ALAENGKVIV TGCLGAKEGG DVVKQAHPQV LAVTGPHALP EVMAAVHMHL
PQPHDPYTSL IPPQGIKLTP RHYAYLKISE GCNHRCTFCI IPSMRGDLVS RPIHQVMEEA
ENLVNAGVRE LLVISQDTSA YGVDVKYRTG FWQGRPLKTR MTDLARSLGE LGVWVRLHYV
YPYPHVDEVI PLMAEGKILP YLDVPFQHAS PRILKAMKRP ANSENNLSRI RRWREVCPDI
TLRSTFIVGF PGETEAEFEQ LLEFLEEAQL DRVGCFAYSP VEGAAANALP DPVPEEVKEE
RRACFMAIQE KISAERLARK IGKRMIVLID DVSKNKAVAR STADAPEIDG LVYIGKAKNV
KPGEFIEVEI IRSDPHDLHA RQVSDNRT