Gene Nmul_A0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0037 
Symbol 
ID3784026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp36417 
End bp39539 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table11 
GC content58% 
IMG OID637810106 
ProductATPase, E1-E2 type 
Protein accessionYP_410738 
Protein GI82701172 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0474] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCTAC TTTTCGACCA GATGCAACGC CCTTCTTTCT TCGGGACAGG TGCGACTGAA 
GCCGAGCCTC CCGAACCGGT GCGCATCGTT CATGCAGCTG TGGGGGGCCG CGCAAGGCTT
AAGGTACGGG GTTTGTATCG ATCGGAGTCC GTCAGGCGTA AGCTCGAGTC TGCCCTGCCG
AGGCATGATG CGATCAGACA TGCTTCCGCG AACATTCTCA CGGGCCGGGT ATTGATCGTT
TTTGATCCGG CACGCGACCT TGAAGAAATC AAGACCTGGT TGGAGCAACT GCTGTTGGAG
GCCGGGCATG AAATCATTCC CGGGAACCCG GTAGACGATG GCCGTTTTTT GCGTGCCTCC
GGACGTCCTG CTTCTCCCGT CATTTCTTCT GATCCCGATT CAGTTCTTCT ACAGGATCGT
AAAAATTGGC ATGCCTTGCC TGGCGATACC GTGGTGGCTG CGCTTGAAAC TTCCATGGAA
TTCGGTCTTA GCCAAGCTTC CGTGGAGCGG AAGCTTGGCT ATTTCGGGGT CAATAGCCTA
CCGGAGACGC CGCCCCGTTC CGGGCTGAGC ATTTTTCTGG GGCAGTTTAA AAGCCTGCCT
GTGGCTTTAC TCGGTGTATC CGCTGTCCTT TCCGCAGCCA CTGGAGGACT GCTCGATGCG
GCCGTGATCC TCGGAGTCGT TTTGATCAAC GCGGGTACGG GCTATGTTGC CGAGTCTCAA
TCGGAGCGAA CGATTAATGC GCTCGGACAT GTGGCCGAGC AGAACGCAAT GGTCATCCGC
GATGCACGGC TTCTGGAGGT GCCGGCCAAG TCCCTGGTGC CGGGCGACAT CCTGGTGCTG
ACACCCGGGG CTCGGGTAGC TGCTGATGGC AGGGTGCTCG AGTCGCGCAA TCTGATGATG
GATGAATCCA TGCTTACCGG AGAAAGCCTT CCGGTGGCGA AGAGAGTGAG CCCGTTGGAC
AAGCCGGAGG TCTCTCTCGC AGACCGGCTT AACATGGTAT ACATGGGTAC GGTGGTAACG
GGGGGCAGCG GCCTCTCAGT AGTGGTTTCG ACCGGCCGGT ATACCGAGAT CGGCATGATT
CAATCGCTTG TAGGTGAAAC GCGCCCTCCG GAAACACCTA TGCAGCGCCA GCTTGCGATC
CTGGGGAATC AGATGGTGTT GCTGTCCCTG GGCATTTGCG GAGCCATGTT TCTGATCGGG
CTGGTCCGTG GTCACGCTTG GCTGCAAATG TTGAAAACCT CCATTTCCCT TGCCGTGGCC
GCTGTCCCGG AAGGCCTGCC CACCGTGGCA ACGACCACTC TCGCCTTGGG TATCCGGACA
ATGAACCGTC ACAAGGTGCT GGTGCGACGC CTCGATGCAG TGGAAACCTT GGGCGCAGTA
CAGGTAATCT GCCTGGACAA AACCGGAACG CTCACTCTCA ACCGTATGTC TGTCCTGGCC
CTGTACAGCG GAGGACGCCG CATTACAGTG GATAGTGATG TGTTTTACGA TGCGGGTATA
AGAATCAATC CCTATCAGAG CGATGAATTG TTGCGGCTCC TCCATGTGGC TGTTCTTTGC
AACGAAGTGG AACTGAACGG GGAGAAAGGC AACTATCTGC TCAAGGGATC CCCAACAGAA
AGCGCATTGA TGCACCTTGC GCTGGCGGCA GGCGTTTCAG TTGAGGCGCT TCGTCGGCGC
TATCCCAGAG AGCAAGTCGA ATACCGTACC GAAACCCGGA ACTACATGAG CACGACCCAT
AGCGCGCAAG GGGAGGGAAA GCTCGTGGCT GTCAAGGGTA ATCCGGTCGA GGTGCTGGAA
ATGTGTCGGT GGTGGTTGAA GGACGGCGCG CGTTTACCCC TCACGCAGGC GGAGCGAACG
GTCATACTCA TGGAAAATGA CCGCATGGCG GCAGAAGCGC TGCGTGTGCT GGGATTTGCC
TACGTCGAAC CGAGACAAGC CGATCAATCT TCAGCGGATG AACTCATCTG GCTGGGAATG
ACGGGAATGG CAGATCCCCT GCGGCAGGGG ATGAAGGAAC TCATCGCCTT GTTTCATCAG
GCCGGGATAG ACACGGTGAT GATAACCGGT GATCAGAGCG GCACTGCATA TGCGATAGGG
AAGGAACTTG GGTTATCCGG CGGCAGCGAA CTCGACATTC TCGATTCCAC CCGGCTTGAT
CAACTTGATG CAGAGGTCCT GGCTGGGCTT GCCCAAAAGG TCAATATCTT CTCCCGCGTC
AGCCCGGCCA ACAAGCTGCA AATCGTGCAG GCACTGCAGC GCGGGGGCAA AATCGCCGCG
ATGACGGGCG ACGGCATCAA TGACGGGCCC GCCCTCAAGG CCGCGGACAT TGGCGTCGCC
ATGGGCGGCA CCGGCACGGA GGTGGCGCGC AGCGTGGCCG ACGTTATACT GGAAGACGAT
AATCTCAGCA CCATGATTGT GGCCGTCTCG GAAGGCAGGA CGATTTACAA TAATATTCGC
AAATCGATTC ATTTCCTTAC CGCCACCAAT CTGTCCGAGA TCATGCTGAT GCTCGGCTCT
ATAGGGACAG GACTGGGTAC TCCGCTCACC ACCAGTCACT TGCTGTGGAT CAACCTGGTA
ACCGATGTCT TCCCCGGGCT GGCGCTTGCT GTGGAGCCGC CCGAGCCGGA TGTGCTGCGG
CAGCCCCCGC GAGACCCGGC GGAACCCATT GTAGGTCCCG CCGACTTCAA GCGCTACGGG
CTGGAGTCGC TGGCAATGGC TGCGGGGTCA ATGGGCGGTT ATGGCTATGC CATGACACGT
TATGGCGGAG GGCAGAAGGC CAGCACGATT GCCTTCATGA CCCTGACGAT GGGGCAGTTG
CTTCACGCAT ATAGTTGCCG TTCGGATCAT ATCGGCATAT TCAGCCACGA GACGCTCCGC
TCCAATCGCT ATCTCGATCT GGCGATCGGT GGAACGGCCT TGCTCCAATG GGCTACGGTG
CTGGTGCCAG GCGTCCGCAG TCTGCTGGGT AATACCCCGA TAGGCCCACT CGATGTGGCG
GCGATAGGCG CAGGTTCTGT ACTGCCTTTT TTCCTGAATG AAGCAACCAA AGAGACTGCC
TTCAAAGGCA AGCGACGTAA AGAGCCGCTA TCTCTACCAC CAGCGCCGCA GGAGTATGGA
TGA
 
Protein sequence
MSLLFDQMQR PSFFGTGATE AEPPEPVRIV HAAVGGRARL KVRGLYRSES VRRKLESALP 
RHDAIRHASA NILTGRVLIV FDPARDLEEI KTWLEQLLLE AGHEIIPGNP VDDGRFLRAS
GRPASPVISS DPDSVLLQDR KNWHALPGDT VVAALETSME FGLSQASVER KLGYFGVNSL
PETPPRSGLS IFLGQFKSLP VALLGVSAVL SAATGGLLDA AVILGVVLIN AGTGYVAESQ
SERTINALGH VAEQNAMVIR DARLLEVPAK SLVPGDILVL TPGARVAADG RVLESRNLMM
DESMLTGESL PVAKRVSPLD KPEVSLADRL NMVYMGTVVT GGSGLSVVVS TGRYTEIGMI
QSLVGETRPP ETPMQRQLAI LGNQMVLLSL GICGAMFLIG LVRGHAWLQM LKTSISLAVA
AVPEGLPTVA TTTLALGIRT MNRHKVLVRR LDAVETLGAV QVICLDKTGT LTLNRMSVLA
LYSGGRRITV DSDVFYDAGI RINPYQSDEL LRLLHVAVLC NEVELNGEKG NYLLKGSPTE
SALMHLALAA GVSVEALRRR YPREQVEYRT ETRNYMSTTH SAQGEGKLVA VKGNPVEVLE
MCRWWLKDGA RLPLTQAERT VILMENDRMA AEALRVLGFA YVEPRQADQS SADELIWLGM
TGMADPLRQG MKELIALFHQ AGIDTVMITG DQSGTAYAIG KELGLSGGSE LDILDSTRLD
QLDAEVLAGL AQKVNIFSRV SPANKLQIVQ ALQRGGKIAA MTGDGINDGP ALKAADIGVA
MGGTGTEVAR SVADVILEDD NLSTMIVAVS EGRTIYNNIR KSIHFLTATN LSEIMLMLGS
IGTGLGTPLT TSHLLWINLV TDVFPGLALA VEPPEPDVLR QPPRDPAEPI VGPADFKRYG
LESLAMAAGS MGGYGYAMTR YGGGQKASTI AFMTLTMGQL LHAYSCRSDH IGIFSHETLR
SNRYLDLAIG GTALLQWATV LVPGVRSLLG NTPIGPLDVA AIGAGSVLPF FLNEATKETA
FKGKRRKEPL SLPPAPQEYG