Gene Information |
Locus tag | Nmul_A0037 |
Symbol | |
ID | 3784026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 36417 |
End bp | 39539 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637810106 |
Product | ATPase, E1-E2 type |
Protein accession | YP_410738 |
Protein GI | 82701172 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0474] Cation transport ATPase |
TIGRFAM ID | [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC |
|
|
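The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 36417
end_bp = 39539

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single 3 bp stop codon that is not translated.
stop_codon_len = 3
protein_length = (gene_length - stop_codon_len) // 3

print(gene_length)     # 3123, matching "Gene Length"
print(protein_length)  # 1040, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows.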
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCTAC TTTTCGACCA GATGCAACGC CCTTCTTTCT TCGGGACAGG TGCGACTGAA GCCGAGCCTC CCGAACCGGT GCGCATCGTT CATGCAGCTG TGGGGGGCCG CGCAAGGCTT AAGGTACGGG GTTTGTATCG ATCGGAGTCC GTCAGGCGTA AGCTCGAGTC TGCCCTGCCG AGGCATGATG CGATCAGACA TGCTTCCGCG AACATTCTCA CGGGCCGGGT ATTGATCGTT TTTGATCCGG CACGCGACCT TGAAGAAATC AAGACCTGGT TGGAGCAACT GCTGTTGGAG GCCGGGCATG AAATCATTCC CGGGAACCCG GTAGACGATG GCCGTTTTTT GCGTGCCTCC GGACGTCCTG CTTCTCCCGT CATTTCTTCT GATCCCGATT CAGTTCTTCT ACAGGATCGT AAAAATTGGC ATGCCTTGCC TGGCGATACC GTGGTGGCTG CGCTTGAAAC TTCCATGGAA TTCGGTCTTA GCCAAGCTTC CGTGGAGCGG AAGCTTGGCT ATTTCGGGGT CAATAGCCTA CCGGAGACGC CGCCCCGTTC CGGGCTGAGC ATTTTTCTGG GGCAGTTTAA AAGCCTGCCT GTGGCTTTAC TCGGTGTATC CGCTGTCCTT TCCGCAGCCA CTGGAGGACT GCTCGATGCG GCCGTGATCC TCGGAGTCGT TTTGATCAAC GCGGGTACGG GCTATGTTGC CGAGTCTCAA TCGGAGCGAA CGATTAATGC GCTCGGACAT GTGGCCGAGC AGAACGCAAT GGTCATCCGC GATGCACGGC TTCTGGAGGT GCCGGCCAAG TCCCTGGTGC CGGGCGACAT CCTGGTGCTG ACACCCGGGG CTCGGGTAGC TGCTGATGGC AGGGTGCTCG AGTCGCGCAA TCTGATGATG GATGAATCCA TGCTTACCGG AGAAAGCCTT CCGGTGGCGA AGAGAGTGAG CCCGTTGGAC AAGCCGGAGG TCTCTCTCGC AGACCGGCTT AACATGGTAT ACATGGGTAC GGTGGTAACG GGGGGCAGCG GCCTCTCAGT AGTGGTTTCG ACCGGCCGGT ATACCGAGAT CGGCATGATT CAATCGCTTG TAGGTGAAAC GCGCCCTCCG GAAACACCTA TGCAGCGCCA GCTTGCGATC CTGGGGAATC AGATGGTGTT GCTGTCCCTG GGCATTTGCG GAGCCATGTT TCTGATCGGG CTGGTCCGTG GTCACGCTTG GCTGCAAATG TTGAAAACCT CCATTTCCCT TGCCGTGGCC GCTGTCCCGG AAGGCCTGCC CACCGTGGCA ACGACCACTC TCGCCTTGGG TATCCGGACA ATGAACCGTC ACAAGGTGCT GGTGCGACGC CTCGATGCAG TGGAAACCTT GGGCGCAGTA CAGGTAATCT GCCTGGACAA AACCGGAACG CTCACTCTCA ACCGTATGTC TGTCCTGGCC CTGTACAGCG GAGGACGCCG CATTACAGTG GATAGTGATG TGTTTTACGA TGCGGGTATA AGAATCAATC CCTATCAGAG CGATGAATTG TTGCGGCTCC TCCATGTGGC TGTTCTTTGC AACGAAGTGG AACTGAACGG GGAGAAAGGC AACTATCTGC TCAAGGGATC CCCAACAGAA AGCGCATTGA TGCACCTTGC GCTGGCGGCA GGCGTTTCAG TTGAGGCGCT TCGTCGGCGC TATCCCAGAG AGCAAGTCGA ATACCGTACC GAAACCCGGA ACTACATGAG CACGACCCAT AGCGCGCAAG GGGAGGGAAA GCTCGTGGCT GTCAAGGGTA ATCCGGTCGA GGTGCTGGAA 
ATGTGTCGGT GGTGGTTGAA GGACGGCGCG CGTTTACCCC TCACGCAGGC GGAGCGAACG GTCATACTCA TGGAAAATGA CCGCATGGCG GCAGAAGCGC TGCGTGTGCT GGGATTTGCC TACGTCGAAC CGAGACAAGC CGATCAATCT TCAGCGGATG AACTCATCTG GCTGGGAATG ACGGGAATGG CAGATCCCCT GCGGCAGGGG ATGAAGGAAC TCATCGCCTT GTTTCATCAG GCCGGGATAG ACACGGTGAT GATAACCGGT GATCAGAGCG GCACTGCATA TGCGATAGGG AAGGAACTTG GGTTATCCGG CGGCAGCGAA CTCGACATTC TCGATTCCAC CCGGCTTGAT CAACTTGATG CAGAGGTCCT GGCTGGGCTT GCCCAAAAGG TCAATATCTT CTCCCGCGTC AGCCCGGCCA ACAAGCTGCA AATCGTGCAG GCACTGCAGC GCGGGGGCAA AATCGCCGCG ATGACGGGCG ACGGCATCAA TGACGGGCCC GCCCTCAAGG CCGCGGACAT TGGCGTCGCC ATGGGCGGCA CCGGCACGGA GGTGGCGCGC AGCGTGGCCG ACGTTATACT GGAAGACGAT AATCTCAGCA CCATGATTGT GGCCGTCTCG GAAGGCAGGA CGATTTACAA TAATATTCGC AAATCGATTC ATTTCCTTAC CGCCACCAAT CTGTCCGAGA TCATGCTGAT GCTCGGCTCT ATAGGGACAG GACTGGGTAC TCCGCTCACC ACCAGTCACT TGCTGTGGAT CAACCTGGTA ACCGATGTCT TCCCCGGGCT GGCGCTTGCT GTGGAGCCGC CCGAGCCGGA TGTGCTGCGG CAGCCCCCGC GAGACCCGGC GGAACCCATT GTAGGTCCCG CCGACTTCAA GCGCTACGGG CTGGAGTCGC TGGCAATGGC TGCGGGGTCA ATGGGCGGTT ATGGCTATGC CATGACACGT TATGGCGGAG GGCAGAAGGC CAGCACGATT GCCTTCATGA CCCTGACGAT GGGGCAGTTG CTTCACGCAT ATAGTTGCCG TTCGGATCAT ATCGGCATAT TCAGCCACGA GACGCTCCGC TCCAATCGCT ATCTCGATCT GGCGATCGGT GGAACGGCCT TGCTCCAATG GGCTACGGTG CTGGTGCCAG GCGTCCGCAG TCTGCTGGGT AATACCCCGA TAGGCCCACT CGATGTGGCG GCGATAGGCG CAGGTTCTGT ACTGCCTTTT TTCCTGAATG AAGCAACCAA AGAGACTGCC TTCAAAGGCA AGCGACGTAA AGAGCCGCTA TCTCTACCAC CAGCGCCGCA GGAGTATGGA TGA
|
Protein sequence | MSLLFDQMQR PSFFGTGATE AEPPEPVRIV HAAVGGRARL KVRGLYRSES VRRKLESALP RHDAIRHASA NILTGRVLIV FDPARDLEEI KTWLEQLLLE AGHEIIPGNP VDDGRFLRAS GRPASPVISS DPDSVLLQDR KNWHALPGDT VVAALETSME FGLSQASVER KLGYFGVNSL PETPPRSGLS IFLGQFKSLP VALLGVSAVL SAATGGLLDA AVILGVVLIN AGTGYVAESQ SERTINALGH VAEQNAMVIR DARLLEVPAK SLVPGDILVL TPGARVAADG RVLESRNLMM DESMLTGESL PVAKRVSPLD KPEVSLADRL NMVYMGTVVT GGSGLSVVVS TGRYTEIGMI QSLVGETRPP ETPMQRQLAI LGNQMVLLSL GICGAMFLIG LVRGHAWLQM LKTSISLAVA AVPEGLPTVA TTTLALGIRT MNRHKVLVRR LDAVETLGAV QVICLDKTGT LTLNRMSVLA LYSGGRRITV DSDVFYDAGI RINPYQSDEL LRLLHVAVLC NEVELNGEKG NYLLKGSPTE SALMHLALAA GVSVEALRRR YPREQVEYRT ETRNYMSTTH SAQGEGKLVA VKGNPVEVLE MCRWWLKDGA RLPLTQAERT VILMENDRMA AEALRVLGFA YVEPRQADQS SADELIWLGM TGMADPLRQG MKELIALFHQ AGIDTVMITG DQSGTAYAIG KELGLSGGSE LDILDSTRLD QLDAEVLAGL AQKVNIFSRV SPANKLQIVQ ALQRGGKIAA MTGDGINDGP ALKAADIGVA MGGTGTEVAR SVADVILEDD NLSTMIVAVS EGRTIYNNIR KSIHFLTATN LSEIMLMLGS IGTGLGTPLT TSHLLWINLV TDVFPGLALA VEPPEPDVLR QPPRDPAEPI VGPADFKRYG LESLAMAAGS MGGYGYAMTR YGGGQKASTI AFMTLTMGQL LHAYSCRSDH IGIFSHETLR SNRYLDLAIG GTALLQWATV LVPGVRSLLG NTPIGPLDVA AIGAGSVLPF FLNEATKETA FKGKRRKEPL SLPPAPQEYG
|
| |
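The gene sequence above is listed 5' to 3' on the coding strand (it begins with ATG and ends with TGA), so its opening codons can be translated directly and compared with the start of the protein sequence. The sketch below is illustrative only: it hard-codes the codon assignments of the standard genetic code, which translation table 11 shares for all 64 codons (the tables differ in permitted start codons), and the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments of the standard genetic code in NCBI "TCAG" order.
# Assumption: these are used here for translation table 11, which has the
# same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence as printed in the record.
head = "ATGTCTCTAC TTTTCGACCA GATGCAACGC"
print(translate(head))  # MSLLFDQMQR, the first block of the protein sequence
```

The same `clean` step would be needed before computing statistics such as GC content on the full sequence, since the record groups bases in space-separated blocks of ten.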