Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2648 |
Symbol | |
ID | 3785259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 3033581 |
End bp | 3038629 |
Gene Length | 5049 bp |
Protein Length | 1682 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812737 |
Product | hypothetical protein |
Protein accession | YP_413327 |
Protein GI | 82703761 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAATA ATCCGGATTT CAGCATCGGT TTTTCCACCA ATCTCGTTCG CGGCCTTAAC CCGAACTGGC AGCCGCAGAA GGGGGTGCTC AACATCCTCT ATTCTGCTGA ACACCAGGCG ATGGATTTAA AGCCCGTGGA ACCGGTCGAG CACCGGGGCT CGTGGTCTTA CCCTGTGATT ACGCTGCATG AAACGGCCCT GGAGTTGCTG GATGAGCTGA TTCCGTATGA GATCGTCAAG CCTTCCAGCG ACATCGATGC GCCTTCCTTT GGACCGGGGG CCTTGCTCGC GGGTGCTGCC GGCAGCCGGG CTTCCCTGGA AATACTTTCG AAAACCATCG GCGTGGATTT GACGAGGCCG GGATCAGGTT ATGCGCTGGT CAAACTGGTA AGGGTGGATG GAGCGGACAA CCATGCGTCG GAGGTGGGCG GCATTCTGGT ACACGCGCGT CCCCGTCAGC CCGATCCCGA GTATGGGCTC ACCCCTGCCT TCACTTCCGC TTCGACCAGG TTGCGGCACG CCGGGCGCAG CCGGCTGCAG GATTACGGCG ATCAACTGAC AAAGGATGAT GCCAACAAGG TCCTCGATTC TTTCCGCGAA TTTGGCACCC ATTATGTTTC CGGTGTGGAG CTGGGCGATA CCATCCTTCA GGTGTTTGCC TATCCGCCGG AACAGTTTGC GCGGATCACG TCGGCTTATG CGGACGGCAG CAATCCGCTG TCGGGTCATG GTTCTCAAAA TTTTGTCCAG TTTACGACCG ACCGCGCCGC CGGTATCTTC GGCTATGTGG AAGAGTATGG AAATGTCATT TGCCTGAGCA ATTCCGCCGT CTTCAATTCG ACGCTCAAGC GGGGCGATTG GCTGGATAAG CTGTGGTCAA AAAAGAACAG CGTGTTCTCG CTGTTCAACG TCAACTCCCT TTTGTCGCTT GCCGATCTCC AGCAGGAATT CGCCCAGCAG ACGGTGATAG GTGTGCGGCT TGCCAGCCTG AGCGTGATGA TCGAGCAGAA GCGTGGGCTG ATCTGGCAGC GCATCTTCAA GGGCGCGATG GCGCAGAAGT ACCGTACGAC GATACAGCCG AATTTTTCCA TTTATGACCC GCGCGATTTC GTCCGCATGC TGCCGCAGGA TACCGCCGGG ATCGTGTCAT TCATTGCCAC TCCAACTGTC AATGCGTACA AGGCGCGTCT GGATCTCGCA GACATGCAGC TCGTGGCTGC CGAAGAAGTG CAGGATTTCA TCCTTCTTGC CAACGTATTG TCCGTGGGCA GCAATAAAAC CATCGAAGTG CCCGGAAAAA AAGTGCGGCT GTTCGGGCAG GTGCTGGATA TGCGGGTACA AGGCCAGCCC AGGGCGATCT CCCTGGCGGA TGAAGCATTT GAATCGCTGC AGTTGGGATG TGACGAGTTT CTGGGTGCTT TGGCTATCAG AAACAGTTCC GGTTCACGCT ACAGTGTCAT CGTGGATGGA CTCAGGTTCA ACCTGGAAGG CGAGGGGCCG GAGGCAGTGC CTTTTGTCGC GGATGACGTG CGCGTTGTGC CTCCCGCCGA GGCGGTGCCG TTGCTTGTGA ACAGCCTTCA GTTTTCCATG ACATTTGCAG AGGCGGTCAT CAGCGACCAG TCGGGCAAAG CCAATGGCTG TCTCCAGCAG TTCGTGCGCG ATTATCTGAG CTGGTTGGCG GAATTTCTAT CGGGCGCAAG CGGTAGCGAA GAACTGATTG CGTTGCGCAT CCGTGCGATG GATCTCGCCA GCTACGCCAT CAATCCGGAA TATGGCAGCT TTGTTCCGAT TCTGCCTTAC ACGGACTATG AGCAGTATGT GCAAAGCATA CTCAGCTATC TGGACAGGAT ACAGCTGCAG GTGGCGCAAA ACGAACAGCG CATGGCCGTC CGGCGCCTGG AAGAAAGGGT GATCGATGTC GGCAGGAGGC TTAATGAGAA CATCATCAAG TCGGGCGAGC TGGTCAGCGA TGTGATCAAG GCGAATGCCG AGCAGCAGAG AGACCTGGAA GGATTCTACG ATTCGCTGAT TGCCCAGAGC AAGGCCGAGG CCGACCAGCA GCAGAACAAG ATCAACGATC TGCGCGCGTC ACTGTTCGAA GCCCAGGGAG AAGTGAACCT GGCGGTACAG AAATATAAAT CCGCGGTGCA GCAGTGGCAA ACCATGGAGG CGATCAAGTT CGGCCTGGAA GTGGCCACCA ATCTGTTCAG CCTTGGCACT TCCATTGCCA TCCCGGCATC CTCCATTTCC GCAGTGAAAG ATCTTGGCCT CACCGTGCAA CGTATCCAGA AGACGCTGAA TGTGCTCAAC GCCCTGTCCA AGCTCTATAC CGGCGCGAGC ACCGGCGCCA AGGCTCTGGA AAACGCCCAG CACGCGCTGG ATGATCTGGA TGATGGGCAG TTCGGTAATC CCTCGATGCT TTCCTGGGAC GAAATGTCGA TCCAGTTCAA CCAGATCATG GCATCGGGTC CGGACATCAA ACCGGAAAAG GCGGGCCTGC AAGCGGCTTT TGCCACGATG GTGCTGCGCG GCAAGGCTGT AACCAGCGCT GAATCCGCAT TGCACGTCCT GCAACGGGAC ATCTATACCA ATCAGAAGCA GAAGGAGATC AACAGCCGCC AGGCAAAACG GCTGGAGGAA TTGCAGCAGA AGCTGCATCC TGCCCAGGTG AAGGATCTGG ACCGGTCCGC CATCGATCTG GTGGCGTTGA CCGGGCACCT GACATTCATC CAGAACCAGA TGCTCACGAT TCTTGCCAAG GCATTCCTGA TGCAGGACCT GGCGCTTCAA TACGCCAACC TTCAGCCCGC CACACCCGTG CTGGCGTACA GCCTGCTGAA ATTCAGCGCC GCGGTGGTTC AGCAGAAAGC CGCCACGCTG GAAGCCAAAT CCCTCCTCGC GCAGTATCAG GTCACACGAA CCCGGCCGAT CGAATATGTC ATCGAAGGGG TGAAGCCCGA GGAATTGACT GGTGGCAATA TCTTTCGCAC CACCATCTTC CTGGATGCGC CGGCGTTCTA CCAGTACGTC AACGCCAGGA TCGTTTCCGT GGTTGCTTCC GTGGAAGGTG TCAGGTCGAC TGAAAGCGGC ACTTATCTGC TGCGCCTGGC CTACGGCGGA ACCCCCTTTC ATGACCGCAA CCTTCATCGC GATCCGCTGA CATTCCGCAC ACCGTGGCGC GAGCGCATAT ACAACTATAA CGCCCAGGAC AACTCGCCCA GGTTTTCCGA CGGGGGGCAA TCCTGGTCTG AAGGTGTAAG CCGGGTGACG CCCTTTTCCA TCTGGGAGGT TTCGCTGCCG AACACGAGCA CGAACAAGGG CCTGCAGTTC AATGGCGATA GTTTGACGAT CCGGTTGTCC TTCGTGCTGG AAGCAAGAAT TGCAGATGCG CAGAAAGTCA TGCAGCGCCG CGCATTGAAT CGCCTGCTCG CATCCGGCTT GCCCCCGGTG CTTGCGGCCC CGGCGGCCGA TGGTTTGGCT GAAGGTTTCC GCCCGCTGCT CGCTGCGGCT CCAGCTCCAA CCTTGCCCTC CGCAGACACG TTGCTGAAGC AGATGTACGC TCAGGGAAGC TGCACCAACG GCTGGGATGT GGTATTCAAC ATGAATCTGG AAGAGATCAA CAGAGCGCTG AAGAACCAGT ACGAAGCCCT GAAAAAGGAC ACGGCCTACA AAAACAGAAT CGTTGTCAAC ACCTCGGAGA AATACCCTGG CGACGTGACC GTGATCAACC GCTTTACCAT CGAATACGGC TACCCCCTTC TGAGTTTCAG CATCAATAAC AACAATACCG CGATGCTGAG AATGGAGGTT TTGAACGGTT CAGTGCAGCG TTGTTCGAAA GTGGGATCCT TTCCTGAGCA GTGCGATGCC CCCCAATCGA TCGGCGGCGA AACCCTGGCG GCGGTGATTC AGCTGGCCAA GGTGGCAGGA ACGGTCAAGA TTGACAACAG CAATCACAAC GTCTTGAAAG TGCAGCTCGA CATGCAGGAG GGAACGTTCT CGATCAGCAA CATCGATCTG AGCGATGCAA CGAAGGTGGA ATTCAACAAG GCGGTGAAGG AATACTTCGT CAATAACCCT GTGGTCTACC TGATCAATCA GCTCGACCTG ACCGCTATTC CCACGCTGGA GTCACTCAAG CCAAGCGATT TTATCTTCAA GCCGCTTCAG ACACCCGGCA ATAACGAGAT GCTGCAGCTC TTCATCATGA CCGGGGGACG GGCAGCGCTG AATTACTCCC AGGCTTTTCT GAACAATATC CCTGAGCCGA TTCCGCAGGG TCAGAGCAAC AGCATGATCG TCCGCTCTGC GCTGATATTC AAGGATGTGC TTCCGCAAAG CTTGAGAAAC AATGGCTGGG CACTGCAAGG GCTCGATCCA GGCAGTCCAG CCAAGGCGTG GTCGGCTAAA TTCACCAGTG CCAGCGTCAC CGGCAGCGTG GACTTGAGCA AACTCAACCA CACCTCGTCC ACCAGCAGCG GTCATGGCAG TGGATCGATG ACCCAGTATA CCTACTCGAT TCCGGGCGGA AACGACGTTT CCTGGTCGCT GGCGGACACC GCCATTGTCG TGCAGGCAAA CGGACAAATG TATTACAGCG GGTCGCGCGG GCAATCCCTG CAATACAACC AGCGTGCCTG CACGTCTTAC TATCCCTGCC TGTGGAATTG CGACCCCCGC TGCAGCGATT CCAAGCTGGC AACGGACGTG ACAATCGAGG TGAAAGCTGC GTTGCCACTG AATGTGGGTG GAAGCGGGCG CAACCAGACC ATCGAGATCA AGACTTCCGG TCAGGGAGTT GTGGTCAGCG GTCACTTGTC GGGAGGAGGA CCTAGCGGCT CTGACGATCT GGCCGCGCAG GTCAATCAGC AGATACAGAG CCAGGTGCCG GCGCAGATTG CGGAAAAGCT GTCCATCCAG TTCGATTCCA TTTCGGTATT CGCGCTGAAA AACCTGTTGT TTCCATCGAA CAACTACATC TCGTTCAGCA GTTGTGGTGT GCCGGGCGAC CTGATTCTCC TGGGCAACTT CAACTCGGCA AAATCCTAG
|
Protein sequence | MNNNPDFSIG FSTNLVRGLN PNWQPQKGVL NILYSAEHQA MDLKPVEPVE HRGSWSYPVI TLHETALELL DELIPYEIVK PSSDIDAPSF GPGALLAGAA GSRASLEILS KTIGVDLTRP GSGYALVKLV RVDGADNHAS EVGGILVHAR PRQPDPEYGL TPAFTSASTR LRHAGRSRLQ DYGDQLTKDD ANKVLDSFRE FGTHYVSGVE LGDTILQVFA YPPEQFARIT SAYADGSNPL SGHGSQNFVQ FTTDRAAGIF GYVEEYGNVI CLSNSAVFNS TLKRGDWLDK LWSKKNSVFS LFNVNSLLSL ADLQQEFAQQ TVIGVRLASL SVMIEQKRGL IWQRIFKGAM AQKYRTTIQP NFSIYDPRDF VRMLPQDTAG IVSFIATPTV NAYKARLDLA DMQLVAAEEV QDFILLANVL SVGSNKTIEV PGKKVRLFGQ VLDMRVQGQP RAISLADEAF ESLQLGCDEF LGALAIRNSS GSRYSVIVDG LRFNLEGEGP EAVPFVADDV RVVPPAEAVP LLVNSLQFSM TFAEAVISDQ SGKANGCLQQ FVRDYLSWLA EFLSGASGSE ELIALRIRAM DLASYAINPE YGSFVPILPY TDYEQYVQSI LSYLDRIQLQ VAQNEQRMAV RRLEERVIDV GRRLNENIIK SGELVSDVIK ANAEQQRDLE GFYDSLIAQS KAEADQQQNK INDLRASLFE AQGEVNLAVQ KYKSAVQQWQ TMEAIKFGLE VATNLFSLGT SIAIPASSIS AVKDLGLTVQ RIQKTLNVLN ALSKLYTGAS TGAKALENAQ HALDDLDDGQ FGNPSMLSWD EMSIQFNQIM ASGPDIKPEK AGLQAAFATM VLRGKAVTSA ESALHVLQRD IYTNQKQKEI NSRQAKRLEE LQQKLHPAQV KDLDRSAIDL VALTGHLTFI QNQMLTILAK AFLMQDLALQ YANLQPATPV LAYSLLKFSA AVVQQKAATL EAKSLLAQYQ VTRTRPIEYV IEGVKPEELT GGNIFRTTIF LDAPAFYQYV NARIVSVVAS VEGVRSTESG TYLLRLAYGG TPFHDRNLHR DPLTFRTPWR ERIYNYNAQD NSPRFSDGGQ SWSEGVSRVT PFSIWEVSLP NTSTNKGLQF NGDSLTIRLS FVLEARIADA QKVMQRRALN RLLASGLPPV LAAPAADGLA EGFRPLLAAA PAPTLPSADT LLKQMYAQGS CTNGWDVVFN MNLEEINRAL KNQYEALKKD TAYKNRIVVN TSEKYPGDVT VINRFTIEYG YPLLSFSINN NNTAMLRMEV LNGSVQRCSK VGSFPEQCDA PQSIGGETLA AVIQLAKVAG TVKIDNSNHN VLKVQLDMQE GTFSISNIDL SDATKVEFNK AVKEYFVNNP VVYLINQLDL TAIPTLESLK PSDFIFKPLQ TPGNNEMLQL FIMTGGRAAL NYSQAFLNNI PEPIPQGQSN SMIVRSALIF KDVLPQSLRN NGWALQGLDP GSPAKAWSAK FTSASVTGSV DLSKLNHTSS TSSGHGSGSM TQYTYSIPGG NDVSWSLADT AIVVQANGQM YYSGSRGQSL QYNQRACTSY YPCLWNCDPR CSDSKLATDV TIEVKAALPL NVGGSGRNQT IEIKTSGQGV VVSGHLSGGG PSGSDDLAAQ VNQQIQSQVP AQIAEKLSIQ FDSISVFALK NLLFPSNNYI SFSSCGVPGD LILLGNFNSA KS
|
| |