Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_B2808 |
Symbol | |
ID | 3786775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007615 |
Strand | - |
Start bp | 10897 |
End bp | 15696 |
Gene Length | 4800 bp |
Protein Length | 1599 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812884 |
Product | peptidase C39, bacteriocin processing |
Protein accession | YP_413471 |
Protein GI | 82703906 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGCC TCTGGTCCAG TATGGCCGTT TCCCGTTTCA TTCCCTCTCT TTTTGCCACG CTGCTTTTGC TGGCAAGCCC TCTCCTCCAT GCCGCCCCTC CAGAAAACCC CGCGCCTCTT CCGCTTCTCA AGGTGGATGG ATTCGAGGAG CCTCTGGTTC CCATGTATGG AACCACGCCG AAGGAAGACC GGGCCCTCGC GGAAGCCATC TCCACCCATC GCAGCCAGTC GTCTTTTGAA GATCTGGCCG TATTCGAGAC TTTCCTTAAA GAGTATCCGC GCTCGGGATG GAATGTCGGG ATTCTCGCCA ATCTGGGGCT TGCCTATTAC CGCCAGGGCT ATTTCTCCAG GGCGATAAAT GCGTGGGAGC GGGCCTGGCA GACGGAAAGA ACTCGTCCCG CCGATAATCC TATGGGCAAG ATGCTGGCAG ACCGCGCCAT GGGGGAGCTG ATGCGCATGC ACTCCCGCCT GGGACATGCG GACGAGCTTG AAAAACTGCT TGCCGATATC AAGGACCGGC CCATCAGCGG TCCCGCCACG GAATTGATCA CGGACGCTCA TGAAAGCCTG TGGATGTTCC GAAACGAGTA CGACAAGGCT TACCTTTGCG GTCCCATGGC GCTCAAGAGT CTGCTGGTCT CGGCCCAGGC CGATCCGGCA AAAATCGCCA TCATGGATGC GGCGCGCTCA GGTCCGCATG GCTACACGCT TGCCCAGGTA TCGGAGCTGG CAAGCAAGGC CAACGTACCT CATCGGCTGA TTTTCCGTGC TCCCGGACAG CCCATCCCCG TCCCCTCAGT GGTGAACTGG AAACTGAATC ATTATGCTGC CATCATCGGT GAGCAGAACG GCCTCTACCA TGTCCAGGAT CCCACCTTCG CAAGCGGGGA TGCCTGGCTC ACCAAGGAGG CCATCGATGC GGAGGCAAGC GGGTATTTCC TGGTTCCGGC TGGAACCCAG AAAGTTGCGT CCTGGCGTAT GGCCACTCCT GAAGAGGCAA GGCGTATCTA TGGAATGGGA CAGACAAGTG TTAATCAGCC CGGTACGACA CAGAAGCTTG ATAGCAACTT GACCTGCTCC AATCCAAAAG GCCCTGTTCC TCTCACTGGC TCCGCAGCGT CACCACCAGC CATGTGTGTA GTGAACGCTA AAATGATGCT GGTGAGCCTG AACTTAAGCG ACGTTCCGGT CGGGTACCAG CCGCCGAAGG GGCCTTCCGC CAAAATCAGC CTGACCTACA ATCAGCGCGA AGCCAGCCAG CCAGCCAATT TCAGCTTTTT CAATATCAGC CCCAAATGGA CACTGAACGT GCTGTCCTGG GTTCAGGATA CTCCCGACTC GCCGGGCCGC TCAGTTCTGC GCTATGCGGC AGGCGGGGGA TCGGTGGATT ATTCGATCGG CTACAGCTAT TCAACGGCAT CCGGCGCGTT CTCGCCTGAA CGTCAGGGTC AGGCAGTGCT GGTTCGCATA CCCGCCACCG GTCCCGCCAC TTCCTATGAA TTGCGCATGC CCGATGGAAG CAAGCAGGTA TTCGCCCAGA GCGATGGCGC AACCGTATAC CCGCGCCGCA TGTTCTTAAG CCAAATAATC GATCCTTCGG GTAACGCGCT AACGTTCAAT TATGACAATC AGTTGCGCCT GGTATCGCTC ACGGATGCCA CTGGCCGCAA CACCATCTTC AGTTATGAGT TCCCTGCCAA TCCCCTGCTC GTAACCGGAA TAACCGATCC ATTCGGACGC CGTGCCGAAC TGACATACGA CACAAACAGA CGTCTCGCCT CGATCACCGA TGTGATCGGC ATTACTTCCT CCTTCAAATA CGATACGGGG GGGCTCATCA ATGAAATGAC GACACCCTAT GGTACCCATC ATTTCACCTA TGGGCAGAAC ATCTATGAAA ACTCACGCTT CTTGGAAATG ACCGATCCGA TGGGTTTTAC CGAACGACTG GAATTCCGGC ATTTTGCTTC CGGCATACCC AATGGTGATC CTGTGGCTCC GGCTGGCATG GGGATACTGA ACGGTTTTCT TTACTACCGT AATACTTTCC ACTGGGACAA GCATCTTTAT GCTATCACTC ATACCGATTA CACCCAAGCC CGAATCATTC ACTGGCTCCA TAACCCGGCT GGACAGACCT CGCCGATCAT AGAAAGCAGC AAACAGCCCC TTGAGCGCCG CGTCTGGAAT ACCTACCCGG GACAAGGTAA TTCCATCATG GAAGGAACTG CCGGGACACC GATAGCAATC GGCCGAGTAT TAGATGACGG CTCCACGCAA GCAAGAAGAT TTACCTATAA CAGCATCGGC AAGCCGCTCA CAGCCATAGA TCCTACAGGA CGCAAGACTT CCTTTACCTA CGCTTCCAAT GACATCGATC TTACTTCCAT CCAGCAGACC ACGGGCAGCG GCCAGGCCAC GCTGGCGGCC TTCACCTACA ACAGCCAGCA CCTTCCCCTG ACTGCGACCG ATGCGGCGGG AAAGACAACT TCCTATACCT ACAACTCGGA TGGCCAGGTA ACGGGCGTGA CCGACGCGCT CGGGCAGACC ACGCGCTATG CCTATGATGG GCTGGGACGG CTGATATCGA TCACCAACCC GGATAACGCG GTGCAGCACA GCTTCACCTA TGACGCCTTC GACCGCGTTG CCACCGCCAC CGATTCGGAA GGCCACACCC TGGCCTATGA ATACGACGCC CTCGACCGGG TTACGCAAAT CCTCTATCCC GATGGAACCA GCACGCTGAA TACCTATGAC CTGCTCGACC TGGTAGAAAC AAAAGATAGG ATGGGGAGGG CCACGAGCTA CAGCTATGAC GCCAACCGGC GCATGACCTC GATGACCGAT CCTGCGGGTC AGACCACCAG TTACAGCTAT TACCGTAACG GGGTGTTGCG CAGCATCACC GACGGCAACG GGAACATGAC GCGCTGGGAT ATCGATATCC AGAGCCGCCC CATTGCCAAG GTTTATGCCG ACGATTCCAG GGAAACCTAT GCCTATGACA GCGCAAGCCG CCTGATAAGC GTAACCGACG CTTTGGGACA GACCAGGCAG TATGGCTATA CGAAGGACGA CCGGCTCGCG GCGCTGAGTT ATGCCAATGC TATAAGTCCC ACACCGGGCG TAAGCTTCGC CTACGACCCC TATTTCCCGC GCAGAACCAC CATGACCGAT GGGGCGGGAA CCACCCAGTT CCAGTATGGC GCTGTCGGCT CCCTGGGGGC GCTCAAGCTG ACAGGTGAGA ACGGGCCTTA TACCAATGAC GAGATTTCCT ACCAGTACGA TGCATTGGGG CGGATGACCA GCCGCAAGGT GGATACGATA ACCGAGTCCT TTGCCTATGA CAGGCTCGAC CGCGTCACCC AGCATACCAA TCCGCTGGGC TCCTTCAATT TCAGCTATCT GGGTCAGACT GGGCAGCTTT TAAGCCAGCA GGCCGGGGCT GTGGGAACGC AGTGGGGCTA TGAGGACAAC ACCCATGACC GCCGCCTGAA GTCGATCACG AACAGCGGTC TGGCGCGCGG TTTCCACTAT GCCACCACCC CGGAAAACCA GATTTCCACT CTGACGGAAA CTGCGGGGGG AGCAGCGCAG AAGAGCTGGC ACTACGCCTA TGATGGCGCT GACCGCCTCC TCTCCGCCCA GCCTTCATCC GGGGGAAGCT TCAGCTATGG TTATGACGCC GCTGACAACC TGACTTCCCT CAATGGCGCG TTGACGCATT ACAACACCGT CAACCAATTG ACGAGTTTTA ACAGTGAAAG CTTCAGTTAC GACGCCAACG GCAATCTGAA GGATGACGGA GTCCGCACCT ATCAGTGGGA TGCCGAAAAC CGTCTGCTCT CCATCAGCTA CAAGAACGAT CCCAGCAAGG CGACCACGTT CCGCTACGAT GGCATGGGGC GGCGGCTGGC TATTGTCGAA AATAATGGAG GAGCAATAAC AGAAACCCGT CATCTCTGGT GCGGCGCTAC GCTGTGCCAG GCCAGAACGG CAGGCGATGT GGTCACACGC CGCTATTACC CGCAAGGCAT GGCAATTCCG CAGGGCGGCA CTCTGCTTTA TTACGGCACT GACCATCTAG GCTCGGTGCG GGACGTGATG GCGGCCCAGA ATGGGGCCAA GGTGGCAAGC TACGACTATG ATCCTTATGG AAACCAGATA GCGGGGAGTG GGCGGATTTC AGTTGACTTT CGCTATGCGG GGATGTTCTA TCATCAGCAG AGCGGATTGT ACCTGACGAA TTTCCGGGCC TATGACCCGA AAACGGCGAA GTGGCTATCG CGTGACCCGA TTGGAGAAAA AGGAGGATTA AATCTTTATG GGTATGTGGG AGGAAATCCA ATTAATATGA TTGATCCTTT AGGTCTGAGG GCACTTCCTT GGATACTTGG CGGGGCCAGT TCTGACATAG CAACCCCGGA TCCGAGCGAT ATAGCCTGGC AGAAGTGGGC TGGTTGGGCT ATTTTAATCA CTGGAGCAAC GATATATGAC GCGTGCTCAG GAAACTCAGA ATCCAAAACG GCTCAGAATA AAACTGCTCA ACCACCTATT TGCCCCCCCT GTAATCCACC TCAGGGAACC CAATGTTATG AACCCGACAC TGGACACACA CATAATGGGT GGGATCCACA CTATCATATT TGGAGTCGTG GACAAAATCC TAACACTTGT CAATGCTATT GGAATAGAGG GAGTGGATCG AAAGGAACAA CTCAATTTCC CCCGGTAGCT CCAGTAGGAA AAGAAGAAAT AAAAGATTGT GGCAATTATA CAACTTGGCC TCACCAATAA
|
Protein sequence | MARLWSSMAV SRFIPSLFAT LLLLASPLLH AAPPENPAPL PLLKVDGFEE PLVPMYGTTP KEDRALAEAI STHRSQSSFE DLAVFETFLK EYPRSGWNVG ILANLGLAYY RQGYFSRAIN AWERAWQTER TRPADNPMGK MLADRAMGEL MRMHSRLGHA DELEKLLADI KDRPISGPAT ELITDAHESL WMFRNEYDKA YLCGPMALKS LLVSAQADPA KIAIMDAARS GPHGYTLAQV SELASKANVP HRLIFRAPGQ PIPVPSVVNW KLNHYAAIIG EQNGLYHVQD PTFASGDAWL TKEAIDAEAS GYFLVPAGTQ KVASWRMATP EEARRIYGMG QTSVNQPGTT QKLDSNLTCS NPKGPVPLTG SAASPPAMCV VNAKMMLVSL NLSDVPVGYQ PPKGPSAKIS LTYNQREASQ PANFSFFNIS PKWTLNVLSW VQDTPDSPGR SVLRYAAGGG SVDYSIGYSY STASGAFSPE RQGQAVLVRI PATGPATSYE LRMPDGSKQV FAQSDGATVY PRRMFLSQII DPSGNALTFN YDNQLRLVSL TDATGRNTIF SYEFPANPLL VTGITDPFGR RAELTYDTNR RLASITDVIG ITSSFKYDTG GLINEMTTPY GTHHFTYGQN IYENSRFLEM TDPMGFTERL EFRHFASGIP NGDPVAPAGM GILNGFLYYR NTFHWDKHLY AITHTDYTQA RIIHWLHNPA GQTSPIIESS KQPLERRVWN TYPGQGNSIM EGTAGTPIAI GRVLDDGSTQ ARRFTYNSIG KPLTAIDPTG RKTSFTYASN DIDLTSIQQT TGSGQATLAA FTYNSQHLPL TATDAAGKTT SYTYNSDGQV TGVTDALGQT TRYAYDGLGR LISITNPDNA VQHSFTYDAF DRVATATDSE GHTLAYEYDA LDRVTQILYP DGTSTLNTYD LLDLVETKDR MGRATSYSYD ANRRMTSMTD PAGQTTSYSY YRNGVLRSIT DGNGNMTRWD IDIQSRPIAK VYADDSRETY AYDSASRLIS VTDALGQTRQ YGYTKDDRLA ALSYANAISP TPGVSFAYDP YFPRRTTMTD GAGTTQFQYG AVGSLGALKL TGENGPYTND EISYQYDALG RMTSRKVDTI TESFAYDRLD RVTQHTNPLG SFNFSYLGQT GQLLSQQAGA VGTQWGYEDN THDRRLKSIT NSGLARGFHY ATTPENQIST LTETAGGAAQ KSWHYAYDGA DRLLSAQPSS GGSFSYGYDA ADNLTSLNGA LTHYNTVNQL TSFNSESFSY DANGNLKDDG VRTYQWDAEN RLLSISYKND PSKATTFRYD GMGRRLAIVE NNGGAITETR HLWCGATLCQ ARTAGDVVTR RYYPQGMAIP QGGTLLYYGT DHLGSVRDVM AAQNGAKVAS YDYDPYGNQI AGSGRISVDF RYAGMFYHQQ SGLYLTNFRA YDPKTAKWLS RDPIGEKGGL NLYGYVGGNP INMIDPLGLR ALPWILGGAS SDIATPDPSD IAWQKWAGWA ILITGATIYD ACSGNSESKT AQNKTAQPPI CPPCNPPQGT QCYEPDTGHT HNGWDPHYHI WSRGQNPNTC QCYWNRGSGS KGTTQFPPVA PVGKEEIKDC GNYTTWPHQ
|
| |