Gene Nmul_B2808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_B2808 
Symbol 
ID3786775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007615 
Strand
Start bp10897 
End bp15696 
Gene Length4800 bp 
Protein Length1599 aa 
Translation table11 
GC content56% 
IMG OID637812884 
Productpeptidase C39, bacteriocin processing 
Protein accessionYP_413471 
Protein GI82703906 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCGCC TCTGGTCCAG TATGGCCGTT TCCCGTTTCA TTCCCTCTCT TTTTGCCACG 
CTGCTTTTGC TGGCAAGCCC TCTCCTCCAT GCCGCCCCTC CAGAAAACCC CGCGCCTCTT
CCGCTTCTCA AGGTGGATGG ATTCGAGGAG CCTCTGGTTC CCATGTATGG AACCACGCCG
AAGGAAGACC GGGCCCTCGC GGAAGCCATC TCCACCCATC GCAGCCAGTC GTCTTTTGAA
GATCTGGCCG TATTCGAGAC TTTCCTTAAA GAGTATCCGC GCTCGGGATG GAATGTCGGG
ATTCTCGCCA ATCTGGGGCT TGCCTATTAC CGCCAGGGCT ATTTCTCCAG GGCGATAAAT
GCGTGGGAGC GGGCCTGGCA GACGGAAAGA ACTCGTCCCG CCGATAATCC TATGGGCAAG
ATGCTGGCAG ACCGCGCCAT GGGGGAGCTG ATGCGCATGC ACTCCCGCCT GGGACATGCG
GACGAGCTTG AAAAACTGCT TGCCGATATC AAGGACCGGC CCATCAGCGG TCCCGCCACG
GAATTGATCA CGGACGCTCA TGAAAGCCTG TGGATGTTCC GAAACGAGTA CGACAAGGCT
TACCTTTGCG GTCCCATGGC GCTCAAGAGT CTGCTGGTCT CGGCCCAGGC CGATCCGGCA
AAAATCGCCA TCATGGATGC GGCGCGCTCA GGTCCGCATG GCTACACGCT TGCCCAGGTA
TCGGAGCTGG CAAGCAAGGC CAACGTACCT CATCGGCTGA TTTTCCGTGC TCCCGGACAG
CCCATCCCCG TCCCCTCAGT GGTGAACTGG AAACTGAATC ATTATGCTGC CATCATCGGT
GAGCAGAACG GCCTCTACCA TGTCCAGGAT CCCACCTTCG CAAGCGGGGA TGCCTGGCTC
ACCAAGGAGG CCATCGATGC GGAGGCAAGC GGGTATTTCC TGGTTCCGGC TGGAACCCAG
AAAGTTGCGT CCTGGCGTAT GGCCACTCCT GAAGAGGCAA GGCGTATCTA TGGAATGGGA
CAGACAAGTG TTAATCAGCC CGGTACGACA CAGAAGCTTG ATAGCAACTT GACCTGCTCC
AATCCAAAAG GCCCTGTTCC TCTCACTGGC TCCGCAGCGT CACCACCAGC CATGTGTGTA
GTGAACGCTA AAATGATGCT GGTGAGCCTG AACTTAAGCG ACGTTCCGGT CGGGTACCAG
CCGCCGAAGG GGCCTTCCGC CAAAATCAGC CTGACCTACA ATCAGCGCGA AGCCAGCCAG
CCAGCCAATT TCAGCTTTTT CAATATCAGC CCCAAATGGA CACTGAACGT GCTGTCCTGG
GTTCAGGATA CTCCCGACTC GCCGGGCCGC TCAGTTCTGC GCTATGCGGC AGGCGGGGGA
TCGGTGGATT ATTCGATCGG CTACAGCTAT TCAACGGCAT CCGGCGCGTT CTCGCCTGAA
CGTCAGGGTC AGGCAGTGCT GGTTCGCATA CCCGCCACCG GTCCCGCCAC TTCCTATGAA
TTGCGCATGC CCGATGGAAG CAAGCAGGTA TTCGCCCAGA GCGATGGCGC AACCGTATAC
CCGCGCCGCA TGTTCTTAAG CCAAATAATC GATCCTTCGG GTAACGCGCT AACGTTCAAT
TATGACAATC AGTTGCGCCT GGTATCGCTC ACGGATGCCA CTGGCCGCAA CACCATCTTC
AGTTATGAGT TCCCTGCCAA TCCCCTGCTC GTAACCGGAA TAACCGATCC ATTCGGACGC
CGTGCCGAAC TGACATACGA CACAAACAGA CGTCTCGCCT CGATCACCGA TGTGATCGGC
ATTACTTCCT CCTTCAAATA CGATACGGGG GGGCTCATCA ATGAAATGAC GACACCCTAT
GGTACCCATC ATTTCACCTA TGGGCAGAAC ATCTATGAAA ACTCACGCTT CTTGGAAATG
ACCGATCCGA TGGGTTTTAC CGAACGACTG GAATTCCGGC ATTTTGCTTC CGGCATACCC
AATGGTGATC CTGTGGCTCC GGCTGGCATG GGGATACTGA ACGGTTTTCT TTACTACCGT
AATACTTTCC ACTGGGACAA GCATCTTTAT GCTATCACTC ATACCGATTA CACCCAAGCC
CGAATCATTC ACTGGCTCCA TAACCCGGCT GGACAGACCT CGCCGATCAT AGAAAGCAGC
AAACAGCCCC TTGAGCGCCG CGTCTGGAAT ACCTACCCGG GACAAGGTAA TTCCATCATG
GAAGGAACTG CCGGGACACC GATAGCAATC GGCCGAGTAT TAGATGACGG CTCCACGCAA
GCAAGAAGAT TTACCTATAA CAGCATCGGC AAGCCGCTCA CAGCCATAGA TCCTACAGGA
CGCAAGACTT CCTTTACCTA CGCTTCCAAT GACATCGATC TTACTTCCAT CCAGCAGACC
ACGGGCAGCG GCCAGGCCAC GCTGGCGGCC TTCACCTACA ACAGCCAGCA CCTTCCCCTG
ACTGCGACCG ATGCGGCGGG AAAGACAACT TCCTATACCT ACAACTCGGA TGGCCAGGTA
ACGGGCGTGA CCGACGCGCT CGGGCAGACC ACGCGCTATG CCTATGATGG GCTGGGACGG
CTGATATCGA TCACCAACCC GGATAACGCG GTGCAGCACA GCTTCACCTA TGACGCCTTC
GACCGCGTTG CCACCGCCAC CGATTCGGAA GGCCACACCC TGGCCTATGA ATACGACGCC
CTCGACCGGG TTACGCAAAT CCTCTATCCC GATGGAACCA GCACGCTGAA TACCTATGAC
CTGCTCGACC TGGTAGAAAC AAAAGATAGG ATGGGGAGGG CCACGAGCTA CAGCTATGAC
GCCAACCGGC GCATGACCTC GATGACCGAT CCTGCGGGTC AGACCACCAG TTACAGCTAT
TACCGTAACG GGGTGTTGCG CAGCATCACC GACGGCAACG GGAACATGAC GCGCTGGGAT
ATCGATATCC AGAGCCGCCC CATTGCCAAG GTTTATGCCG ACGATTCCAG GGAAACCTAT
GCCTATGACA GCGCAAGCCG CCTGATAAGC GTAACCGACG CTTTGGGACA GACCAGGCAG
TATGGCTATA CGAAGGACGA CCGGCTCGCG GCGCTGAGTT ATGCCAATGC TATAAGTCCC
ACACCGGGCG TAAGCTTCGC CTACGACCCC TATTTCCCGC GCAGAACCAC CATGACCGAT
GGGGCGGGAA CCACCCAGTT CCAGTATGGC GCTGTCGGCT CCCTGGGGGC GCTCAAGCTG
ACAGGTGAGA ACGGGCCTTA TACCAATGAC GAGATTTCCT ACCAGTACGA TGCATTGGGG
CGGATGACCA GCCGCAAGGT GGATACGATA ACCGAGTCCT TTGCCTATGA CAGGCTCGAC
CGCGTCACCC AGCATACCAA TCCGCTGGGC TCCTTCAATT TCAGCTATCT GGGTCAGACT
GGGCAGCTTT TAAGCCAGCA GGCCGGGGCT GTGGGAACGC AGTGGGGCTA TGAGGACAAC
ACCCATGACC GCCGCCTGAA GTCGATCACG AACAGCGGTC TGGCGCGCGG TTTCCACTAT
GCCACCACCC CGGAAAACCA GATTTCCACT CTGACGGAAA CTGCGGGGGG AGCAGCGCAG
AAGAGCTGGC ACTACGCCTA TGATGGCGCT GACCGCCTCC TCTCCGCCCA GCCTTCATCC
GGGGGAAGCT TCAGCTATGG TTATGACGCC GCTGACAACC TGACTTCCCT CAATGGCGCG
TTGACGCATT ACAACACCGT CAACCAATTG ACGAGTTTTA ACAGTGAAAG CTTCAGTTAC
GACGCCAACG GCAATCTGAA GGATGACGGA GTCCGCACCT ATCAGTGGGA TGCCGAAAAC
CGTCTGCTCT CCATCAGCTA CAAGAACGAT CCCAGCAAGG CGACCACGTT CCGCTACGAT
GGCATGGGGC GGCGGCTGGC TATTGTCGAA AATAATGGAG GAGCAATAAC AGAAACCCGT
CATCTCTGGT GCGGCGCTAC GCTGTGCCAG GCCAGAACGG CAGGCGATGT GGTCACACGC
CGCTATTACC CGCAAGGCAT GGCAATTCCG CAGGGCGGCA CTCTGCTTTA TTACGGCACT
GACCATCTAG GCTCGGTGCG GGACGTGATG GCGGCCCAGA ATGGGGCCAA GGTGGCAAGC
TACGACTATG ATCCTTATGG AAACCAGATA GCGGGGAGTG GGCGGATTTC AGTTGACTTT
CGCTATGCGG GGATGTTCTA TCATCAGCAG AGCGGATTGT ACCTGACGAA TTTCCGGGCC
TATGACCCGA AAACGGCGAA GTGGCTATCG CGTGACCCGA TTGGAGAAAA AGGAGGATTA
AATCTTTATG GGTATGTGGG AGGAAATCCA ATTAATATGA TTGATCCTTT AGGTCTGAGG
GCACTTCCTT GGATACTTGG CGGGGCCAGT TCTGACATAG CAACCCCGGA TCCGAGCGAT
ATAGCCTGGC AGAAGTGGGC TGGTTGGGCT ATTTTAATCA CTGGAGCAAC GATATATGAC
GCGTGCTCAG GAAACTCAGA ATCCAAAACG GCTCAGAATA AAACTGCTCA ACCACCTATT
TGCCCCCCCT GTAATCCACC TCAGGGAACC CAATGTTATG AACCCGACAC TGGACACACA
CATAATGGGT GGGATCCACA CTATCATATT TGGAGTCGTG GACAAAATCC TAACACTTGT
CAATGCTATT GGAATAGAGG GAGTGGATCG AAAGGAACAA CTCAATTTCC CCCGGTAGCT
CCAGTAGGAA AAGAAGAAAT AAAAGATTGT GGCAATTATA CAACTTGGCC TCACCAATAA
 
Protein sequence
MARLWSSMAV SRFIPSLFAT LLLLASPLLH AAPPENPAPL PLLKVDGFEE PLVPMYGTTP 
KEDRALAEAI STHRSQSSFE DLAVFETFLK EYPRSGWNVG ILANLGLAYY RQGYFSRAIN
AWERAWQTER TRPADNPMGK MLADRAMGEL MRMHSRLGHA DELEKLLADI KDRPISGPAT
ELITDAHESL WMFRNEYDKA YLCGPMALKS LLVSAQADPA KIAIMDAARS GPHGYTLAQV
SELASKANVP HRLIFRAPGQ PIPVPSVVNW KLNHYAAIIG EQNGLYHVQD PTFASGDAWL
TKEAIDAEAS GYFLVPAGTQ KVASWRMATP EEARRIYGMG QTSVNQPGTT QKLDSNLTCS
NPKGPVPLTG SAASPPAMCV VNAKMMLVSL NLSDVPVGYQ PPKGPSAKIS LTYNQREASQ
PANFSFFNIS PKWTLNVLSW VQDTPDSPGR SVLRYAAGGG SVDYSIGYSY STASGAFSPE
RQGQAVLVRI PATGPATSYE LRMPDGSKQV FAQSDGATVY PRRMFLSQII DPSGNALTFN
YDNQLRLVSL TDATGRNTIF SYEFPANPLL VTGITDPFGR RAELTYDTNR RLASITDVIG
ITSSFKYDTG GLINEMTTPY GTHHFTYGQN IYENSRFLEM TDPMGFTERL EFRHFASGIP
NGDPVAPAGM GILNGFLYYR NTFHWDKHLY AITHTDYTQA RIIHWLHNPA GQTSPIIESS
KQPLERRVWN TYPGQGNSIM EGTAGTPIAI GRVLDDGSTQ ARRFTYNSIG KPLTAIDPTG
RKTSFTYASN DIDLTSIQQT TGSGQATLAA FTYNSQHLPL TATDAAGKTT SYTYNSDGQV
TGVTDALGQT TRYAYDGLGR LISITNPDNA VQHSFTYDAF DRVATATDSE GHTLAYEYDA
LDRVTQILYP DGTSTLNTYD LLDLVETKDR MGRATSYSYD ANRRMTSMTD PAGQTTSYSY
YRNGVLRSIT DGNGNMTRWD IDIQSRPIAK VYADDSRETY AYDSASRLIS VTDALGQTRQ
YGYTKDDRLA ALSYANAISP TPGVSFAYDP YFPRRTTMTD GAGTTQFQYG AVGSLGALKL
TGENGPYTND EISYQYDALG RMTSRKVDTI TESFAYDRLD RVTQHTNPLG SFNFSYLGQT
GQLLSQQAGA VGTQWGYEDN THDRRLKSIT NSGLARGFHY ATTPENQIST LTETAGGAAQ
KSWHYAYDGA DRLLSAQPSS GGSFSYGYDA ADNLTSLNGA LTHYNTVNQL TSFNSESFSY
DANGNLKDDG VRTYQWDAEN RLLSISYKND PSKATTFRYD GMGRRLAIVE NNGGAITETR
HLWCGATLCQ ARTAGDVVTR RYYPQGMAIP QGGTLLYYGT DHLGSVRDVM AAQNGAKVAS
YDYDPYGNQI AGSGRISVDF RYAGMFYHQQ SGLYLTNFRA YDPKTAKWLS RDPIGEKGGL
NLYGYVGGNP INMIDPLGLR ALPWILGGAS SDIATPDPSD IAWQKWAGWA ILITGATIYD
ACSGNSESKT AQNKTAQPPI CPPCNPPQGT QCYEPDTGHT HNGWDPHYHI WSRGQNPNTC
QCYWNRGSGS KGTTQFPPVA PVGKEEIKDC GNYTTWPHQ