Gene Information |
Locus tag | Namu_4933 |
Symbol | |
ID | 8450564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 5507468 |
End bp | 5510428 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645043972 |
Product | NLP/P60 protein |
Protein accession | YP_003204196 |
Protein GI | 258655040 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
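
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (the convention under which End - Start + 1 equals the listed gene length) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not fields defined by the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 5507468
end_bp = 5510428
gene_length_bp = 2961
protein_length_aa = 986

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
coding_codons = codons - 1

print(span, remainder, coding_codons)
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and leaves 986 coding codons, in agreement with the listed protein length.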
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCGGGC GACAGCGTCG GGTACGTGAC GCGCTCACGC GCGCCGTCCG ACCGCTGCGC CGCCGGTCCC CGCCGGCCGG AATTCCGGAC GGTTGGCTGC CGCTGACCCA GCGTCCCTCC TACGAACCGC CCGTGCCGGC CCAGCCACCG GTCGAACGTT CGGCCCTGTC GGTGCGGGTC CTGGCCGGCG CCGCGGTCGG CGCGGTCGTG ATGGCCCTGG CCGTGCCGTC GGCCTCCGCC CTAGCCGTGG GTCCCATCGG TCCCAGCAGC GTCAGCCTCG GCTCGGCCCT GCCGATCTGC GCTCCCGGTC AGGAATCCAC CCCGGTCTCC GGCTCCGGCA CCCCGGCCCC GTTGGGCAAC CTGCAGGGGT TGCCTGCCGA CCTGGCCGAC GGCGGGGTGT TCGCCGGCGT CGCGCTGGAC GGTGAGCAGG TCAAGATGGC CGCCACCGTC ATCGCCGTCG GCAAGCAGAT GGGCATCAGC AAGCGCGGCA TCCGGATCGG CATCGCGGTG GCCACCGAGC AGTCCAGCCT GCGGCCGGAG GCGGTCAACA AGGAATGGCT CGGCCTGTTC CAGCAGAACC CGGTCACCTA CACCCAGTAC CGGCGGACCG AGCCGGGCGG CGCCGCCTGG ATGTTCTACG ACCAGCTGAT CAAGCAGGTG CCCGGATACG ACACCGATGC CCGCTCCGAC GACGAGATCG GCGACGTGGT CCAGAAGACC ACCACCGGGT GGCGCTTCGC CGAGTACTCC GACATGGCCG GCGCGCTGGT CGACCAGTTG ATGGACGCGG TCTCGATCCA GCAGGACGAG GTGACCTGCA CGCCGGCCCC GGCCGAGCAG GCCGCCACCG GGTCGGCCTT CGATCCGGGG AACATCATCT CGGACGCGGT CTTCTACAAC AGCTCGGCGA TGACGGCCGA TCAGATCCGC ACCTTCCTGC TGTCCGAGGG GGAGGGCTGC ACCGCCGCCG CCTGTCTGAA GAACCTGCGG ATCAGCACCA CCAGCCAGCC GGCCGACCAG TACTGCCAGG CGTATCCGGG CGGGACCAAC GAGGACGCGG CCACGGTCAT CGCCAAGTTC TCCACCGCCT GCGGCATCAA TCCGCAGGTC ATGCTGGTCA CCCTGCAGAA GGAGTCCGGC CTGCTCAGCC GGACCGACAC CACCGCCGCC TCCTACAACG CGGCCTGGGG CTGGCACTGC CCGGACAGCG GGCCCGGCGG CACCGCCAAC TGCGACCCCG CCTACGCCGG CTTCTTCAAC CAGGGCTACG GCATGGCCAA GCAGTGGGCC CGGTACCGGG TCGATCCGGG CAAGTACAAC TACCAGGCCG GCCAGACGGT CACCATCCTG TGGAACGTCG CCGAGTCCGG CTGCGGCGGC GCCCCGGTGA CGATCAAGAA CCAGGCCACC GCGTCGCTGT ACAACTACAC GCCGTACCAG CCCAACGCCG CGTCCCTGGC CGCCTACCCC GGCGCGGGCG ACGCCTGCTC GGCCTACGGC AACCGCAACT TCTTCTTCCT GTTTCGCAAG TACTTCGGCT CGACCGGCGG CGGCGCCTCG ACCACCGCCG TCAACTCGGC CGTGCTGGCC ACCGGCACCA CGGTCAAGGT CCCCAATAAC CCCTATGTCT CCGGCGCCCT GGCCGGGCAG ACGATCACCG CGCCCACCCC CCAGGTCGCT GCCGGGCTGG CGGCCGGGTT CAGCGCGCTC GGCCTGCCCT ACGTCTGGGG TGGCGGCGGC TCCGGCGCCG GCCCCAACAA CGGCTGCGCC CGCGGCGGCG GCGACTACAA CAGCTGCGGA 
CCCGAGATCG GCTTCGACTG CTCGGGCCTG ACCGCATACG TGCTGGGCAT GGCCGGGTAC CAGACCCCCG GTGACTCCGG TTCGCAGCGA TCGGCCGGGG TCTCGGTCAG CTGGTCGCAG GCGCTGCCCG GCGACATCGT CGGCTTCCCC GGGCACGTGG CCGTCTACCT GGGCACCTTC GGTGGCCGGC CCTACATCCT GGAAGCGTCC TGGGTCGGCA CCCCCGTGCA CATCGTGCCG CTGACCCGCA CCGACATGGA CGACCGGGTG CACCGCTACT GGACCGGCGC CGCGGTCCGC CCGTCCGGCG TGGCCGACTT CTCCTCGATC GTGCGGACCT ACTCGTACAC GGCGCCCACC TACCGCTCGA GCATCACCTC GTTCAGCGGG GCGGTCGCCG CCGGCTCCGG CTCGTCGCCG ACCTACACGC CGAACATCCC CCGGATCCGC CCGGTGCCGT ACCCGGCGCC CGCGCCGGTG GTCACCGTGC CCGTCCCGGT TGCCGAGGTG CCACCGGTGA GCACGCCGAC CCCGCCGGCT GCGACCACCG CGCCCCCGGC CGAAGCGACC CCCACGCCGC CCGCCGCCCC GCCGTCATCG GTCACCGCCA CGACCGCTCC GGTCCCGACG ACCGCGACCA CTGGGTCTTC GACCTCGGTG ACCACCACGG CGACCTCAGC CGACTCGACC AGCCCGACAC CGACCACCAC CACCACCTCG CCCGCAACCA CGACGGGATC GCCGACCACC TCGGCCAGCC CGTCGTCTTT GGCGCCGACG TCATCATCGG GGACGTCATC GGGGACCTCG ACGGAATCGA GCGCGACCGG CACGACCGCA TCGCCGTCCG CGACCGCTGC GCCCCCCGCC ACCGAGTCGT CCACGGCCGA GTCCTCCACG GCCGAGTCCT CCGCGGCCGA GTCGTCCGCT GACTCGACCA CTGCGGAGTC GACCACTGCG GAGTCGGCCA CTGCGGAGTC GGCCAGCGCT GAATCAACCA CCGCGGAGTC GACCGGCGAC CCGGCCCCGA CCGACCCGCC GGCACCGATC AAGCCGGTGT CCTGCGCGGA CCTGTCGGCC GTGTTGACCG CCAAGAACGG TTTCGCCACC GCGACCGACG GCACCCCGCT GCCCGACTGC GACGAGCCCG ACCTGCCCTG A
|
Protein sequence | MTGRQRRVRD ALTRAVRPLR RRSPPAGIPD GWLPLTQRPS YEPPVPAQPP VERSALSVRV LAGAAVGAVV MALAVPSASA LAVGPIGPSS VSLGSALPIC APGQESTPVS GSGTPAPLGN LQGLPADLAD GGVFAGVALD GEQVKMAATV IAVGKQMGIS KRGIRIGIAV ATEQSSLRPE AVNKEWLGLF QQNPVTYTQY RRTEPGGAAW MFYDQLIKQV PGYDTDARSD DEIGDVVQKT TTGWRFAEYS DMAGALVDQL MDAVSIQQDE VTCTPAPAEQ AATGSAFDPG NIISDAVFYN SSAMTADQIR TFLLSEGEGC TAAACLKNLR ISTTSQPADQ YCQAYPGGTN EDAATVIAKF STACGINPQV MLVTLQKESG LLSRTDTTAA SYNAAWGWHC PDSGPGGTAN CDPAYAGFFN QGYGMAKQWA RYRVDPGKYN YQAGQTVTIL WNVAESGCGG APVTIKNQAT ASLYNYTPYQ PNAASLAAYP GAGDACSAYG NRNFFFLFRK YFGSTGGGAS TTAVNSAVLA TGTTVKVPNN PYVSGALAGQ TITAPTPQVA AGLAAGFSAL GLPYVWGGGG SGAGPNNGCA RGGGDYNSCG PEIGFDCSGL TAYVLGMAGY QTPGDSGSQR SAGVSVSWSQ ALPGDIVGFP GHVAVYLGTF GGRPYILEAS WVGTPVHIVP LTRTDMDDRV HRYWTGAAVR PSGVADFSSI VRTYSYTAPT YRSSITSFSG AVAAGSGSSP TYTPNIPRIR PVPYPAPAPV VTVPVPVAEV PPVSTPTPPA ATTAPPAEAT PTPPAAPPSS VTATTAPVPT TATTGSSTSV TTTATSADST SPTPTTTTTS PATTTGSPTT SASPSSLAPT SSSGTSSGTS TESSATGTTA SPSATAAPPA TESSTAESST AESSAAESSA DSTTAESTTA ESATAESASA ESTTAESTGD PAPTDPPAPI KPVSCADLSA VLTAKNGFAT ATDGTPLPDC DEPDLP
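
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, which lists TTG among its initiation codons, and an initiation codon is read as methionine. A minimal sketch of that rule for the first five codons follows. The codon map is deliberately partial (only the codons used here), and the function name is hypothetical rather than taken from any library.

```python
# Partial standard codon map: only the codons needed for this illustration.
CODON_TO_AA = {"TTG": "L", "ACC": "T", "GGG": "G", "CGA": "R", "CAG": "Q"}

# A subset of the initiation codons of translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate complete codons of a coding-strand sequence.

    The first codon is read as M when it is a recognised initiation codon;
    every other codon is looked up in the partial map above.
    """
    usable = len(dna) - len(dna) % 3
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODON_TO_AA[codon])
    return "".join(residues)


# First 15 bases of the gene sequence listed above.
prefix = translate_prefix("TTGACCGGGCGACAG")
print(prefix)
```

The result, MTGRQ, matches the first five residues of the listed protein sequence. Translating the whole gene would need a complete codon table and stop-codon handling, which this sketch omits.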