Gene Namu_4933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4933 
Symbol 
ID8450564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5507468 
End bp5510428 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content72% 
IMG OID645043972 
ProductNLP/P60 protein 
Protein accessionYP_003204196 
Protein GI258655040 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGGGC GACAGCGTCG GGTACGTGAC GCGCTCACGC GCGCCGTCCG ACCGCTGCGC 
CGCCGGTCCC CGCCGGCCGG AATTCCGGAC GGTTGGCTGC CGCTGACCCA GCGTCCCTCC
TACGAACCGC CCGTGCCGGC CCAGCCACCG GTCGAACGTT CGGCCCTGTC GGTGCGGGTC
CTGGCCGGCG CCGCGGTCGG CGCGGTCGTG ATGGCCCTGG CCGTGCCGTC GGCCTCCGCC
CTAGCCGTGG GTCCCATCGG TCCCAGCAGC GTCAGCCTCG GCTCGGCCCT GCCGATCTGC
GCTCCCGGTC AGGAATCCAC CCCGGTCTCC GGCTCCGGCA CCCCGGCCCC GTTGGGCAAC
CTGCAGGGGT TGCCTGCCGA CCTGGCCGAC GGCGGGGTGT TCGCCGGCGT CGCGCTGGAC
GGTGAGCAGG TCAAGATGGC CGCCACCGTC ATCGCCGTCG GCAAGCAGAT GGGCATCAGC
AAGCGCGGCA TCCGGATCGG CATCGCGGTG GCCACCGAGC AGTCCAGCCT GCGGCCGGAG
GCGGTCAACA AGGAATGGCT CGGCCTGTTC CAGCAGAACC CGGTCACCTA CACCCAGTAC
CGGCGGACCG AGCCGGGCGG CGCCGCCTGG ATGTTCTACG ACCAGCTGAT CAAGCAGGTG
CCCGGATACG ACACCGATGC CCGCTCCGAC GACGAGATCG GCGACGTGGT CCAGAAGACC
ACCACCGGGT GGCGCTTCGC CGAGTACTCC GACATGGCCG GCGCGCTGGT CGACCAGTTG
ATGGACGCGG TCTCGATCCA GCAGGACGAG GTGACCTGCA CGCCGGCCCC GGCCGAGCAG
GCCGCCACCG GGTCGGCCTT CGATCCGGGG AACATCATCT CGGACGCGGT CTTCTACAAC
AGCTCGGCGA TGACGGCCGA TCAGATCCGC ACCTTCCTGC TGTCCGAGGG GGAGGGCTGC
ACCGCCGCCG CCTGTCTGAA GAACCTGCGG ATCAGCACCA CCAGCCAGCC GGCCGACCAG
TACTGCCAGG CGTATCCGGG CGGGACCAAC GAGGACGCGG CCACGGTCAT CGCCAAGTTC
TCCACCGCCT GCGGCATCAA TCCGCAGGTC ATGCTGGTCA CCCTGCAGAA GGAGTCCGGC
CTGCTCAGCC GGACCGACAC CACCGCCGCC TCCTACAACG CGGCCTGGGG CTGGCACTGC
CCGGACAGCG GGCCCGGCGG CACCGCCAAC TGCGACCCCG CCTACGCCGG CTTCTTCAAC
CAGGGCTACG GCATGGCCAA GCAGTGGGCC CGGTACCGGG TCGATCCGGG CAAGTACAAC
TACCAGGCCG GCCAGACGGT CACCATCCTG TGGAACGTCG CCGAGTCCGG CTGCGGCGGC
GCCCCGGTGA CGATCAAGAA CCAGGCCACC GCGTCGCTGT ACAACTACAC GCCGTACCAG
CCCAACGCCG CGTCCCTGGC CGCCTACCCC GGCGCGGGCG ACGCCTGCTC GGCCTACGGC
AACCGCAACT TCTTCTTCCT GTTTCGCAAG TACTTCGGCT CGACCGGCGG CGGCGCCTCG
ACCACCGCCG TCAACTCGGC CGTGCTGGCC ACCGGCACCA CGGTCAAGGT CCCCAATAAC
CCCTATGTCT CCGGCGCCCT GGCCGGGCAG ACGATCACCG CGCCCACCCC CCAGGTCGCT
GCCGGGCTGG CGGCCGGGTT CAGCGCGCTC GGCCTGCCCT ACGTCTGGGG TGGCGGCGGC
TCCGGCGCCG GCCCCAACAA CGGCTGCGCC CGCGGCGGCG GCGACTACAA CAGCTGCGGA
CCCGAGATCG GCTTCGACTG CTCGGGCCTG ACCGCATACG TGCTGGGCAT GGCCGGGTAC
CAGACCCCCG GTGACTCCGG TTCGCAGCGA TCGGCCGGGG TCTCGGTCAG CTGGTCGCAG
GCGCTGCCCG GCGACATCGT CGGCTTCCCC GGGCACGTGG CCGTCTACCT GGGCACCTTC
GGTGGCCGGC CCTACATCCT GGAAGCGTCC TGGGTCGGCA CCCCCGTGCA CATCGTGCCG
CTGACCCGCA CCGACATGGA CGACCGGGTG CACCGCTACT GGACCGGCGC CGCGGTCCGC
CCGTCCGGCG TGGCCGACTT CTCCTCGATC GTGCGGACCT ACTCGTACAC GGCGCCCACC
TACCGCTCGA GCATCACCTC GTTCAGCGGG GCGGTCGCCG CCGGCTCCGG CTCGTCGCCG
ACCTACACGC CGAACATCCC CCGGATCCGC CCGGTGCCGT ACCCGGCGCC CGCGCCGGTG
GTCACCGTGC CCGTCCCGGT TGCCGAGGTG CCACCGGTGA GCACGCCGAC CCCGCCGGCT
GCGACCACCG CGCCCCCGGC CGAAGCGACC CCCACGCCGC CCGCCGCCCC GCCGTCATCG
GTCACCGCCA CGACCGCTCC GGTCCCGACG ACCGCGACCA CTGGGTCTTC GACCTCGGTG
ACCACCACGG CGACCTCAGC CGACTCGACC AGCCCGACAC CGACCACCAC CACCACCTCG
CCCGCAACCA CGACGGGATC GCCGACCACC TCGGCCAGCC CGTCGTCTTT GGCGCCGACG
TCATCATCGG GGACGTCATC GGGGACCTCG ACGGAATCGA GCGCGACCGG CACGACCGCA
TCGCCGTCCG CGACCGCTGC GCCCCCCGCC ACCGAGTCGT CCACGGCCGA GTCCTCCACG
GCCGAGTCCT CCGCGGCCGA GTCGTCCGCT GACTCGACCA CTGCGGAGTC GACCACTGCG
GAGTCGGCCA CTGCGGAGTC GGCCAGCGCT GAATCAACCA CCGCGGAGTC GACCGGCGAC
CCGGCCCCGA CCGACCCGCC GGCACCGATC AAGCCGGTGT CCTGCGCGGA CCTGTCGGCC
GTGTTGACCG CCAAGAACGG TTTCGCCACC GCGACCGACG GCACCCCGCT GCCCGACTGC
GACGAGCCCG ACCTGCCCTG A
 
Protein sequence
MTGRQRRVRD ALTRAVRPLR RRSPPAGIPD GWLPLTQRPS YEPPVPAQPP VERSALSVRV 
LAGAAVGAVV MALAVPSASA LAVGPIGPSS VSLGSALPIC APGQESTPVS GSGTPAPLGN
LQGLPADLAD GGVFAGVALD GEQVKMAATV IAVGKQMGIS KRGIRIGIAV ATEQSSLRPE
AVNKEWLGLF QQNPVTYTQY RRTEPGGAAW MFYDQLIKQV PGYDTDARSD DEIGDVVQKT
TTGWRFAEYS DMAGALVDQL MDAVSIQQDE VTCTPAPAEQ AATGSAFDPG NIISDAVFYN
SSAMTADQIR TFLLSEGEGC TAAACLKNLR ISTTSQPADQ YCQAYPGGTN EDAATVIAKF
STACGINPQV MLVTLQKESG LLSRTDTTAA SYNAAWGWHC PDSGPGGTAN CDPAYAGFFN
QGYGMAKQWA RYRVDPGKYN YQAGQTVTIL WNVAESGCGG APVTIKNQAT ASLYNYTPYQ
PNAASLAAYP GAGDACSAYG NRNFFFLFRK YFGSTGGGAS TTAVNSAVLA TGTTVKVPNN
PYVSGALAGQ TITAPTPQVA AGLAAGFSAL GLPYVWGGGG SGAGPNNGCA RGGGDYNSCG
PEIGFDCSGL TAYVLGMAGY QTPGDSGSQR SAGVSVSWSQ ALPGDIVGFP GHVAVYLGTF
GGRPYILEAS WVGTPVHIVP LTRTDMDDRV HRYWTGAAVR PSGVADFSSI VRTYSYTAPT
YRSSITSFSG AVAAGSGSSP TYTPNIPRIR PVPYPAPAPV VTVPVPVAEV PPVSTPTPPA
ATTAPPAEAT PTPPAAPPSS VTATTAPVPT TATTGSSTSV TTTATSADST SPTPTTTTTS
PATTTGSPTT SASPSSLAPT SSSGTSSGTS TESSATGTTA SPSATAAPPA TESSTAESST
AESSAAESSA DSTTAESTTA ESATAESASA ESTTAESTGD PAPTDPPAPI KPVSCADLSA
VLTAKNGFAT ATDGTPLPDC DEPDLP