Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4719 |
Symbol | |
ID | 4595354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | - |
Start bp | 18154 |
End bp | 21393 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639772508 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_919168 |
Protein GI | 119714026 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGGAT TGGTCAAGCG ACTCGGAGCC CTCGCACTGC TGTTGGGCCT CCTCGTCGGG TTCCCTGCCC TGCTGCTCGC GGCTGTGGGC AACCCGTGGC CTGACGGCGG GTTGAACGAG CTGTCGCTCA TGAGCAACTC GGCCGTGCTC GGGATCGTCT CGCTGTTGGG CTGGTTCGTG TGGGGGCAGC TGCTGGTGTG CACCCTGTGG GAGATCCCGC CGGCCCTGCG TCACGAGACC GAGGGGGCCT CGAGGCTGCC GATCGCGGTC GGTGGTCAGC AGAGGTTCAT CCGGATACTG GTCCACACAG TGCTCGCGGT CGGAGTCACC TCGACGGCAC TGCTCGGGTC GCACGCAGCC ACCGCGGAGG CTGCACCGGC CGCACCGCTG CGGCCTGTCA CCCATGTCGC ACACCAGACT CCGGCCGCGA ACCCGGAGAC AAGTCCCGCC GTATCGCCGG CACAGGCCGT CCCGGACACC ACCCATCGGG CCGACCGCCC GAGGATCGTC ACCGACAAGG GCGACACCCT GTGGGGCCTG GCCGAGAAGC ACCTAGGAGA CGGGTTCCGT TGGCAGGAAA TCGCGGACCT GAACCATGGC CGGGTCATGG TCGACGGACG CACCTTCAAC AACCCGCGCA GCATCGAGCC TGGCTGGGAA CTGCTGCTGC CCGCCGGTGC GACCGGCCTG CCCGGCGACC AGACCTCCGC GACCGAGCAC GTCGTCGCGC CGGGTGAAAC GCTGACCGGG ATCACCAAGG AGACCACTGG CGACCCCGAC AACTGGCAGG CGCTCTACGA GGCAAACAAG GACGTCATCG GCAGCGACCC CGACCTGATC TACCCCGGCC AGGTTCTCGT ACTGCCGGGT ACTGGTGCCG TCGACCCGGA CACGGGCACC GACCGGCCGC ACCAGCGAGG GCCCGGTCAG CGCGACGAGC CCGGCACGTC CACCGGCGGC CAGGACGAGC AGGCGGACGA CACGGGGGAC GACCGCTCCA CGGGAGCCGA CGACGCGGAG GGACAGGAGC AGCAGCAGAC GCCCGCCGAG CCGGCCGCCC CGACCGCACC GTCGCAGCGA TCGGCGCAGG GGGCCCCGGA CAGCACGGAG CAGCAGGCCT CGACCGAAGC CGACGAGGCC AGCGACGAGG GCGGGATCAC GGGCCTGCGG GCCCTGCTGG CCACGGCCCT GTGTCTCTCG GCGGGAGCGC TGGGACTGGT GATCGCCAAC CGTCGTCGGC AGTTCCGTCA GCGCCCGATC GGTCGGACGA TCGCCTCGAC TCCCGACGAG CTGCTCGAGG TGGAACAGGC GATCCTCGAG CACGGCTCAG AGGCACAGAA GGACGCGGAG TTCCTCGACC GGGCACTGCG GCACGTCGCG GCGTCGTGCA AGGTGGCCGG CGACCGGCTT CCCCAGCTGG GTGCCGCCGT GCTCGGCGAT GAGGACCTGA CTCTCCTGTT CACCCAGCCC GCCCCCGGGC AGGTGCCGGA GGGCTGGACC GCGACCGATG ACGCCCAGGC CTGGATGCTG CCGCGCTGGA CCTTCCTCGA GGAGGACCTC GAGAACCAGC CGGCCCCGTA CCCGGCGCTG GTGAGCATCG GCCTCGACGA GAGCGGCCGC ACCTGGTACC TGGACCTGGA GACCCTGGGC GTGTGCGGCA TCGGAGGTGA GCCGGAGCAG GTCGCCGACA TGGCCCGCTT CATGGTGGCC GAGCTCGCCG TCAACGACTG GGCCAATGGC TGCGAGGTTC TGCTGGCCGA CCAGTTCGCC GCCGAAGCGA TCCGACTCAA CCCGGCGCGG CTCCGGCAGG TCAATCGCAG CGACGCGCTG GCCCGGGCTG CCGTGCTGAC CGGCGAGATG GGTGAGGGCG AGCACAACCT CGACGCTGAC GTGCTGACGC GACGCCGCGA CGGGCTGTTG CTGGACACGA CCAGCCCGGT GGTGGTTGTG CTTGCATCCC GTCCCGAGGG CGAGTTCGTC AGTGACATCG AGGACCGGGA CCGCTCCCGG GTGGTGGTGG TGCACGGGGA TGAGGAGTCG CCGGCGGTCG AGCTGTCCGG CGATGGGATG GCGTTCCTGC CGGTGTGGGG GATCAGTGTC AAGGCACTGA CCTTGAAGCC GGAGCACGTG GCGCCGATGG CGGACCTGTT GGCGGCCACC CGCAGCCTCG AGGACGAGCC GGTGCCGGCC ATGCAGTCCG ACGATGGTCC GCTGGGCAAG TACGCCCGTG CTGACGGGTC GCTGCGTGAG GAGTACACAG AGCCGCGGCA CACTGAGGGC AACGACCGCT CCTCGATGCT GCCCGATGCC GACGAGGTCT ACCTGGCCAC CGCGGCAACG ACCGCCGAAG ACCTCGCCGC GGCCGCGCCG AGTGTTCCCG TGGAGGTCCG CGCCGAGATC GCGGCGCTCG ACCCGACGCT GGACCAAGAC GTCGCTGACT GGTTCGACGA GTCCTCGCCG CGTCCCAAGG TGAACCTGCT CGGTCCGGTC TCGGTCACCG CACTCAACGG CGGCGACCCC GCGGCGATCG ACAACGCTGG CGGAACTGTC TCCTTCATCG CCTACCTGGC CTGGCAGGAC CGCGGAGTCA CCGGGGAGCG TGCCGCCGCA GCATGCGGGT GGCAGACCCA GAAGACGGTT CAGAATCGCG CCACCAACGC CCGCTTCCTG CTGGGAACCC GGCCCGACGG CAGCGACTGG CTCCCCGACG CGAGCATGAG CGCCAGTGCC CGCCGAGGCC ACAGCCCGAC CTACGAACTG GTGAGGGACA GCGGCGGGGT GCTGAGCGAC GCCGACCTGT TCATCCGGCT GAAGCACCGG GCACAGCGGC GAGGCGACAA CGGGTGCGAG GAGGACTTGG TGACCGCGCT GTCGCTGGTC ACCAGCACGC CGTTCGAGGG GATCACCGAG AGCCGGTTCA AGTGGGTCTT TAAGGAGGGG CTGCAGCGGC CCGACGTCAT CTTGGCTGGC GCGATCCATG ACGCGGCCCA CACTCTGGCG ACGCGGGCCG TCACCGAGGG CCGCACCGAT CTGGTCCGGC TGGCCTGTGA CGCGGCCCGG AGGGCAAACC CCCACGGCGA CATCGCATGG CTGGATCTGG CCGCGGCCAC CGAGGCGGAA TCCGGTCGCG AGGCCGCCGG TCAACTGGTC CGCGAGCAGG TCCTCGAGCA GTTCGACGAG GACCTCCCGC CGCGCTCGGA GGCCATCGTG GAGCAGCGCG AGTGGGGTGC CACCGGCTAG
|
Protein sequence | MLGLVKRLGA LALLLGLLVG FPALLLAAVG NPWPDGGLNE LSLMSNSAVL GIVSLLGWFV WGQLLVCTLW EIPPALRHET EGASRLPIAV GGQQRFIRIL VHTVLAVGVT STALLGSHAA TAEAAPAAPL RPVTHVAHQT PAANPETSPA VSPAQAVPDT THRADRPRIV TDKGDTLWGL AEKHLGDGFR WQEIADLNHG RVMVDGRTFN NPRSIEPGWE LLLPAGATGL PGDQTSATEH VVAPGETLTG ITKETTGDPD NWQALYEANK DVIGSDPDLI YPGQVLVLPG TGAVDPDTGT DRPHQRGPGQ RDEPGTSTGG QDEQADDTGD DRSTGADDAE GQEQQQTPAE PAAPTAPSQR SAQGAPDSTE QQASTEADEA SDEGGITGLR ALLATALCLS AGALGLVIAN RRRQFRQRPI GRTIASTPDE LLEVEQAILE HGSEAQKDAE FLDRALRHVA ASCKVAGDRL PQLGAAVLGD EDLTLLFTQP APGQVPEGWT ATDDAQAWML PRWTFLEEDL ENQPAPYPAL VSIGLDESGR TWYLDLETLG VCGIGGEPEQ VADMARFMVA ELAVNDWANG CEVLLADQFA AEAIRLNPAR LRQVNRSDAL ARAAVLTGEM GEGEHNLDAD VLTRRRDGLL LDTTSPVVVV LASRPEGEFV SDIEDRDRSR VVVVHGDEES PAVELSGDGM AFLPVWGISV KALTLKPEHV APMADLLAAT RSLEDEPVPA MQSDDGPLGK YARADGSLRE EYTEPRHTEG NDRSSMLPDA DEVYLATAAT TAEDLAAAAP SVPVEVRAEI AALDPTLDQD VADWFDESSP RPKVNLLGPV SVTALNGGDP AAIDNAGGTV SFIAYLAWQD RGVTGERAAA ACGWQTQKTV QNRATNARFL LGTRPDGSDW LPDASMSASA RRGHSPTYEL VRDSGGVLSD ADLFIRLKHR AQRRGDNGCE EDLVTALSLV TSTPFEGITE SRFKWVFKEG LQRPDVILAG AIHDAAHTLA TRAVTEGRTD LVRLACDAAR RANPHGDIAW LDLAAATEAE SGREAAGQLV REQVLEQFDE DLPPRSEAIV EQREWGATG
|
| |