Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1548 |
Symbol | |
ID | 4595488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1638813 |
End bp | 1641968 |
Gene Length | 3156 bp |
Protein Length | 1051 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639776147 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_922749 |
Protein GI | 119715784 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.637066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGTGA TTCGCCGTGC CGGGATCGTC CTTCGCGGCC TGCTCGCCGC GGTCGTCCTC CTGGTCCTGG TCGTGGGAGT TCCGGCCGCG CTGGCAGTCA CCGTCGGCAA CCCAGTTCCT GAAGGCTGGG CCTGGGGCCC ACCGCTCACG AATTCGGCTC TGCTCGGCAT CCTGGCCTGC GTTGCCTGGG TGCTCTGGGT ACAGCTCTTG GTCTGCGTCA TCGTCGAGAC AGTCGCCGAG ATCAGGCTCG CAGCAGGGCA CTCGGCGGAA TGGCTCGCGC GGGTTCCAGG CACCTTCGGC GTTCAGCAGT CGCTGGCCCG CGCTCTCGTC CAGGCGGTAG TCGCGATCGG TGCGACGACC ACCGCCGCCT CCGTCATTGC GACGCCCTGG ATCTTGCACG CCGATGCCTC GACCTCCAGC CCGGCGCCGG CATCAGATCC CGCCGTGGCC GATCCCCCCG CTGTGGCGCC GACGCCGCAA CTGCCGGCCA AGCACAGATC GCAGCAGACG ACCGAAGTCG TGGTCGCCCG CGGAGACACC CTGTGGTCGA TCGCGGAACA TCACCTTGGT GCGGGCGAGC GCTGGCGAGA GATCGCCGAT CTGAACAGGG ACCGCGAGAT GGTCGACGGG TCGAGGTTCG ACGACACTCG GACCATCCTG CCCGGCTGGA CGTTGCTGGT CCCATCGGCG GACCCGTCGC ACCATCCGGC CGTCACGGTT GCGCCGGGCG ACACCTTGTG GGAGATCGCG ACCGAGGAGT ACGGCGACGG CTCCAAGTGG CCGCGGATCT ACCGGGCCAA CGACGACGAG ATCGAGGACC CGGATCTGAT CTACCCCGGA CAGCGGCTCG ACGTATCAGG ATCGCGGACC GTCGAGCCGT CGAAGCCCCC GACGCAGCAC GAGACTGATC CGCCTGCCGC CGACACCACA ACACGTCCAC AGGAGCCGGG GCCTTCGGAT CACGCAGTGC CCTCGACCGA ACCGACCCCC GAGTCCACGC CGACGGAACC GGCCAATGCG GCCGCCGAAC CCGGGCGGGA ACACGACGAC GCCCCCGACG ACTCGGAATT CCACGTCGAC GGCGCCACGG TTGCTCGGGC ACTGCTGAGC GGGGGCGGCT TCCTCGCCGC CGGAATGCTC GCGATCTACG TCGCCCGGCG GCGGACTCAG TCACGCAACC GCAAGTCCGG CAAGGCGGCG CCGCCCGTGG CAGCGCACCT GCGGGCCGAG GACAAGGCGC TACGGGCCAT CGGCTCGTCG GCGAGCGAGC GCGCGGCGTA CTTCGATGCG CTGCTGCGCG AGTTCCCCGC GCTCTGTGAG CAGGCGAAAC TCGATCTGCC CGAGGCGGTC GCCGCCAGCC TCGGCAGCCA CGCGCTCGAA CTGCACCTGC GGATACCGTC GCCAACGGCG CCGCCTCCGT GGAAGGTGTC TCAGGGCGGG ACCGTGTGGA GTGTGTCGAC AGGGCGCCGA CCCACGTTGA CGGATCGCAC GCCGGCGTAC CCAGCGATGG TCACGGTCGG GGTGGACGAC GAGGGCTGCA CGCTGTTCAT CGACATCGAG GGCGCGGGCG TCGTACAGAT CGTCGGGGAC GCCGCGGCCG CGGTCGAGCT GGCTCGGTTC ATGGCTGCCG AACTGGCTCT GAACCCCTGG TCGGACTGCG AGTCGATCGA AGTGAGCGGA GGGGTCGAGG ACGTCCTGCC GCTCAACTAC GGGCGCCTGT ACGCCTCACC GAAGGCCGAA GTCGAGCAGC TCGCGAAGTT CGCTCATCGA ATGGCCGAGG GAGTCGAGTC GGCCGGGACG AGCGTCCTCG GGTCGCGGGT GACCGATCGG GACGCCTGCG TCCCGTCGCT CAGCATCTCC GCGTTGGCGA ACGAGGATCT GTCAACGGGG CGGGAGCACA CAGCCGCCCT GCTCGACGAG ATGGAGCGAG CGCCGGGTCG GACGTCGGCC GGGTTGGTCG TCGCTGCGGC CGACGTCGTG GATCCGCGTG CCACCACGCT CGAGTTCAGA TCAGACGGCG ATCTCGTCAC GCCGTGGGGT GTCGTCCGAC CGAACCGGCT GAGGACGGAG GAGGCCGCCG TTCTGGCGCA GCTGTTCGAG GACGCGGAAA CCGAAGGTGA CGTCGACGTA CCGGCCGCCG TTGATGCACA CGGTGAGACG ACCAACACGG ACCAAGCCGG GGCGATTGCA CAGGAGCTCA CCGAGGCACG TTGCGGCACC GGCGATCCCG AGTCGACGCT TCCCCGCCCG GATCGGGCGT ACGTCGAATC CGCGGCCACC ACATTCGAGG ACCTCGCCGT ACTCGCACCA GCTGTGCCCA AGAGTGAGGA AGCAGCCGCC CTCGCATCCG ACCCCACGCT CGACCAGGAC CTTGCCGACT GGGCCGACCC GGGGTGCCCT CGGCCGAAGC TCCGCGTCCT CGGGCCGGTC GAGCTTCGCG CGGTCGGGGA GAAGACCGCG GAGGTCGAGG GGCGACCGGC GTACTACTCC GAACTTGCCG CGTATCTGGC CTGCCATCCC GAAGGCGTAA CGCCGAACCG GGTCGCGGCG GACTTCGGCA TCCAGAACAA CACGCTCCAC TCTCGGCTCA CCGGGCTACG GAAGTGGATG GGGAACAAGC CAGGCACCGA GGAGTGGTAC CTGCCGGTCG CGCATCGGGT TCGCGGGCAG CAGGTCTACC AACTCTCCGA TGTGCTCGTC GACGGAGACC TCTTCCGGCG ACTGCGCGGT CGCGGCGAGG CTCGCGGCCC GAGCGGCATC GATGACTTCA GGACGGCGCT GGAGCTGGTG GCTGGCCAGC CCTACGAGCG GCAGCGCTCG AACGGCTTCG GGTGGCTCGT GGACACCCCG GTCGACCAGT ACGCCGTCGT CGCGATCGTC GACGTGGCTC ACGTGTATGC GACCCACTCG CTCGCGGAGG GTCGGCCCCG TGACGCCATG TGGGCGGCCG AGCGGGCCAT CGCGGCCGCA CCGTCGGAGG ACAAGCCGCG GCTCGACCTG GCCAAGGCGA TGCAGGCTTT GGGCGAGGTA GACGAGGCGG ATCGCTACCT CGGCCGAGAG GTGTTCAACC GAACCGACGA CGACCGTGCG CCTCTCGATC CGTCCGATCG CACGAAGGAG ATCTTCCACC GGATGGATCG GCCACACAGG GGCTGA
|
Protein sequence | MTVIRRAGIV LRGLLAAVVL LVLVVGVPAA LAVTVGNPVP EGWAWGPPLT NSALLGILAC VAWVLWVQLL VCVIVETVAE IRLAAGHSAE WLARVPGTFG VQQSLARALV QAVVAIGATT TAASVIATPW ILHADASTSS PAPASDPAVA DPPAVAPTPQ LPAKHRSQQT TEVVVARGDT LWSIAEHHLG AGERWREIAD LNRDREMVDG SRFDDTRTIL PGWTLLVPSA DPSHHPAVTV APGDTLWEIA TEEYGDGSKW PRIYRANDDE IEDPDLIYPG QRLDVSGSRT VEPSKPPTQH ETDPPAADTT TRPQEPGPSD HAVPSTEPTP ESTPTEPANA AAEPGREHDD APDDSEFHVD GATVARALLS GGGFLAAGML AIYVARRRTQ SRNRKSGKAA PPVAAHLRAE DKALRAIGSS ASERAAYFDA LLREFPALCE QAKLDLPEAV AASLGSHALE LHLRIPSPTA PPPWKVSQGG TVWSVSTGRR PTLTDRTPAY PAMVTVGVDD EGCTLFIDIE GAGVVQIVGD AAAAVELARF MAAELALNPW SDCESIEVSG GVEDVLPLNY GRLYASPKAE VEQLAKFAHR MAEGVESAGT SVLGSRVTDR DACVPSLSIS ALANEDLSTG REHTAALLDE MERAPGRTSA GLVVAAADVV DPRATTLEFR SDGDLVTPWG VVRPNRLRTE EAAVLAQLFE DAETEGDVDV PAAVDAHGET TNTDQAGAIA QELTEARCGT GDPESTLPRP DRAYVESAAT TFEDLAVLAP AVPKSEEAAA LASDPTLDQD LADWADPGCP RPKLRVLGPV ELRAVGEKTA EVEGRPAYYS ELAAYLACHP EGVTPNRVAA DFGIQNNTLH SRLTGLRKWM GNKPGTEEWY LPVAHRVRGQ QVYQLSDVLV DGDLFRRLRG RGEARGPSGI DDFRTALELV AGQPYERQRS NGFGWLVDTP VDQYAVVAIV DVAHVYATHS LAEGRPRDAM WAAERAIAAA PSEDKPRLDL AKAMQALGEV DEADRYLGRE VFNRTDDDRA PLDPSDRTKE IFHRMDRPHR G
|
| |