Gene Noca_4719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4719 
Symbol 
ID4595354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp18154 
End bp21393 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content71% 
IMG OID639772508 
Productpeptidoglycan-binding LysM 
Protein accessionYP_919168 
Protein GI119714026 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGGAT TGGTCAAGCG ACTCGGAGCC CTCGCACTGC TGTTGGGCCT CCTCGTCGGG 
TTCCCTGCCC TGCTGCTCGC GGCTGTGGGC AACCCGTGGC CTGACGGCGG GTTGAACGAG
CTGTCGCTCA TGAGCAACTC GGCCGTGCTC GGGATCGTCT CGCTGTTGGG CTGGTTCGTG
TGGGGGCAGC TGCTGGTGTG CACCCTGTGG GAGATCCCGC CGGCCCTGCG TCACGAGACC
GAGGGGGCCT CGAGGCTGCC GATCGCGGTC GGTGGTCAGC AGAGGTTCAT CCGGATACTG
GTCCACACAG TGCTCGCGGT CGGAGTCACC TCGACGGCAC TGCTCGGGTC GCACGCAGCC
ACCGCGGAGG CTGCACCGGC CGCACCGCTG CGGCCTGTCA CCCATGTCGC ACACCAGACT
CCGGCCGCGA ACCCGGAGAC AAGTCCCGCC GTATCGCCGG CACAGGCCGT CCCGGACACC
ACCCATCGGG CCGACCGCCC GAGGATCGTC ACCGACAAGG GCGACACCCT GTGGGGCCTG
GCCGAGAAGC ACCTAGGAGA CGGGTTCCGT TGGCAGGAAA TCGCGGACCT GAACCATGGC
CGGGTCATGG TCGACGGACG CACCTTCAAC AACCCGCGCA GCATCGAGCC TGGCTGGGAA
CTGCTGCTGC CCGCCGGTGC GACCGGCCTG CCCGGCGACC AGACCTCCGC GACCGAGCAC
GTCGTCGCGC CGGGTGAAAC GCTGACCGGG ATCACCAAGG AGACCACTGG CGACCCCGAC
AACTGGCAGG CGCTCTACGA GGCAAACAAG GACGTCATCG GCAGCGACCC CGACCTGATC
TACCCCGGCC AGGTTCTCGT ACTGCCGGGT ACTGGTGCCG TCGACCCGGA CACGGGCACC
GACCGGCCGC ACCAGCGAGG GCCCGGTCAG CGCGACGAGC CCGGCACGTC CACCGGCGGC
CAGGACGAGC AGGCGGACGA CACGGGGGAC GACCGCTCCA CGGGAGCCGA CGACGCGGAG
GGACAGGAGC AGCAGCAGAC GCCCGCCGAG CCGGCCGCCC CGACCGCACC GTCGCAGCGA
TCGGCGCAGG GGGCCCCGGA CAGCACGGAG CAGCAGGCCT CGACCGAAGC CGACGAGGCC
AGCGACGAGG GCGGGATCAC GGGCCTGCGG GCCCTGCTGG CCACGGCCCT GTGTCTCTCG
GCGGGAGCGC TGGGACTGGT GATCGCCAAC CGTCGTCGGC AGTTCCGTCA GCGCCCGATC
GGTCGGACGA TCGCCTCGAC TCCCGACGAG CTGCTCGAGG TGGAACAGGC GATCCTCGAG
CACGGCTCAG AGGCACAGAA GGACGCGGAG TTCCTCGACC GGGCACTGCG GCACGTCGCG
GCGTCGTGCA AGGTGGCCGG CGACCGGCTT CCCCAGCTGG GTGCCGCCGT GCTCGGCGAT
GAGGACCTGA CTCTCCTGTT CACCCAGCCC GCCCCCGGGC AGGTGCCGGA GGGCTGGACC
GCGACCGATG ACGCCCAGGC CTGGATGCTG CCGCGCTGGA CCTTCCTCGA GGAGGACCTC
GAGAACCAGC CGGCCCCGTA CCCGGCGCTG GTGAGCATCG GCCTCGACGA GAGCGGCCGC
ACCTGGTACC TGGACCTGGA GACCCTGGGC GTGTGCGGCA TCGGAGGTGA GCCGGAGCAG
GTCGCCGACA TGGCCCGCTT CATGGTGGCC GAGCTCGCCG TCAACGACTG GGCCAATGGC
TGCGAGGTTC TGCTGGCCGA CCAGTTCGCC GCCGAAGCGA TCCGACTCAA CCCGGCGCGG
CTCCGGCAGG TCAATCGCAG CGACGCGCTG GCCCGGGCTG CCGTGCTGAC CGGCGAGATG
GGTGAGGGCG AGCACAACCT CGACGCTGAC GTGCTGACGC GACGCCGCGA CGGGCTGTTG
CTGGACACGA CCAGCCCGGT GGTGGTTGTG CTTGCATCCC GTCCCGAGGG CGAGTTCGTC
AGTGACATCG AGGACCGGGA CCGCTCCCGG GTGGTGGTGG TGCACGGGGA TGAGGAGTCG
CCGGCGGTCG AGCTGTCCGG CGATGGGATG GCGTTCCTGC CGGTGTGGGG GATCAGTGTC
AAGGCACTGA CCTTGAAGCC GGAGCACGTG GCGCCGATGG CGGACCTGTT GGCGGCCACC
CGCAGCCTCG AGGACGAGCC GGTGCCGGCC ATGCAGTCCG ACGATGGTCC GCTGGGCAAG
TACGCCCGTG CTGACGGGTC GCTGCGTGAG GAGTACACAG AGCCGCGGCA CACTGAGGGC
AACGACCGCT CCTCGATGCT GCCCGATGCC GACGAGGTCT ACCTGGCCAC CGCGGCAACG
ACCGCCGAAG ACCTCGCCGC GGCCGCGCCG AGTGTTCCCG TGGAGGTCCG CGCCGAGATC
GCGGCGCTCG ACCCGACGCT GGACCAAGAC GTCGCTGACT GGTTCGACGA GTCCTCGCCG
CGTCCCAAGG TGAACCTGCT CGGTCCGGTC TCGGTCACCG CACTCAACGG CGGCGACCCC
GCGGCGATCG ACAACGCTGG CGGAACTGTC TCCTTCATCG CCTACCTGGC CTGGCAGGAC
CGCGGAGTCA CCGGGGAGCG TGCCGCCGCA GCATGCGGGT GGCAGACCCA GAAGACGGTT
CAGAATCGCG CCACCAACGC CCGCTTCCTG CTGGGAACCC GGCCCGACGG CAGCGACTGG
CTCCCCGACG CGAGCATGAG CGCCAGTGCC CGCCGAGGCC ACAGCCCGAC CTACGAACTG
GTGAGGGACA GCGGCGGGGT GCTGAGCGAC GCCGACCTGT TCATCCGGCT GAAGCACCGG
GCACAGCGGC GAGGCGACAA CGGGTGCGAG GAGGACTTGG TGACCGCGCT GTCGCTGGTC
ACCAGCACGC CGTTCGAGGG GATCACCGAG AGCCGGTTCA AGTGGGTCTT TAAGGAGGGG
CTGCAGCGGC CCGACGTCAT CTTGGCTGGC GCGATCCATG ACGCGGCCCA CACTCTGGCG
ACGCGGGCCG TCACCGAGGG CCGCACCGAT CTGGTCCGGC TGGCCTGTGA CGCGGCCCGG
AGGGCAAACC CCCACGGCGA CATCGCATGG CTGGATCTGG CCGCGGCCAC CGAGGCGGAA
TCCGGTCGCG AGGCCGCCGG TCAACTGGTC CGCGAGCAGG TCCTCGAGCA GTTCGACGAG
GACCTCCCGC CGCGCTCGGA GGCCATCGTG GAGCAGCGCG AGTGGGGTGC CACCGGCTAG
 
Protein sequence
MLGLVKRLGA LALLLGLLVG FPALLLAAVG NPWPDGGLNE LSLMSNSAVL GIVSLLGWFV 
WGQLLVCTLW EIPPALRHET EGASRLPIAV GGQQRFIRIL VHTVLAVGVT STALLGSHAA
TAEAAPAAPL RPVTHVAHQT PAANPETSPA VSPAQAVPDT THRADRPRIV TDKGDTLWGL
AEKHLGDGFR WQEIADLNHG RVMVDGRTFN NPRSIEPGWE LLLPAGATGL PGDQTSATEH
VVAPGETLTG ITKETTGDPD NWQALYEANK DVIGSDPDLI YPGQVLVLPG TGAVDPDTGT
DRPHQRGPGQ RDEPGTSTGG QDEQADDTGD DRSTGADDAE GQEQQQTPAE PAAPTAPSQR
SAQGAPDSTE QQASTEADEA SDEGGITGLR ALLATALCLS AGALGLVIAN RRRQFRQRPI
GRTIASTPDE LLEVEQAILE HGSEAQKDAE FLDRALRHVA ASCKVAGDRL PQLGAAVLGD
EDLTLLFTQP APGQVPEGWT ATDDAQAWML PRWTFLEEDL ENQPAPYPAL VSIGLDESGR
TWYLDLETLG VCGIGGEPEQ VADMARFMVA ELAVNDWANG CEVLLADQFA AEAIRLNPAR
LRQVNRSDAL ARAAVLTGEM GEGEHNLDAD VLTRRRDGLL LDTTSPVVVV LASRPEGEFV
SDIEDRDRSR VVVVHGDEES PAVELSGDGM AFLPVWGISV KALTLKPEHV APMADLLAAT
RSLEDEPVPA MQSDDGPLGK YARADGSLRE EYTEPRHTEG NDRSSMLPDA DEVYLATAAT
TAEDLAAAAP SVPVEVRAEI AALDPTLDQD VADWFDESSP RPKVNLLGPV SVTALNGGDP
AAIDNAGGTV SFIAYLAWQD RGVTGERAAA ACGWQTQKTV QNRATNARFL LGTRPDGSDW
LPDASMSASA RRGHSPTYEL VRDSGGVLSD ADLFIRLKHR AQRRGDNGCE EDLVTALSLV
TSTPFEGITE SRFKWVFKEG LQRPDVILAG AIHDAAHTLA TRAVTEGRTD LVRLACDAAR
RANPHGDIAW LDLAAATEAE SGREAAGQLV REQVLEQFDE DLPPRSEAIV EQREWGATG