Gene Noca_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2229 
Symbol 
ID4598727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2374124 
End bp2377384 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table11 
GC content72% 
IMG OID639776829 
Productpeptidoglycan-binding LysM 
Protein accessionYP_923422 
Protein GI119716457 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACGC CCACCCATCC CACCCTGGGC CAGCGGCTCA CCGGCCTCGC AGCCTCGATC 
GCCGTACTCG GCATCGTCCT CGGGCTGCCC GCACTATTCC TCGCGATCGG TGCCAGCCCA
ATCCCGGACC ACGTCCCGAC CCTGGACGGC ATCAAGAACG CGCTCATGGC GCCCGACGAC
GGCACCCTCG TCCTCGGCCT GTTCAAGGTG ATCGGCTGGG TCGCGTGGGC CTTCATGGCG
CTGAGCCTGG TCGTGGAGGC CATCGCACGG CTCCGCAAGG TCCAGGCGCC CCAGCTGCCG
GGCCTGGGCC GGCCGCAGGC CGCAGCACGC GGCCTCATCG GCCTCGCCGC TCTGCTGTTC
ATCGCCGCGC CCATCGCCGC CCAGGCAGCC ACGATCACCA CGGCCGCCGC GGCGCCGGTG
ACGGTGGGAC CTGTCAACGC CGGTGCCGCC GACCAGGCAC CCGCCCAGCA GGACGTCAAG
GTCGAGACGA AGCAGGAGCG CCAGCGCAAG ACCGTCGACC ACGCTGTGAA GCCGGGGGAG
AGCCTGTGGT CGATCGCCGA GAACCACTTC GGCGACGGTG CCCGCTACAA GGAGCTCGTC
GAACTCAACC GCGACCTCCT CGGCGCGCAG CCGAGCTTCC TCGAGCCCGG TTGGGTTCTC
AAGCTGCCCG CGGTCAACGG CGGGGCACCG GCGCACGACT ACACGGTGCA GCCCACCGAC
ACCCTCAGCG AGATCGCCCA AGAGCAGCTC GGCGACGCCG ACCGCTGGCC CGAGATCTAC
CGGGCCTCCA GCGGCATCAC CCAGCCCGGC GGGGCCCGGC TGACCGACCC CGACGTCATC
GAGGTCGGCT GGAAGCTCAA CATCCCCGGC GCCGAGCAGG GAGGCAGTCA CGACGACACC
CGGCACCAGC AGCCGCGCGA CGTCGAGCCC GAGACGCCCG CCAGCCCCGA GGACCAGCGG
CCCGTCGACC CGCCGGCCGA GGAGGAGCCG CCGGTCGCCC ACGAGCCTGA GGTGCCCGAG
GTACCCGAGA CGGCCGTGCC CGACGCGCCG CCCCAAGCCG AGGCGCCGGC GTCGCCGGCC
GCCGACGTCG ACCAGGTCGA CGACGCTGAC GACTCGATCC TCGACGCGCC GTGGGTCCTC
GCCGGTCTGA CCGGCGGGGG CGTCCTGCTG TCCGGGGCGC TGCTCATGGC CCTGCGGTCC
CGCCGCCGCG CCGGCTACCG GAACCGGACG CCCGGCCGCG CGATCGCCGC GCCGCCTTCG
GAGCTCGCGC CGGTCGAGAT GACCCTGAAC GCCACCGGTG CCGCGGCCGC GGCGACCGTG
GAGTTCGCGG ACGAGGCGCT GCGCCGCCTC GCGGCCGCCG TCGGCGCGCA GGGCACCACG
ATGCCGCCGC TCGCTGCCGT GGAGCTCGGG CACGGCAAGT TGACCCTGCA CCTGAGCGCG
CCGGCCGCAG TCCCGGCGCC CTGGGTAGGC AGCACGGACC AGACCCACTG GCAGGTCAGC
ACGGACACCG CCGTCGACGA GCTCGGCCCC GACACCGGCA ACGTCGAGCC GCCCTACCCG
CTACTGGCCA CAATCGGCAT GAGCGACACA GGGGAGACGT GGCTGCTCAA TTGCGAGGAG
CTGTCCACAC TCACCATTAG CGGCGACCCG ACCTACGGCC GGGACTTCGC CCGCCACCTC
GCCGCGCAGC TCGCCGTCAA CCCGTGGTCG CGCCGCGTCC AGGTCGACTG CATCGGCGTG
GCTGAGGAAG CCGTCGCCAT GGACGAGCGG ATCAGCTACT ACCCGACCGG ATCAGCCGGG
TCACCCGCGA CCGCGGAGGC CCTCGCGGCC GCCGTCACCA CTGTCGACCG GGCGAAGCGA
CATGACACCG ACGCGTCCAC GGCGCGCACC GGCAGCGTCG ACGACGACAC CTGGCCCGCC
CGGATGCTGT TGCTCGACGC GGCGGCCGGA GACCCCGAGG ACCTTGAGCA ACTGCTGCAG
CTGGTCACCA ACCACGTCGG GCAGTCCGCC ACCTCCATCG TCGTCGCCGG CGAGCGTCCC
AACACACCGG GCGCCGTCCT GCACATGACG AACACCGGCC GCGTCGTCCT CGAGCACGCC
GGCCTCAACC TCATCGCCGT CGGCCTCACC AGCGACGAAG CCCGCGGCTG CGCCCTGGTC
TACGCACAGA GCGAGGTCGC CGAGGACGTC CCCGTGCCGG TCGACGAGAC CGTCACCGAC
GGCTGGGAGG CCTACACCGA CCAGTCCGGG GCACTGCGCC GCGAGTACAC CCTGCCGCGC
AACACCCCCG CGGACACCGT CGACGAGCCG CTGTCCTCCC TCCTGGAGGG CGACGACGAG
GACTACATCC GTCAGAGCGC GATCGTGCAG GAGGACCTCG AGACGCTGGC GCCGAAGGTG
CCCGAGCACG TCCGAGCCGA GGTCGAGCAG AGCGACCCCA CTCTCGACCA GGACATCGCC
GACTGGTTCT CGACCAACAG CGACCGTCCA CGGCTCAGCC TGCTCGGTCC GGTCACCGCG
CGCACCCACG GCAAGGCCCT GGCCAAGCGC AAGCCCTACT TCACCGAGCT CCTCGCGTAC
TTGGCCCTGC ACCGCAAGCA CGGAGCCACC CGCGAGGAGA TCGGCGAGGC ATTCGGGATC
ACCCCCGGCA AGGTGCGCGA CTACACCAAC ACCGTCCGCG AATGGCTGGG CACCAACCCC
ACAACTGGAG AGCCCCACCT GCCCCACGCC GACAAGGCCC CTGCCACCAA GCTCCGCGGC
GTCAACGTCT ACCAGGTCGA CGACGGCCTC CTGGTCGACC TGCACCTCTT CCTCCGACTG
CGCAAGCGCG GGCAGGCCCG GGGCGGCGCG GAAGGCGTCG CCGACCTCTG CACCGCGCTC
GAGCTCGTGG GCGAGGCGAA GCCGTTCAGC CAGCTGCGTG AAGAGGGATG GTCTTGGCTC
GTCAACGAGC CCGACCGCGT CGACCTCATG GCTTCTGGCT GGATCGCCGA CGTCGCCCTC
ATCGTCGTCA CCGAGGCTCT CGCCGCAGGC GACCTGGTCA AGGCCCGCTC CGTCGCCTAC
GTCGCCAACC GGGCCGACCC TGACGGCGAG AGCACCCGCC TGTGCCTGGC CCACGTCATG
AAGGCCGAGG GCGACCAGCT CGAGGCCGAC CGGATCCTCC GCGAGGAGAT CTGCAACCGG
TCCGATGACG GCGACGCCCC CATGGAACTG TCGGAGCGCA CTAAGACCAT CATCAGTACC
CACGGCTGGC TCGCGAGCTG A
 
Protein sequence
MTTPTHPTLG QRLTGLAASI AVLGIVLGLP ALFLAIGASP IPDHVPTLDG IKNALMAPDD 
GTLVLGLFKV IGWVAWAFMA LSLVVEAIAR LRKVQAPQLP GLGRPQAAAR GLIGLAALLF
IAAPIAAQAA TITTAAAAPV TVGPVNAGAA DQAPAQQDVK VETKQERQRK TVDHAVKPGE
SLWSIAENHF GDGARYKELV ELNRDLLGAQ PSFLEPGWVL KLPAVNGGAP AHDYTVQPTD
TLSEIAQEQL GDADRWPEIY RASSGITQPG GARLTDPDVI EVGWKLNIPG AEQGGSHDDT
RHQQPRDVEP ETPASPEDQR PVDPPAEEEP PVAHEPEVPE VPETAVPDAP PQAEAPASPA
ADVDQVDDAD DSILDAPWVL AGLTGGGVLL SGALLMALRS RRRAGYRNRT PGRAIAAPPS
ELAPVEMTLN ATGAAAAATV EFADEALRRL AAAVGAQGTT MPPLAAVELG HGKLTLHLSA
PAAVPAPWVG STDQTHWQVS TDTAVDELGP DTGNVEPPYP LLATIGMSDT GETWLLNCEE
LSTLTISGDP TYGRDFARHL AAQLAVNPWS RRVQVDCIGV AEEAVAMDER ISYYPTGSAG
SPATAEALAA AVTTVDRAKR HDTDASTART GSVDDDTWPA RMLLLDAAAG DPEDLEQLLQ
LVTNHVGQSA TSIVVAGERP NTPGAVLHMT NTGRVVLEHA GLNLIAVGLT SDEARGCALV
YAQSEVAEDV PVPVDETVTD GWEAYTDQSG ALRREYTLPR NTPADTVDEP LSSLLEGDDE
DYIRQSAIVQ EDLETLAPKV PEHVRAEVEQ SDPTLDQDIA DWFSTNSDRP RLSLLGPVTA
RTHGKALAKR KPYFTELLAY LALHRKHGAT REEIGEAFGI TPGKVRDYTN TVREWLGTNP
TTGEPHLPHA DKAPATKLRG VNVYQVDDGL LVDLHLFLRL RKRGQARGGA EGVADLCTAL
ELVGEAKPFS QLREEGWSWL VNEPDRVDLM ASGWIADVAL IVVTEALAAG DLVKARSVAY
VANRADPDGE STRLCLAHVM KAEGDQLEAD RILREEICNR SDDGDAPMEL SERTKTIIST
HGWLAS