Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2229 |
Symbol | |
ID | 4598727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2374124 |
End bp | 2377384 |
Gene Length | 3261 bp |
Protein Length | 1086 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639776829 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_923422 |
Protein GI | 119716457 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGC CCACCCATCC CACCCTGGGC CAGCGGCTCA CCGGCCTCGC AGCCTCGATC GCCGTACTCG GCATCGTCCT CGGGCTGCCC GCACTATTCC TCGCGATCGG TGCCAGCCCA ATCCCGGACC ACGTCCCGAC CCTGGACGGC ATCAAGAACG CGCTCATGGC GCCCGACGAC GGCACCCTCG TCCTCGGCCT GTTCAAGGTG ATCGGCTGGG TCGCGTGGGC CTTCATGGCG CTGAGCCTGG TCGTGGAGGC CATCGCACGG CTCCGCAAGG TCCAGGCGCC CCAGCTGCCG GGCCTGGGCC GGCCGCAGGC CGCAGCACGC GGCCTCATCG GCCTCGCCGC TCTGCTGTTC ATCGCCGCGC CCATCGCCGC CCAGGCAGCC ACGATCACCA CGGCCGCCGC GGCGCCGGTG ACGGTGGGAC CTGTCAACGC CGGTGCCGCC GACCAGGCAC CCGCCCAGCA GGACGTCAAG GTCGAGACGA AGCAGGAGCG CCAGCGCAAG ACCGTCGACC ACGCTGTGAA GCCGGGGGAG AGCCTGTGGT CGATCGCCGA GAACCACTTC GGCGACGGTG CCCGCTACAA GGAGCTCGTC GAACTCAACC GCGACCTCCT CGGCGCGCAG CCGAGCTTCC TCGAGCCCGG TTGGGTTCTC AAGCTGCCCG CGGTCAACGG CGGGGCACCG GCGCACGACT ACACGGTGCA GCCCACCGAC ACCCTCAGCG AGATCGCCCA AGAGCAGCTC GGCGACGCCG ACCGCTGGCC CGAGATCTAC CGGGCCTCCA GCGGCATCAC CCAGCCCGGC GGGGCCCGGC TGACCGACCC CGACGTCATC GAGGTCGGCT GGAAGCTCAA CATCCCCGGC GCCGAGCAGG GAGGCAGTCA CGACGACACC CGGCACCAGC AGCCGCGCGA CGTCGAGCCC GAGACGCCCG CCAGCCCCGA GGACCAGCGG CCCGTCGACC CGCCGGCCGA GGAGGAGCCG CCGGTCGCCC ACGAGCCTGA GGTGCCCGAG GTACCCGAGA CGGCCGTGCC CGACGCGCCG CCCCAAGCCG AGGCGCCGGC GTCGCCGGCC GCCGACGTCG ACCAGGTCGA CGACGCTGAC GACTCGATCC TCGACGCGCC GTGGGTCCTC GCCGGTCTGA CCGGCGGGGG CGTCCTGCTG TCCGGGGCGC TGCTCATGGC CCTGCGGTCC CGCCGCCGCG CCGGCTACCG GAACCGGACG CCCGGCCGCG CGATCGCCGC GCCGCCTTCG GAGCTCGCGC CGGTCGAGAT GACCCTGAAC GCCACCGGTG CCGCGGCCGC GGCGACCGTG GAGTTCGCGG ACGAGGCGCT GCGCCGCCTC GCGGCCGCCG TCGGCGCGCA GGGCACCACG ATGCCGCCGC TCGCTGCCGT GGAGCTCGGG CACGGCAAGT TGACCCTGCA CCTGAGCGCG CCGGCCGCAG TCCCGGCGCC CTGGGTAGGC AGCACGGACC AGACCCACTG GCAGGTCAGC ACGGACACCG CCGTCGACGA GCTCGGCCCC GACACCGGCA ACGTCGAGCC GCCCTACCCG CTACTGGCCA CAATCGGCAT GAGCGACACA GGGGAGACGT GGCTGCTCAA TTGCGAGGAG CTGTCCACAC TCACCATTAG CGGCGACCCG ACCTACGGCC GGGACTTCGC CCGCCACCTC GCCGCGCAGC TCGCCGTCAA CCCGTGGTCG CGCCGCGTCC AGGTCGACTG CATCGGCGTG GCTGAGGAAG CCGTCGCCAT GGACGAGCGG ATCAGCTACT ACCCGACCGG ATCAGCCGGG TCACCCGCGA CCGCGGAGGC CCTCGCGGCC GCCGTCACCA CTGTCGACCG GGCGAAGCGA CATGACACCG ACGCGTCCAC GGCGCGCACC GGCAGCGTCG ACGACGACAC CTGGCCCGCC CGGATGCTGT TGCTCGACGC GGCGGCCGGA GACCCCGAGG ACCTTGAGCA ACTGCTGCAG CTGGTCACCA ACCACGTCGG GCAGTCCGCC ACCTCCATCG TCGTCGCCGG CGAGCGTCCC AACACACCGG GCGCCGTCCT GCACATGACG AACACCGGCC GCGTCGTCCT CGAGCACGCC GGCCTCAACC TCATCGCCGT CGGCCTCACC AGCGACGAAG CCCGCGGCTG CGCCCTGGTC TACGCACAGA GCGAGGTCGC CGAGGACGTC CCCGTGCCGG TCGACGAGAC CGTCACCGAC GGCTGGGAGG CCTACACCGA CCAGTCCGGG GCACTGCGCC GCGAGTACAC CCTGCCGCGC AACACCCCCG CGGACACCGT CGACGAGCCG CTGTCCTCCC TCCTGGAGGG CGACGACGAG GACTACATCC GTCAGAGCGC GATCGTGCAG GAGGACCTCG AGACGCTGGC GCCGAAGGTG CCCGAGCACG TCCGAGCCGA GGTCGAGCAG AGCGACCCCA CTCTCGACCA GGACATCGCC GACTGGTTCT CGACCAACAG CGACCGTCCA CGGCTCAGCC TGCTCGGTCC GGTCACCGCG CGCACCCACG GCAAGGCCCT GGCCAAGCGC AAGCCCTACT TCACCGAGCT CCTCGCGTAC TTGGCCCTGC ACCGCAAGCA CGGAGCCACC CGCGAGGAGA TCGGCGAGGC ATTCGGGATC ACCCCCGGCA AGGTGCGCGA CTACACCAAC ACCGTCCGCG AATGGCTGGG CACCAACCCC ACAACTGGAG AGCCCCACCT GCCCCACGCC GACAAGGCCC CTGCCACCAA GCTCCGCGGC GTCAACGTCT ACCAGGTCGA CGACGGCCTC CTGGTCGACC TGCACCTCTT CCTCCGACTG CGCAAGCGCG GGCAGGCCCG GGGCGGCGCG GAAGGCGTCG CCGACCTCTG CACCGCGCTC GAGCTCGTGG GCGAGGCGAA GCCGTTCAGC CAGCTGCGTG AAGAGGGATG GTCTTGGCTC GTCAACGAGC CCGACCGCGT CGACCTCATG GCTTCTGGCT GGATCGCCGA CGTCGCCCTC ATCGTCGTCA CCGAGGCTCT CGCCGCAGGC GACCTGGTCA AGGCCCGCTC CGTCGCCTAC GTCGCCAACC GGGCCGACCC TGACGGCGAG AGCACCCGCC TGTGCCTGGC CCACGTCATG AAGGCCGAGG GCGACCAGCT CGAGGCCGAC CGGATCCTCC GCGAGGAGAT CTGCAACCGG TCCGATGACG GCGACGCCCC CATGGAACTG TCGGAGCGCA CTAAGACCAT CATCAGTACC CACGGCTGGC TCGCGAGCTG A
|
Protein sequence | MTTPTHPTLG QRLTGLAASI AVLGIVLGLP ALFLAIGASP IPDHVPTLDG IKNALMAPDD GTLVLGLFKV IGWVAWAFMA LSLVVEAIAR LRKVQAPQLP GLGRPQAAAR GLIGLAALLF IAAPIAAQAA TITTAAAAPV TVGPVNAGAA DQAPAQQDVK VETKQERQRK TVDHAVKPGE SLWSIAENHF GDGARYKELV ELNRDLLGAQ PSFLEPGWVL KLPAVNGGAP AHDYTVQPTD TLSEIAQEQL GDADRWPEIY RASSGITQPG GARLTDPDVI EVGWKLNIPG AEQGGSHDDT RHQQPRDVEP ETPASPEDQR PVDPPAEEEP PVAHEPEVPE VPETAVPDAP PQAEAPASPA ADVDQVDDAD DSILDAPWVL AGLTGGGVLL SGALLMALRS RRRAGYRNRT PGRAIAAPPS ELAPVEMTLN ATGAAAAATV EFADEALRRL AAAVGAQGTT MPPLAAVELG HGKLTLHLSA PAAVPAPWVG STDQTHWQVS TDTAVDELGP DTGNVEPPYP LLATIGMSDT GETWLLNCEE LSTLTISGDP TYGRDFARHL AAQLAVNPWS RRVQVDCIGV AEEAVAMDER ISYYPTGSAG SPATAEALAA AVTTVDRAKR HDTDASTART GSVDDDTWPA RMLLLDAAAG DPEDLEQLLQ LVTNHVGQSA TSIVVAGERP NTPGAVLHMT NTGRVVLEHA GLNLIAVGLT SDEARGCALV YAQSEVAEDV PVPVDETVTD GWEAYTDQSG ALRREYTLPR NTPADTVDEP LSSLLEGDDE DYIRQSAIVQ EDLETLAPKV PEHVRAEVEQ SDPTLDQDIA DWFSTNSDRP RLSLLGPVTA RTHGKALAKR KPYFTELLAY LALHRKHGAT REEIGEAFGI TPGKVRDYTN TVREWLGTNP TTGEPHLPHA DKAPATKLRG VNVYQVDDGL LVDLHLFLRL RKRGQARGGA EGVADLCTAL ELVGEAKPFS QLREEGWSWL VNEPDRVDLM ASGWIADVAL IVVTEALAAG DLVKARSVAY VANRADPDGE STRLCLAHVM KAEGDQLEAD RILREEICNR SDDGDAPMEL SERTKTIIST HGWLAS
|
| |