Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0979 |
Symbol | |
ID | 4078141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1045222 |
End bp | 1047177 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638006282 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_612974 |
Protein GI | 99080820 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAGA CAAGCGGGAT AGGCGCGGGG GCCAGTTTGG CCATTGGCAC CGTCGCCACG GTGGTGGTCG TTGGAGGCGG GGTCTTTCTC GCACGGGGTG GGATCTTGGG CGAAGGGGCC AGATCCATGG TCGAGCAGCA GCTGGTGGCC CTGGGCCTTG CAGCGCCGCC GGCGCCCGAA GTGGTGCCCG TGAAGCCTGT GGTGACACAG CCGCAGACGG CCGATCCCGA GACGCGCGTG GTCGAGCCTG AGCCGACGCC CGAAGCCACG GCTGGCGAGA CGGTAGAAAC GCAACAGGAC GCACCAACCG CAGAGCCCGC TTTTGTACTG CAGGCGCCCA AGCTGGAGAT CGCCCGGTTT GAGCCAGACG GCTCCGGTAT CGTAGCGGCG TCCGCTCAGG CGGGGGTCGA GGTGCAGGTG CTTCTTGACG ATGAGGTTCT CGATACGCAA ACCGTGCCCG CTGCGGGGGA GTTCGTGTCC TTTGTGACCA TCGACCTCAG TGACAAGCCG CGGCTGCTGA CGCTGCTGGC GCGCCACAAC GGGCAGGAGC TGGCCTCGGA AGACAGCTTT ATCCTTGCGC CGATGCCCGC GCCCGCCGCG CCGGAACCGC AGGTCGATCA GCTTGCCGCG GCGCAGACCG ATTCTGATAT CGCTGCCCCC GAGGAGGAGC CAATTGAGCT CGCCGAGGCA ACCGAAACCG CCGATCCGAA TGTGGCAGAT CAGGCGACTG ATGCGCCAGA TCCAGACGCG CCAGGCGACG GCGCAGCGGA GGGGAGCACA ACGGTGACGG CGCAGTCCGA AGAGGTCGCA TTGGCTGATG TCGCAGTCGA TAGCACCGAT CCGGACGCGG AGGGCGATGC CTCCTCGACG GAGTCGGCTG CGAGTGGTGC CGCTGACAAT GGAGTGGCAA CTGATATGGC CGCCGTCGAA AACACCGGTG ATCAACTCCC CGATGCTGCC TCTGAGGCCG TATCTGAAGC GTCACCCGAA GCGGTAGACC CCTCTGTAGA TGTGGCCGAG GCCACCAGTG CTTTGCCGGA GACCGAGGTC ACAGCGGAAG ACGCGCCCGC AGCAGAGGCA CCGGAAGAGA CAGTCGAGAC CGCCGCCTTG GAACAGGCGA TCGACGACGA GTCCGCTCGC GAATCTTCTG AAGACTCGGT CCCGGCTCCT GAGCCTGAGG TGGCAGCTGT TGCAGACACA TCAGAGCCGC CCGCTCCGGA CACGACACCT GCACCGCAGT CCCCGGTCGA GGTTGCCGAG GCCGTTGACA CACCCGAGGT GCCATCTTCC AAGACGGACA CAATGGCTGC CGTAGAAGAG GTGCAGCAGC CGCAGCCGCA AGACCCCGAC ACAGAGAGCC CCTCTGGCGA GGCGGCTCCC GCGCCGCAGG CGACTTCCTC TGTCGCGGTG CTGCGCGCTG GTCGCGATGG GGTGACGCTG GTTCAACCTG CGGCCCCAGC CGCACCAGAG CTGGTGGGCA AGGTGGCGCT CGATACGATC AGCTACACCG AGACGGGCGA TGTTCAGCTT GCGGGACGGG CCAGGCCCGA GGCCCTGGTG CGTGTCTACC TCGACAACAG CCCTGTGGCC GAGCTTGCCG CCGCGTCCGA TGGTCAATGG AGCGGCAGCC TCACCTCGGT GGCGCCGGGG ATCTACACCC TGCGCCTTGA TGAGATCGAC CCTGTTGACG GTATCGTCCT GAGCCGCCTT GAGACCCCGT TCAAACGCGA GGCTCCAGAG GTCCTGCAGC CTGCGGTGAC GGCGGATCAG GCGCCAGATC AGGCTGCGCC TGTGGTGCGC GCCGTGACGG TGCAGGAAGG CGATACGCTC TGGGCGATTT CCCAGCAGCG CTATGGCAGC GGTTTTCTCT ATGTGCGGGT GTTTGAGGCC AACAAGGGCG ATATCCGCGA TCCAGACCTG ATCTACCCTG GTCAGATCTT CACTCTGCCC GAGTAA
|
Protein sequence | MTKTSGIGAG ASLAIGTVAT VVVVGGGVFL ARGGILGEGA RSMVEQQLVA LGLAAPPAPE VVPVKPVVTQ PQTADPETRV VEPEPTPEAT AGETVETQQD APTAEPAFVL QAPKLEIARF EPDGSGIVAA SAQAGVEVQV LLDDEVLDTQ TVPAAGEFVS FVTIDLSDKP RLLTLLARHN GQELASEDSF ILAPMPAPAA PEPQVDQLAA AQTDSDIAAP EEEPIELAEA TETADPNVAD QATDAPDPDA PGDGAAEGST TVTAQSEEVA LADVAVDSTD PDAEGDASST ESAASGAADN GVATDMAAVE NTGDQLPDAA SEAVSEASPE AVDPSVDVAE ATSALPETEV TAEDAPAAEA PEETVETAAL EQAIDDESAR ESSEDSVPAP EPEVAAVADT SEPPAPDTTP APQSPVEVAE AVDTPEVPSS KTDTMAAVEE VQQPQPQDPD TESPSGEAAP APQATSSVAV LRAGRDGVTL VQPAAPAAPE LVGKVALDTI SYTETGDVQL AGRARPEALV RVYLDNSPVA ELAAASDGQW SGSLTSVAPG IYTLRLDEID PVDGIVLSRL ETPFKREAPE VLQPAVTADQ APDQAAPVVR AVTVQEGDTL WAISQQRYGS GFLYVRVFEA NKGDIRDPDL IYPGQIFTLP E
|
| |