Gene TM1040_0979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0979 
Symbol 
ID4078141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1045222 
End bp1047177 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content66% 
IMG OID638006282 
Productpeptidoglycan-binding LysM 
Protein accessionYP_612974 
Protein GI99080820 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAGA CAAGCGGGAT AGGCGCGGGG GCCAGTTTGG CCATTGGCAC CGTCGCCACG 
GTGGTGGTCG TTGGAGGCGG GGTCTTTCTC GCACGGGGTG GGATCTTGGG CGAAGGGGCC
AGATCCATGG TCGAGCAGCA GCTGGTGGCC CTGGGCCTTG CAGCGCCGCC GGCGCCCGAA
GTGGTGCCCG TGAAGCCTGT GGTGACACAG CCGCAGACGG CCGATCCCGA GACGCGCGTG
GTCGAGCCTG AGCCGACGCC CGAAGCCACG GCTGGCGAGA CGGTAGAAAC GCAACAGGAC
GCACCAACCG CAGAGCCCGC TTTTGTACTG CAGGCGCCCA AGCTGGAGAT CGCCCGGTTT
GAGCCAGACG GCTCCGGTAT CGTAGCGGCG TCCGCTCAGG CGGGGGTCGA GGTGCAGGTG
CTTCTTGACG ATGAGGTTCT CGATACGCAA ACCGTGCCCG CTGCGGGGGA GTTCGTGTCC
TTTGTGACCA TCGACCTCAG TGACAAGCCG CGGCTGCTGA CGCTGCTGGC GCGCCACAAC
GGGCAGGAGC TGGCCTCGGA AGACAGCTTT ATCCTTGCGC CGATGCCCGC GCCCGCCGCG
CCGGAACCGC AGGTCGATCA GCTTGCCGCG GCGCAGACCG ATTCTGATAT CGCTGCCCCC
GAGGAGGAGC CAATTGAGCT CGCCGAGGCA ACCGAAACCG CCGATCCGAA TGTGGCAGAT
CAGGCGACTG ATGCGCCAGA TCCAGACGCG CCAGGCGACG GCGCAGCGGA GGGGAGCACA
ACGGTGACGG CGCAGTCCGA AGAGGTCGCA TTGGCTGATG TCGCAGTCGA TAGCACCGAT
CCGGACGCGG AGGGCGATGC CTCCTCGACG GAGTCGGCTG CGAGTGGTGC CGCTGACAAT
GGAGTGGCAA CTGATATGGC CGCCGTCGAA AACACCGGTG ATCAACTCCC CGATGCTGCC
TCTGAGGCCG TATCTGAAGC GTCACCCGAA GCGGTAGACC CCTCTGTAGA TGTGGCCGAG
GCCACCAGTG CTTTGCCGGA GACCGAGGTC ACAGCGGAAG ACGCGCCCGC AGCAGAGGCA
CCGGAAGAGA CAGTCGAGAC CGCCGCCTTG GAACAGGCGA TCGACGACGA GTCCGCTCGC
GAATCTTCTG AAGACTCGGT CCCGGCTCCT GAGCCTGAGG TGGCAGCTGT TGCAGACACA
TCAGAGCCGC CCGCTCCGGA CACGACACCT GCACCGCAGT CCCCGGTCGA GGTTGCCGAG
GCCGTTGACA CACCCGAGGT GCCATCTTCC AAGACGGACA CAATGGCTGC CGTAGAAGAG
GTGCAGCAGC CGCAGCCGCA AGACCCCGAC ACAGAGAGCC CCTCTGGCGA GGCGGCTCCC
GCGCCGCAGG CGACTTCCTC TGTCGCGGTG CTGCGCGCTG GTCGCGATGG GGTGACGCTG
GTTCAACCTG CGGCCCCAGC CGCACCAGAG CTGGTGGGCA AGGTGGCGCT CGATACGATC
AGCTACACCG AGACGGGCGA TGTTCAGCTT GCGGGACGGG CCAGGCCCGA GGCCCTGGTG
CGTGTCTACC TCGACAACAG CCCTGTGGCC GAGCTTGCCG CCGCGTCCGA TGGTCAATGG
AGCGGCAGCC TCACCTCGGT GGCGCCGGGG ATCTACACCC TGCGCCTTGA TGAGATCGAC
CCTGTTGACG GTATCGTCCT GAGCCGCCTT GAGACCCCGT TCAAACGCGA GGCTCCAGAG
GTCCTGCAGC CTGCGGTGAC GGCGGATCAG GCGCCAGATC AGGCTGCGCC TGTGGTGCGC
GCCGTGACGG TGCAGGAAGG CGATACGCTC TGGGCGATTT CCCAGCAGCG CTATGGCAGC
GGTTTTCTCT ATGTGCGGGT GTTTGAGGCC AACAAGGGCG ATATCCGCGA TCCAGACCTG
ATCTACCCTG GTCAGATCTT CACTCTGCCC GAGTAA
 
Protein sequence
MTKTSGIGAG ASLAIGTVAT VVVVGGGVFL ARGGILGEGA RSMVEQQLVA LGLAAPPAPE 
VVPVKPVVTQ PQTADPETRV VEPEPTPEAT AGETVETQQD APTAEPAFVL QAPKLEIARF
EPDGSGIVAA SAQAGVEVQV LLDDEVLDTQ TVPAAGEFVS FVTIDLSDKP RLLTLLARHN
GQELASEDSF ILAPMPAPAA PEPQVDQLAA AQTDSDIAAP EEEPIELAEA TETADPNVAD
QATDAPDPDA PGDGAAEGST TVTAQSEEVA LADVAVDSTD PDAEGDASST ESAASGAADN
GVATDMAAVE NTGDQLPDAA SEAVSEASPE AVDPSVDVAE ATSALPETEV TAEDAPAAEA
PEETVETAAL EQAIDDESAR ESSEDSVPAP EPEVAAVADT SEPPAPDTTP APQSPVEVAE
AVDTPEVPSS KTDTMAAVEE VQQPQPQDPD TESPSGEAAP APQATSSVAV LRAGRDGVTL
VQPAAPAAPE LVGKVALDTI SYTETGDVQL AGRARPEALV RVYLDNSPVA ELAAASDGQW
SGSLTSVAPG IYTLRLDEID PVDGIVLSRL ETPFKREAPE VLQPAVTADQ APDQAAPVVR
AVTVQEGDTL WAISQQRYGS GFLYVRVFEA NKGDIRDPDL IYPGQIFTLP E