Gene TM1040_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0149 
Symbol 
ID4078816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp163129 
End bp165096 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content63% 
IMG OID638005443 
Productlytic transglycosylase, catalytic 
Protein accessionYP_612144 
Protein GI99079990 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0831767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGCA GACTCGCCTG CTCATTGCTG ATTTGCGCGA CCCTTTTTGG CGCGCCCGCC 
CTAGCGCGCG GGCTGGACAC CGCCCTGCCC CTGATGCGCG ACGGACAATG GCAGGAGGCG
CTGATCGATG CAGGAAGCGA AGGCTCGATC CAGCGTGACA TCATCGAGTG GCATCGCCTG
CGCGCGGGTG AAGGCACGGC ACGCGATGTA TTGTCCTTCA TCGACCGCCG TCCCGACTGG
CCCGGCATGG ACTATCTGCG CCGCCAGAGC GAAGAGACTC TTGCCAAGGC CGGACACTCC
GCCATCCTCG CGTTTTATCG CGATATGCCC TCGGCCCAGA CCGCCGAAGG CGCGCTCAGC
CTCGGGGAAG CGCTGATCGA GGCCGGGCGC ACAGGCGAAG GCCAGGCAGA AATCGTGCGC
GCTTGGCGTA GCATGGCGAT GCCGCAGGAG GTCCACGACG CCTATCTAGC GCGACACAAA
ACCCTTTTGG CCGCACACCA CGAAGCACGC CTCAGCCGTT TGCTTTGGGA CGGGCACAAA
GTGTCCTCGC GTCGGATGCT GGAGCTTGTG AGCGATGGTG CGCGCAAACT GGCGGAAGCG
CGCATTGCCT TGCAGGAGCA GGCCAACGGC GTAGACGCGG CAATTGCTGC AGTGCCGGAT
GCGCTCAAGG ACGATGCAGG GCTGGCCTAT GACCGTTTCG CCTGGCGCGA CGCAAAGCGA
CGCCAGGATG ATGCGATTGC CATGATGTTT GAACGCTCCA CCTCTGCCGA GGCGCTGGGG
GAGCCGGCAA AATGGCTGCG CAGACGTGCG GATTTCGCGC GTCAGCTGAT GCGCGACAAT
GACAATACCC GCGCCTATAA GCTCGCGGCC TATCACTTTG CCACGCCCGA TGCTGGCTAT
GGCTACGCCG ACAACGAATG GATTGCGGGC TATGTCGCGT TGCGCAAGCT CGACGATGCG
GAACTTGCCG TCTATCATTT CACGCGTTTC CTTGCCGCGG TTGAAAGCCC GATCTCCGTG
GGCCGTGCCG GATATTGGCT CGGGCGGGCC TATGCGCAGC TCGGTGAGAT CGACAAGGCT
CACGCGGCCT ATCGGCTGGG GGCGAAATAT CAAAGCTCCT ATTACGGGCT GTTCTCTGCC
CAGGCGCTTG GGCGCAGCTT CGATCCGCGC CTGACCCGTC CAGAGGCCCC GAACTGGCGC
AACGCCCCAT TTTTGCAGTC CTCAGTCTAT GCCGCAGGCA TTGCCTTGCT GGAGGCCGGT
GAGGTGTCGC TTGCAGAGCG GTTCCTCACT CATCTCGTGG AAAGCCTCCC CCCGGATCAG
GCGTTGCAAC TGGGGCAGAT GGCTGTCGAC ATGAAACTGC CGCATCTGGC CGTGATGATC
TCCAAGCGAG CTGCTCAGGA CGGATTGGAG CTTGCGGGTG CCTATTACCC GCTGCATCCG
GTTACCGACC TCGACCTCCC CATGGCGCCC GAGATGGTTC TGGCGATTGC CCGCCGCGAG
AGCGAATTTG ACCCTGTGGT CGTAAGCCAT GCGGGCGCGC GCGGGTTGAT GCAAGTGATG
CCCGGCACCG CAGAACTGGT CGCCAAGGGC CTCGGCATTC TGGAGGAGCA CGCGGTCGCG
CGCCTCACCT CGGACTGGCA ATATAACGCA AAGCTGGGCG CCACCTACCT TGCGGCGCTG
GCCAAGGAAT TCGACGGCAA CGTGGTCCTG ATGGCGGCGG GCTACAACGC GGGGCCGCAC
CGTCCGATCG CCTGGATGGA ACGCTATGGT GATCCCCGTG ATCCAAAGGT CGATGTGATC
GACTGGATCG AGCACATCCC CTTTAATGAA ACCCGCAATT ACGTCATGCG CGTGACCGAG
AGCCTGCCGG TTTATCGAGC GCGGCTTGGA GAAGCGCCTC TGCCCGTGCC CTTCTCACGC
GAACTATCGG GCCGCACGCT TCAGGCTTTC GCGCCAAAAG GTGAATAG
 
Protein sequence
MTRRLACSLL ICATLFGAPA LARGLDTALP LMRDGQWQEA LIDAGSEGSI QRDIIEWHRL 
RAGEGTARDV LSFIDRRPDW PGMDYLRRQS EETLAKAGHS AILAFYRDMP SAQTAEGALS
LGEALIEAGR TGEGQAEIVR AWRSMAMPQE VHDAYLARHK TLLAAHHEAR LSRLLWDGHK
VSSRRMLELV SDGARKLAEA RIALQEQANG VDAAIAAVPD ALKDDAGLAY DRFAWRDAKR
RQDDAIAMMF ERSTSAEALG EPAKWLRRRA DFARQLMRDN DNTRAYKLAA YHFATPDAGY
GYADNEWIAG YVALRKLDDA ELAVYHFTRF LAAVESPISV GRAGYWLGRA YAQLGEIDKA
HAAYRLGAKY QSSYYGLFSA QALGRSFDPR LTRPEAPNWR NAPFLQSSVY AAGIALLEAG
EVSLAERFLT HLVESLPPDQ ALQLGQMAVD MKLPHLAVMI SKRAAQDGLE LAGAYYPLHP
VTDLDLPMAP EMVLAIARRE SEFDPVVVSH AGARGLMQVM PGTAELVAKG LGILEEHAVA
RLTSDWQYNA KLGATYLAAL AKEFDGNVVL MAAGYNAGPH RPIAWMERYG DPRDPKVDVI
DWIEHIPFNE TRNYVMRVTE SLPVYRARLG EAPLPVPFSR ELSGRTLQAF APKGE