Gene Shewmr4_0030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0030 
Symbol 
ID4250716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp34721 
End bp35845 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content48% 
IMG OID638116569 
Productpeptidoglycan-binding LysM 
Protein accessionYP_732169 
Protein GI113968376 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.772252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00540943 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACCGCA CCATGAAACG GCTAATTTTA CTTGCGTTAA TGACATTTAA TTGCACGTTG 
GTTTCCGCTG ATACTCTCAC GCTGAAAGCA GGGCATCCCG AGTCATATGT GGTAAAGAAG
GGCGATACCC TTTGGGATAT CTCGGCCACC TTTTTAAATG ACCCTTGGAA GTGGACACGT
TTATGGGATG TTAACCCCCA AATTGCTAAC CCTCATTTAA TTTATCCCGG CGATCAGCTC
ACTTTAGTCT TTATCGATGG TCAACCCCGA TTAGTGCGTA ATGGCGCTAA CGAGGGAAAA
CCCCATATTC GCAAAACCCC TGAGGGACGC GTGATTGCTA AGAGCAATGC GGTGCCTGCG
GTTGATTTAG CGTTAATCCA AAACTATCTG GTGCAAAACC GTGTGGTCGA TGCCGACTGG
TTTGCACAGC AACCTATGGT GCTCGCGGGT GAAAGTCCTT CACGTCACCA TGTGGTTGGC
GATGTGATTT ATATCGATAG CGAACTGCCT TTAAACCAAA AGCTGGGTAT GTATGAACGC
GGTCGTGACT TCTTCAACAA ACAAACGGGC GAAGCCTTAG GACAAGAAGC GATTCTGGCG
TCTACCGGCC AAGTTATTGA ATCAGGCAAA GTGTCTAAGG TTAAAATCCT CAGTAACTAC
CGTGAAACCA AGGCGGGTTT TAGGGTGCTA CCTATGGAAG ATGAAGCCTT AATGTCAGCC
TATTTTACGC CAAAACCTGC TGAGATTAAG ACGCCGGCGA CCGTACTAGC TATTGAGTCG
AAAATGCGTG AGGCGGGTAA GCTCAATGTG GTGTACCTAG ATAAAGGCAC GCAGGATGGT
GTTGAGCCGG GTGAAGTGTT CTCCATTTAC CGCGATGGCG AAGAGATAGT GATTAATAAC
GATGGCCAAC CCGTACCTAC GACCGAGCGC ACCGCCTATG ACAATGTGGT GGCGTCACTG
TCGTCAGATC GCGCCATCAA GATGCCCGAT ATTTACCATG GCAAACTCTT AGTCTTTAAA
GTGTTTGATA AAGCGAGCTT AGGTTTGATT GTCTCGACTG AACGTTCTGT ACGTGTCGAC
GATAAATTAA TTGCGCCAGA CTCCTTAGCC TTTAGAGGTG AATAA
 
Protein sequence
MDRTMKRLIL LALMTFNCTL VSADTLTLKA GHPESYVVKK GDTLWDISAT FLNDPWKWTR 
LWDVNPQIAN PHLIYPGDQL TLVFIDGQPR LVRNGANEGK PHIRKTPEGR VIAKSNAVPA
VDLALIQNYL VQNRVVDADW FAQQPMVLAG ESPSRHHVVG DVIYIDSELP LNQKLGMYER
GRDFFNKQTG EALGQEAILA STGQVIESGK VSKVKILSNY RETKAGFRVL PMEDEALMSA
YFTPKPAEIK TPATVLAIES KMREAGKLNV VYLDKGTQDG VEPGEVFSIY RDGEEIVINN
DGQPVPTTER TAYDNVVASL SSDRAIKMPD IYHGKLLVFK VFDKASLGLI VSTERSVRVD
DKLIAPDSLA FRGE