Gene Smed_0815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0815 
Symbol 
ID5321652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp878050 
End bp879381 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content61% 
IMG OID640789752 
Productpeptidoglycan binding domain-containing protein 
Protein accessionYP_001326506 
Protein GI150396039 
COG category[S] Function unknown 
COG ID[COG2989] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.418141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.712241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAA AAAACGGAAT TGATGCTTTC TCGCGCCGCG CATTTCTGCG TTCGGCCGCA 
ACCTTCGGTG CCGCCGCATG GGCGGGCGCC GCCAGCGCCC AGGATGCGCT GAACGAAATC
ATCAATTCGC CGCGCCGCGG CTCCTGGGAC GACCAGTTCG ACGCAAAGGC TTCGCGTACG
GCTACGGCGG TACTGTCGAA CACGCCAGTC TTCGGCCCCG AAACGATTGC GCTTCTGCAG
CAGGCGATCC TGGATTACCA GCAGATCGCA GCCGCCGGAG GCTGGCCTAT GGTCAATCCC
GCGAGCACCC AAAGGCTCGA ACTCGGCGTT AACGATCCTG CCGTTCAGCA ATTGCGCCAG
CGTCTGATGA TCTCGGGCGA TCTGCCGCAA TCGGCGGGTA TTTCCCCTTC CTTCGATTCC
TATGTCGATG GTGCTGTCAA GCGCTTCCAG GCACGTCATG GCCTGCCGGC GGACGGCGTC
ATCGGCGAAT ACAGCCTGAA GGCTCTGAAC GTCGACGCCT CGACCCGGCT TGCCCAGCTC
GAGACCAACC TGGTGCGGCT TCAGTCGATG TCGGGCGATC TCGGCCGGCG CTACGTCATG
GTCAACATTC CGGCAGCCTA TATCGAGGCG GTGGAGAACG GCCGGGTGGC GCTGCGCCAT
ACGGCTATCG TCGGCAAGAT CGATCGCCAG TCGCCGATTC TCAATTCAAA GATTTACGAG
GTCATCCTCA ACCCCTATTG GACGGCGCCG CGCTCGATCA TCCAGAAAGA TATCATGCCG
CTGATGCGTA AGGATCCGAC TTATCTCGAG CGCAATGCGA TCCGTCTCCT CGACGGCAAT
GGCAACGAAG TGTCGCCGGA AACCATCGAC TGGCAGGCCG AGAAGGCGCC GAACCTTATG
TTCCGCCAGG ATCCCGGCAA GATCAACGCG ATGTCTTCGA CGAAGATCAA TTTCCATAAC
GAGCATGCCG TCTATATGCA CGACACCCCG CAGCAGGGCC TGTTCAACAA GCTGATGCGC
TTCGAATCCT CCGGTTGCGT CCGTGTGCAG AACGTTCGCG ATCTATCCAC CTGGCTGCTC
AAGGAAACGC CCGGCTGGTC GCGCCAGCAG ATCGAGGGTA CGATCAAGTC CGGCGTCAAC
ACGCCGATCA AGCTTGCCGA AGAGGTCCCG GTCTATTTCA CCTATATAAC CGCCTGGTCG
GCGAAGGACC GCGTCGTCCA GTTCCGCGAC GATATCTACC AGCGCGACGG CGCGGCGGAG
CTTGCGCTGC AGACGACGAC GGGCATCGAG CAATCGGCTG GTCCGATCGA CGCGGACGCC
TTGCCGCAAT AA
 
Protein sequence
MSKKNGIDAF SRRAFLRSAA TFGAAAWAGA ASAQDALNEI INSPRRGSWD DQFDAKASRT 
ATAVLSNTPV FGPETIALLQ QAILDYQQIA AAGGWPMVNP ASTQRLELGV NDPAVQQLRQ
RLMISGDLPQ SAGISPSFDS YVDGAVKRFQ ARHGLPADGV IGEYSLKALN VDASTRLAQL
ETNLVRLQSM SGDLGRRYVM VNIPAAYIEA VENGRVALRH TAIVGKIDRQ SPILNSKIYE
VILNPYWTAP RSIIQKDIMP LMRKDPTYLE RNAIRLLDGN GNEVSPETID WQAEKAPNLM
FRQDPGKINA MSSTKINFHN EHAVYMHDTP QQGLFNKLMR FESSGCVRVQ NVRDLSTWLL
KETPGWSRQQ IEGTIKSGVN TPIKLAEEVP VYFTYITAWS AKDRVVQFRD DIYQRDGAAE
LALQTTTGIE QSAGPIDADA LPQ