Gene Smed_4927 details


Gene Information

Locus tag: Smed_4927
Symbol:
ID: 5318243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1436117
End bp: 1437367
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 63%
IMG OID: 640776710
Product: peptidase S41
Protein accession: YP_001313642
Protein GI: 150377046
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID:
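
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the helper names span_length and protein_length_from_cds are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1436117
end_bp = 1437367
gene_length_bp = 1251
protein_length_aa = 416


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert protein_length_from_cds(gene_length_bp) == protein_length_aa
```

Under these assumptions the 1251 bp span corresponds to 417 codons, that is, 416 residues plus the terminal stop codon.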


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.966695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGCGA AGATGCCCCG GGTCCTGCTG GCCGGCTTGC TGTTTCTCCT TGTACCCAGC 
CAAGGTTTCG GCCTCGACCT GCCGCGCTCG AATCATCCTG TATTCGACCG GGTCGTCACG
CTTGTCGTCG ACAATTTCCA TGATCCCGCT GCTCTCGACG GTTTTGGCGC AGCGGTTCGG
CGGGAACTCG ATGACGGCCA ACCATTGAAC TCAGAAAGCC CGGGGCCCCG GGTCGATACC
GCTATCGAGA GGATCCTCGC AAGTCTCGAG GCCTCGCACA CGGCCCGCTT CAAGCAGGAT
ACGATCGCAT ATTACGAACT TGCCGACATA TTCCGGTTTG CTATTCGCCA CGACATGGAG
CGGCTGTTCC CGCCGGATGG CGCGGCGACC TATGCCGGGG TAGGCATGGT CACCCGGATG
GAAAACGGGC TGCGGTTCGT AAGCGACGTC TATGACGGCT CGCCTGCGGA CAAGGCCGGC
ATACGCGTTG GCGACGAAGT TCTTTCGGTG GATGGCGCGC CGTATCGCGA GATCGATTCC
TTTCGAGACA GGATCGGACG AACGGTGGAG ATCCGCCTTC GGCGCGAGGC GGACGCGCAA
CCTTTTAAGG CCACGGTCGC TGTGGAGCGC TTGCGGCCCT TGCCCACCTT CGAAAAGGCT
ATCGACAAAA GCGTCACCCT CCACGAAGAG GGGGGACGGA GCATCGGTTA TCTCCGTCTA
TGGACGCTTT CCAGTCCCGA TGGGCTCGAT ATCGTGGCGC GCGAACTTGC CACAGGTCGC
CTCAAGGATG CCGATGGCGT CATCGTCGAT CTGCGCGGCC GGTGGGGCGG CGGCCCACCC
GACGCGGCGG AGCTCTTCGT CGGCGACACG CCCAACTTCC GGCTGATTGC GCGCGGCAGC
AAGGACATGC TGGCAAATGT ACGCTGGCGC CGTCCGGTCG TCGCCATCAT AGACGAGGGC
TCTAGAAGCG GCCTCGAACT TTTCGCCTAT GCGCTGAAGT CCAATGGAAT CCCGCTTGTG
GGGACGCGCA CGGCAGGCGC TCTTCTTGCC GGCCGCGCCT ATCTCCTGCC GGACGACAGC
ATTCTGGAGC TTGCAGTATC GGACGCGGTG ATCGATGACG ATGTGCGACT GGAAGGGCGC
GGCGTCGAAC CGGACATTAT TGCTACCTTC CCCCTGCCTT ACGCCGCCGG CCGCGACCCG
CAGCGCGAAG CTGCCTTTGA AGAAATGCGG CGAACCCTTG CCAAAGACTG A
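
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before the sequence can be analysed. The sketch below shows one way to do that and to compute a GC fraction that could be compared against a GC content field such as the one in this record. The function names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (illustrative input only).
first_line = "ATGCACGCGA AGATGCCCCG GGTCCTGCTG GCCGGCTTGC TGTTTCTCCT TGTACCCAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to clean_sequence in the same way would give the input needed to check the complete gene.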
 
Protein sequence
MHAKMPRVLL AGLLFLLVPS QGFGLDLPRS NHPVFDRVVT LVVDNFHDPA ALDGFGAAVR 
RELDDGQPLN SESPGPRVDT AIERILASLE ASHTARFKQD TIAYYELADI FRFAIRHDME
RLFPPDGAAT YAGVGMVTRM ENGLRFVSDV YDGSPADKAG IRVGDEVLSV DGAPYREIDS
FRDRIGRTVE IRLRREADAQ PFKATVAVER LRPLPTFEKA IDKSVTLHEE GGRSIGYLRL
WTLSSPDGLD IVARELATGR LKDADGVIVD LRGRWGGGPP DAAELFVGDT PNFRLIARGS
KDMLANVRWR RPVVAIIDEG SRSGLELFAY ALKSNGIPLV GTRTAGALLA GRAYLLPDDS
ILELAVSDAV IDDDVRLEGR GVEPDIIATF PLPYAAGRDP QREAAFEEMR RTLAKD