Gene Smed_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0020 
Symbol 
ID5320847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp17332 
End bp18291 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content63% 
IMG OID640788951 
Productsignal peptide peptidase SppA, 36K type 
Protein accessionYP_001325715 
Protein GI150395248 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000184229 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGAAC TCGCCATAGC CGACCGCCGC AGCCTGCGCA GAAAGCTGAG CTTCTGGCGT 
TGGGCGGCGG CGGCCGTTCT CTTCGCCGGC GGCCTTGCCC TGTTTGCCTT TTCCGGCTGG
GGCGATGTGA CGGAGCGCGC CCGCGATCAC GTTGCGCGCG TCGCCGTCAC AGGTTTGATC
CAGGACGACC GGGAACTGGT CGAGCGGCTG GAACGGATTG CCGAGAACCA GTCAGTCAAG
GCATTGATCG TGACGATCTC GTCCCCGGGG GGCACGACCT ATGGGGGTGA GGTCATATAC
AAGGCGGTCC GCAAGGTGGC CGCCAAAAAA CCGGTGGTCT CGGACGTGCG CACGCTTGCC
GCGTCGGCAG GCTACCTGAT CGCGCTCGCG GGCGACCGTA TCGTCGCCGG CGAGACGTCG
ATTACCGGTT CGATCGGCGT CATCTTCCAA TATCCCCAGG TCAAGACCCT GATGGACAAG
CTCGGCGTGT CGCTCGAATC GATAAAGTCG AGGCCCCTCA AGGCCGAGCC CTCGCCGTTC
CATCCTCCGA GCGACGAGGC GAGGGCCATG ATTCAGGCGA TGATCGACGA CAGCTACGGA
TGGTTCGTCG ACCTGGTGGC GGAGCGGCGC AAACTGCCGC GGGCGGAAGC GCTCGGCCTT
GCGGATGGTC GGATCTTCAC CGGCCGGCAG GCACTGGAAG GCAAGCTCGT CGACGAACTC
GGCGGCGATG ATGAAATCAG GGCTTTCCTG GCCGAAAGGA AGGTCTCGAA GGACCTGCCC
GTCCTCGATT GGGAAGCTCC GAGCAGCACG CTGTCTTTCG GCCTCGGCTC GCTCCTGGCC
GAAGCCGTCA AGGCGTTGGG ATATGAGGCT TTTCCGGCAA TGAAGGGCCT CGAAAAGACC
GGCCTGGACA AGTTGTTTCT TGACGGTCTT CTTTCGGTTT GGCAGGTTGA AGGGCAATGA
 
Protein sequence
MDELAIADRR SLRRKLSFWR WAAAAVLFAG GLALFAFSGW GDVTERARDH VARVAVTGLI 
QDDRELVERL ERIAENQSVK ALIVTISSPG GTTYGGEVIY KAVRKVAAKK PVVSDVRTLA
ASAGYLIALA GDRIVAGETS ITGSIGVIFQ YPQVKTLMDK LGVSLESIKS RPLKAEPSPF
HPPSDEARAM IQAMIDDSYG WFVDLVAERR KLPRAEALGL ADGRIFTGRQ ALEGKLVDEL
GGDDEIRAFL AERKVSKDLP VLDWEAPSST LSFGLGSLLA EAVKALGYEA FPAMKGLEKT
GLDKLFLDGL LSVWQVEGQ