Gene Smed_3110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3110 
Symbol 
ID5323989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3256627 
End bp3258138 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content64% 
IMG OID640792060 
Productpeptidase M48 Ste24p 
Protein accessionYP_001328771 
Protein GI150398304 
COG category[R] General function prediction only 
COG ID[COG4784] Putative Zn-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGGC AGGCAGTCAA GAATGACTGG TTTCCGCGGC GGCATGGAAC CGTGCGCGCA 
CGAGCGTTGG TTGTATTGGC GGCCACGACG CTGGTCGCCG GCTGCCAGTC GGTAATCGAA
CAGACCTACG AGCCGACCGT TTCGCCATCC TCCAATCCGC AGATCGTCGA AGAGGTGCAG
AAGAACGATC CGCGTGCCCA ACTCGGCGCA CGCGAACATC CGCGCATCGT TGCGAGCTAT
GGCGGTGAGT ACCGCGACGC CAAGACAGAG CGGCTGGTCG CCCGGATCAC CGGCGCCCTG
ACGGCCGTTT CGGAAAATCC CCAGCAATCC TACCGCATCA CCATTCTCAA TTCGCCGGCG
ATCAACGCTT TCGCGCTCCC CGGCGGCTAT CTCTACGTCA CGCGGGGTCT GCTGGCGCTT
GCCAACGACG CCTCTGAAGT GGCAGCCGTG CTCTCGCACG AGATGGCCCA TGTGACGGCC
AATCACGGCA TTCAGCGACA GCAGCGCGAG GAGGCGGAGG TGATCGCCAG CCGCGTCGTT
TCCGAGGTGC TGTCTTCCGA TCTCGCCGGC AAACAGGCGC TTGCCCGGGG TAAGCTGCGC
CTCGCGGCCT TTTCACGCAA TCAGGAACTG CAGGCGGACG TGATCGGCGT CCGCATGCTG
GGAGAAGCGG GCTACGATCC CTATGCGGCG GCACGCTTTC TCGATTCCAT GGCTGCCTAT
AGCCGCTTCA GTGCGGTCGA TCCGGAGGCA GACCAGAGCC TGGACTTTCT GTCGAGCCAC
CCTAACGCGC CGCAGCGCGT CGACCTCGCG CGACGGCATG CGCGTGCCTT TGGTCTCGAG
GGCACAAGCG GCGATCGCGG GCGCGACTAC TATCTCGCGG GAATCGACGG TATCCTCTAT
GGCGACAGTC CGCAGGAAGG CTATGTCCGT GGACAGACCT TCCTGCACGG CCAGCTCGGC
ATACGCTTCG ACGTGCCCAC AGGTTTTCAG ATCGACAACA AGGCAGAGGC CGTACTTGCC
ACGGGTCCTG GTGAAGTGGC CGTCCGTTTC GACGGCATCG CCGATACGAG CGGGCGCAAC
CTGACCGATT ACATCGCCAG CGGCTGGGTG ACCGGCCTCA AGCCCGATAC GATCCGTTCC
ATCCGGGTCA ATGGTCTGGA GGCCGCGACG GCGCGCGCCT CCGCCGACCG CTGGGATTTC
GACGTCACGG TGATCAGGCT CGGCGAGCGC ATATACCGCT TTCTGACGGC CGTTCCAAAG
GGTTCCAGCG CACTGCAGCC GACAGCGGAC CAGCTGCGCA CGTCCTTCCG ACGGATGACG
TCTGGCGAAG TTCAGTCGCT GAAGCCGCTG CGTGTCCGGG TGGTCACGGT CCGGTCCGGC
GACACGACCG CGACACTGGC GGCGCGCATG ATGGGCACCG ACCGGAAGCT CGACCTCTTT
CGCCTGATCA ATGCGATGCA GATCACCTCC ACCGTCAGAC CGGGCGACAA GGTGAAGATC
ATTTCCGAGT GA
 
Protein sequence
MIRQAVKNDW FPRRHGTVRA RALVVLAATT LVAGCQSVIE QTYEPTVSPS SNPQIVEEVQ 
KNDPRAQLGA REHPRIVASY GGEYRDAKTE RLVARITGAL TAVSENPQQS YRITILNSPA
INAFALPGGY LYVTRGLLAL ANDASEVAAV LSHEMAHVTA NHGIQRQQRE EAEVIASRVV
SEVLSSDLAG KQALARGKLR LAAFSRNQEL QADVIGVRML GEAGYDPYAA ARFLDSMAAY
SRFSAVDPEA DQSLDFLSSH PNAPQRVDLA RRHARAFGLE GTSGDRGRDY YLAGIDGILY
GDSPQEGYVR GQTFLHGQLG IRFDVPTGFQ IDNKAEAVLA TGPGEVAVRF DGIADTSGRN
LTDYIASGWV TGLKPDTIRS IRVNGLEAAT ARASADRWDF DVTVIRLGER IYRFLTAVPK
GSSALQPTAD QLRTSFRRMT SGEVQSLKPL RVRVVTVRSG DTTATLAARM MGTDRKLDLF
RLINAMQITS TVRPGDKVKI ISE