Gene Smed_0637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0637 
Symbol 
ID5321473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp682614 
End bp684167 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content63% 
IMG OID640789573 
Productprotease Do 
Protein accessionYP_001326328 
Protein GI150395861 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.136267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATGA TCAAAACGTC CCGTCCCTCC TTGAAGACAG TTCTGAAGAC CACGACTGTT 
GCCGGCGTTG CCGCCGTACT GCTGACCACC GGGCTTCCGG CACAGATCAC TCAGTCCTTT
GCCGAAGCGG TCAGCGTGCC GGCTCCCGCC GTTCCGAGCT TTGCCAACGT CGTCGAGGCG
GTTTCGCCGG CGGTCGTTTC CGTTCGCGTG CAGGCACGTG AACAAGCCAG TGACGACGAA
AGCAACTTCA CCTTTGATTT CGGCGGTCGC GGCTTCGACG ATCTGCCGGA AGACCATCCG
CTTCGGCGCT TCTTCCGCGA ATTCGACCCG CGTGACAACG ACCGTGCCGA CCGGTGGCGC
GACCGCCGCG GCCCGCGCGG TGAGGGCCGT CTGCGTCCGC GGGCGCAAGG CTCCGGCTTC
TTCATCACCG AAGACGGCTA CCTCGTCACC AACAACCACG TCATTTCTGA CGGATCGGCC
TTCACCGTCA TTATGGATGA CGGTACCGAG CTCGAAGCCA AGCTCGTCGG CAAAGACAGC
CGGACAGATC TTGCAGTGCT CAAGGTGGAC GCCAAGCGAA AGTTCACACA TGTGAGCTTC
GCCGATGACG AAAAGGTGCG TGTCGGCGAC TGGGTGGTCG CTGTCGGTAA TCCCTTCGGC
CTTGGCGGCA CCGTGACAGC GGGGATCATC TCCGCTCGGG GCCGCGATAT CGGCTCCGGT
CCTTACGACG ATTACCTGCA GGTCGACGCA GCGGTGAACC GTGGGAATTC CGGAGGCCCG
ACCTTCAACC TCTCCGGAGA GGTGGTCGGA ATCAACACGG CCATATTCTC GCCTTCCGGC
GGCAATGTCG GCATCGCCTT CGCAATTCCC GCCTCCGTCG CGAAGGACGT CGTTGACTCC
TTGATCAAGG ACGGCACCGT TTCGCGTGGC TGGCTGGGTG TCCAGATCCA GCCGGTGACG
AAGGATATTG CCGAGTCGCT CGGCCTTGCC GAGGCGAAAG GTGCTCTCGT CGTAGAGCCT
CAAACGGGCT CGCCGGGCGA AAAGGCCGGC ATCAAGAACG GCGACGTCGT GACGGCCCTT
AATGGCGAGC CGGTCAAGGA TCCGCGTGAT CTTGCCCGGC GAGTGGCGGC ACTGCGCCCC
GGCTCCACTG CCGAGGTCAC TCTTTGGCGC TCCGGCAAGT CCGAAACGGT CAAGCTCGAG
ATCGGCACGC TGCCGAGCGA TGCCAAGGAG ACTGCACCGA CAACCGGCGA AGCACAGCCG
GACGAAGGTC AGGCAAGCGA CGAGGCACTG GCCGGGCTCG GCCTGACGGT GACCCCGTCG
GAAGACGACA GGGGCGTCAC GATCACATCC GTCGACCCGG ACTCCGACGC TAGCGATCGC
GGTCTGAAGC AAGGCGAGAA GATCGTCTCC GTCAACAATC AGGAAGTGAA ATCGGCGGAC
GACATTCTCA AGGTGATCAA CAACGCCAGA AAGGACAATC GGACCAAGGC GCTGTTCCAG
ATCGAAGCCC AGGAAGGCAG CCGCTTCGTC GCACTCCCGA TCGCTCAGGG CTGA
 
Protein sequence
MSMIKTSRPS LKTVLKTTTV AGVAAVLLTT GLPAQITQSF AEAVSVPAPA VPSFANVVEA 
VSPAVVSVRV QAREQASDDE SNFTFDFGGR GFDDLPEDHP LRRFFREFDP RDNDRADRWR
DRRGPRGEGR LRPRAQGSGF FITEDGYLVT NNHVISDGSA FTVIMDDGTE LEAKLVGKDS
RTDLAVLKVD AKRKFTHVSF ADDEKVRVGD WVVAVGNPFG LGGTVTAGII SARGRDIGSG
PYDDYLQVDA AVNRGNSGGP TFNLSGEVVG INTAIFSPSG GNVGIAFAIP ASVAKDVVDS
LIKDGTVSRG WLGVQIQPVT KDIAESLGLA EAKGALVVEP QTGSPGEKAG IKNGDVVTAL
NGEPVKDPRD LARRVAALRP GSTAEVTLWR SGKSETVKLE IGTLPSDAKE TAPTTGEAQP
DEGQASDEAL AGLGLTVTPS EDDRGVTITS VDPDSDASDR GLKQGEKIVS VNNQEVKSAD
DILKVINNAR KDNRTKALFQ IEAQEGSRFV ALPIAQG