Gene Smed_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1014 
Symbol 
ID5321859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1084606 
End bp1086003 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content62% 
IMG OID640789956 
Productprotease Do 
Protein accessionYP_001326702 
Protein GI150396235 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.370386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.451123 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTTTG GCCACCGAGC CGCCGCGGCT ATCGTCCTCG CAATTGCCAT TTCGACGCCT 
GCGATGGCTC AGGATACCAG GACTGTGCCG CAGTCGCGGG CGGAGATGCA ACTCTCTTTC
GCGCCGCTCG TCAAACAGAC GGCGAATGCG GTCGTGAACG TCTATGCCGA GCGCGCGGTG
GAACGGCGGT CGATCTTTGC CGGGGACCCT TTCTTCGAGG AGTTTTTCGG TCAGCGGATG
CCCAATCGCA CCGAAAAGCA GTCATCGCTC GGATCAGGCG TGATCGTCGG CCGCAACGGC
CTGGTCGTCA CCAACAACCA TGTCATCGAT GGTGCCGACG ATATCAAGGT GGCACTGGCC
GACGGGCGGG AGTTCCCTTG CAAGCTTATA TTGAAGGACG ATCGCCTGGA CCTCGCGGTC
ATGAAAATCC AGTCGGACGG CCCGTTCGAC ATCATCCCGA TCGGCGATTC CGACGCGGTG
GAAGTCGGGG ACCTTGTGCT GGCGATGGGT AATCCTTTCG GTGTCGGGCA GACGGTCACG
AGCGGCATCG TGTCGGCACT CGCCCGTAAC CAGATTTCCA ACGGGGATTT CGGTTTTTTC
ATCCAGACGG ATGCAGCCAT CAATCCCGGC AATTCCGGCG GGGGTCTGAT CGACATGAAG
GGCGAGTTGA TCGGAATCAA CACCGCGATT TTCTCAAGAG GCGGCGGTTC CAACGGTGTC
GGCTTTGCGA TCCCTGCCAA TCTGGTCAAG GTTTTCGTGG CCTCCGCTGA AGGAGGCAAT
GGCTCATTCA TTCGGCCCTT CGTCGGAGCG ACCTTCGAAC CGGTGACGTC CGACGTGGCC
GAGGCGCTTG GACTTGAACG GGCGCGTGGG GCGCTGGTGA CGGCGGTTGT CGCGGGCGGT
CCGGCCGAGA GCGCCGGCAT GCGCCCCGGC CAGGTGGTCA CCGCCGTCAA CGATATACCG
GTCGAACACC CCGATGCGCT CGGCTACCGC CTGACGACGG TCGGGATCGG GCATGAGGCG
CGCGTGACGG TTTCGGAGAA CGGCGATTTG CGCGAAATCA CCCTCCGGCT GGAGCGGGCG
CCGGAAACTC AGCCGCGTGA CGAACGGCTG ATCGAGGGTC GCAATCCCTT CGCCGGCGCC
GTGGTGGCAA ATCTCTCACC CCGGCTTGCC GAGGAGTTGC GCATGCCGAC GTCGCTGCAG
GGCGTGGTGG TCACCGAGAT CAATCGCGGC TCGCCGGCCG CTCGCATCGG CCTCGAACCG
AAAGACATTG TTCGTTCTGT CAACGGCACC GCAATCGAGA GTTCGAAGAC ACTGGAAAGC
GTCGTCGCCG AGGATGCTTC CTTCTGGCGT GTCGAGATCG AACGCAACGG CCAGATCATC
CGTCAGTTCT TCCGATGA
 
Protein sequence
MIFGHRAAAA IVLAIAISTP AMAQDTRTVP QSRAEMQLSF APLVKQTANA VVNVYAERAV 
ERRSIFAGDP FFEEFFGQRM PNRTEKQSSL GSGVIVGRNG LVVTNNHVID GADDIKVALA
DGREFPCKLI LKDDRLDLAV MKIQSDGPFD IIPIGDSDAV EVGDLVLAMG NPFGVGQTVT
SGIVSALARN QISNGDFGFF IQTDAAINPG NSGGGLIDMK GELIGINTAI FSRGGGSNGV
GFAIPANLVK VFVASAEGGN GSFIRPFVGA TFEPVTSDVA EALGLERARG ALVTAVVAGG
PAESAGMRPG QVVTAVNDIP VEHPDALGYR LTTVGIGHEA RVTVSENGDL REITLRLERA
PETQPRDERL IEGRNPFAGA VVANLSPRLA EELRMPTSLQ GVVVTEINRG SPAARIGLEP
KDIVRSVNGT AIESSKTLES VVAEDASFWR VEIERNGQII RQFFR