Gene Smed_2014 details


Gene Information

Locus tag: Smed_2014
Symbol:
ID: 5322873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2064642
End bp: 2066150
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 61%
IMG OID: 640790951
Product: protease Do
Protein accession: YP_001327682
Protein GI: 150397215
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
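
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2064642, 2066150)
print(length, protein_length_aa(length))  # prints "1509 502"
```

Both values match the Gene Length and Protein Length fields of this record.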


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCGAAAC GACCCGAATT CTTCCGCGGC CTGGCGCTCG CGGCCGCGAC GGCACTCATT 
TTCACCAATA CGGCGCTTGC CCAGACGGCC ACGTCACGGC CTCCGGCCGG TCCTCCATCC
GTCGCGGATC TTGCCGAGGG ACTGCTCGAC GCAGTCGTCA ACATATCGAT TTCGCAGAAC
GTCAAGGGCG ACGACGACAA TGCGCCGATA CCGCAGGTGC CGGAAGGGTC ACCCCATCAG
GAGTTCTTCG ACGAATTTTT CAGGGGTCCG GGAGGCGAGG GCGGCCGGCC GCGCACCGTC
AATTCGCTGG GTTCAGGCTT TATCATCGAT CCCGCGGGCT ACATTGTCAC CAACAACCAC
GTGATCCAGG ATGCCGACGA TATCGAAATC AATTTCTCCG ACGGCACGAA GCTGAAGGCG
AAACTGGTCG GCATGGATAC GAAAACCGAC CTCGCCCTGC TGAAGGTCGA ACCGAAAAAG
CCGCTCAAAG CCGTCTCTTT CGGTGATTCG CGAAAGATCA GGATCGGCGA TTGGGTGATG
GTGGTCGGCA ATCCGTTCGG TCTCGGCGTT TCCGTTTCCG TGGGCGTCGT CTCCGCACGC
GGCCGCAATA TCAATGCCGG CCCCTATGAC AGTTTCATCC AGACCGACGC GGCGATCAAT
CGCGGCAATT CGGGCGGACC GCTTTTCAAC ATGCAAGGCG AAGTCGTCGG CATCAATACG
GCGATCCTTT CGCAGACCGG CATGTCGGTC GGAATAGGCT TCGCCGTGCC GGCGGAGCTT
GCCGTCAACG TTGTAAACCA GCTCAAGGAA TTCGGCGAAA CCCGCCGCGG TTGGCTTGGC
GTTCGAATCC AGCCCGTCAC CGATGACATC GCCGAGAGCC TGAAAATGGA GCTGCCGCGC
GGCGCGCTCG TATCGGGCAT CATCGAGGGC GGCCCGATTA CCAAAGGTGA AATCAAGCCG
GGCGACATTA TCATCCGCTT CGACGGAACG GATATCGCGG AGATTCGGGA CCTCATGCGC
ACCGTCGGAG AGAGCCCCGT CGGCAAGGCC GTCGATGTGG TGATCATCCG TGACGGCAAG
GAGCAGTTGG TCCGCGTGAC ACTGGGCCGG CTGGAAGATG GCGAGCAGCT CGCCAATGCG
AAACCGGGTG AGATGCCGGA GAGCAGTGAT GCGAAGCCCG GCGAACCGCC TGCCGCGCAA
CTGCCGGCAA GCGACACCGT GCTCGGCATG AAGCTCGCAG AGCTCGACAC TGACCGCCGC
CAGAGTTTCG GAATCGCCGA AAACGTCAAA GGCGTCGTCA TTACCGAGGT GCAGCCCAAT
TCGCCCGCGG CCGAGCGCCG GGTTGAGGTG GGCGAGGTGA TCGTCGAACT TGGTCAGGAA
GCAATGGAGA CGCCGGAGGA TGTCACCTCA CGCGTCACGG AACTCAAGGC TGACGGGCGC
CGCAACGCTT TGCTGATGAT CGCCAACAAA AGCGGCGAAC TGCGCTTCGT GACGGTACGG
ATGGAGTGA
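
A GC fraction for a sequence laid out in 10-base blocks like the one above can be computed as in the sketch below. The function name is hypothetical, and how the page rounds its reported GC content is not known, so this shows only the general calculation.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence: 9 of 20 bases are G or C.
print(gc_fraction("TTGTCGAAAC GACCCGAATT"))  # prints 0.45
```

Passing the full gene sequence as a single string, line breaks included, gives the whole-gene value.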
 
Protein sequence
MSKRPEFFRG LALAAATALI FTNTALAQTA TSRPPAGPPS VADLAEGLLD AVVNISISQN 
VKGDDDNAPI PQVPEGSPHQ EFFDEFFRGP GGEGGRPRTV NSLGSGFIID PAGYIVTNNH
VIQDADDIEI NFSDGTKLKA KLVGMDTKTD LALLKVEPKK PLKAVSFGDS RKIRIGDWVM
VVGNPFGLGV SVSVGVVSAR GRNINAGPYD SFIQTDAAIN RGNSGGPLFN MQGEVVGINT
AILSQTGMSV GIGFAVPAEL AVNVVNQLKE FGETRRGWLG VRIQPVTDDI AESLKMELPR
GALVSGIIEG GPITKGEIKP GDIIIRFDGT DIAEIRDLMR TVGESPVGKA VDVVIIRDGK
EQLVRVTLGR LEDGEQLANA KPGEMPESSD AKPGEPPAAQ LPASDTVLGM KLAELDTDRR
QSFGIAENVK GVVITEVQPN SPAAERRVEV GEVIVELGQE AMETPEDVTS RVTELKADGR
RNALLMIANK SGELRFVTVR ME
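
The gene sequence starts with TTG, yet the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the alternative start codons, and an initiating codon is read as methionine. The sketch below is a minimal pure-Python translation under that assumption; the function and constant names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, reading a valid start codon as M and stopping at '*'."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            aa = "M"  # initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("TTGTCGAAAC GACCCGAATT CTTCCGCGGC"))  # prints MSKRPEFFRG
```

The final nine bases of the gene, ATGGAGTGA, end in the stop codon TGA, which is why the protein sequence ends with "ME".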