Gene Smed_0713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0713 
SymbolcysS 
ID5321550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp765466 
End bp766866 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content62% 
IMG OID640789650 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_001326404 
Protein GI150395937 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.652412 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGAA TGACGGTTTT GAAGCTGTAC AACACGCTGA CGCGGGAAAA GACCGATTTC 
AGGCCGATCG ATCCGAAAAA CGTCCGCATG TATGTCTGCG GCCCGACGGT CTACGACTTT
GCCCATATCG GCAATGCCCG GCCGATCATC GTCTTCGACG TCCTCTTCCG GCTGCTGCGG
CACGTCTACG GCACGGATCA TGTCACCTAT GCGCGCAACA TCACCGACGT GGATGACAAG
ATAAATGCGC GGGCACTGCG CGACTATCCC GGCCTGCCGC TGAACGAGGC TATCCGGCAT
GTGACTGAGA GGACCGAGAC GCGATTCCTC GAGGATGCCG CATTGCTCGG CTGCCTCGAT
CCCACAGTGC AGCCTCGCGC CACCGAGAAC ATCCCGGGGA TGATCGAGAT TATCGAAACG
CTCATCGCCA AGGGCCATGC TTATGAGGCT GAGGGCGAGG TACTCTTCGA CACGCGGTCG
ATGGCGGAAT ACGGGCAGCT TTCCAAACGC AATCTCGACG AGCAGCAGGC GGGTGCCCGT
GTTGCCGTCG AGGCGCACAA GAGAAACCCG GGCGATTTCG TGCTCTGGAA ACTATCCGCC
GGGCACGAGC CGGGATGGGA GAGCCCCTGG GGTCGCGGGC GGCCCGGTTG GCACATCGAA
TGCTCCGCTA TGAGCGGGCG CTATCTCGGC GAAGTCTTCG ACATTCACGG CGGCGGCATC
GACCTGATCT TCCCGCATCA CGAAAACGAG ATCGCCCAGT CGCGGTGCGC CCACGGAACC
GCGGTGATGG CGAATGTCTG GATGCACAAT GGCTTCCTGC AGGTCGAGGG CCGCAAGATG
TCGAAGTCCG AAGGCAACTT CATCACCATC TACGACCTTC TTCACACCGA AAAGTTCGGC
GGCCGCAAAT GGCCGGGCGA GGTGCTCCGG TTGGCGATGC TGATGACCCA TTATAGGGAG
CCGATCGACT TCTCGATCAA GCGGCTGGAG GAGGCCGAGC ATCTGCTGTC GAAATGGCCG
GTTCACGGGT CCGCCTCGGG CGAGGCGGAC CCGGCTGTCG TGGCCGCGCT CACCGATGAT
CTCAATACCG TGGCCGCAAT ACAGGCTCTG CATGCTCTGG CGCAAAAGGC CACTGCCGAT
GCCCGCCATC TGGGCGCCTT CGCCGCCAGC GCGGCCCTTC TCGGCGTGGA GCCGAAGGAG
ATCGAGCTCG ACGAGGCGGT CGTGCAGGAG ATCGACGGCC GCGTCCGTGA ACGCCTGGAA
CTCCTGAAGA GCAAGAATTA CGCGGAGGCC GACGGCATTC GCGCCGATCT CCTCGCAAGG
GGAATTCAGC TCAAGGACGG CAAGGATCCT GAGACTGGCG AACGGGTAAC GACCTGGGAG
GTGAAACGGT CTCAGGTCTA G
 
Protein sequence
MGGMTVLKLY NTLTREKTDF RPIDPKNVRM YVCGPTVYDF AHIGNARPII VFDVLFRLLR 
HVYGTDHVTY ARNITDVDDK INARALRDYP GLPLNEAIRH VTERTETRFL EDAALLGCLD
PTVQPRATEN IPGMIEIIET LIAKGHAYEA EGEVLFDTRS MAEYGQLSKR NLDEQQAGAR
VAVEAHKRNP GDFVLWKLSA GHEPGWESPW GRGRPGWHIE CSAMSGRYLG EVFDIHGGGI
DLIFPHHENE IAQSRCAHGT AVMANVWMHN GFLQVEGRKM SKSEGNFITI YDLLHTEKFG
GRKWPGEVLR LAMLMTHYRE PIDFSIKRLE EAEHLLSKWP VHGSASGEAD PAVVAALTDD
LNTVAAIQAL HALAQKATAD ARHLGAFAAS AALLGVEPKE IELDEAVVQE IDGRVRERLE
LLKSKNYAEA DGIRADLLAR GIQLKDGKDP ETGERVTTWE VKRSQV