Gene Smed_0403 details


Gene Information

Locus tag: Smed_0403
Symbol: hisS
ID: 5321237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 432749
End bp: 434272
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 62%
IMG OID: 640789338
Product: histidyl-tRNA synthetase
Protein accession: YP_001326095
Protein GI: 150395628
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
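
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(432749, 434272)
print(length)                     # 1524
print(protein_length_aa(length))  # 507
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.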


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.326468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0636459
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTCCA TGAATGAAAA ACAGAAGAAA ACGCAGAAGC TCAGGGCGCG CTTGCCGCGT 
GGTTTCGTAG ATCGTCCGGC CGCGGACATC CGCGCCGTGG ACGAGATGGT TTCGAAGATC
CGCGAGGTCT ATGAACGCTA CGGCTTCGAT CCGGTGGAAA CCCCGCTTTT TGAATATACT
GACGCGCTCG GAAAATTCCT GCCGGACAGC GACCGGCCGA ATGAAGGCGT CTTCTCGCTC
ACCGACGACG ACGACCAGTG GCTTAGTCTT CGCTACGACC TGACGGCGCC GCTGGCGCGC
CACGTGGCCG AGAATTTCAA CGAGATCCAG CTTCCCTACC GGACCTATCG CGCCGGCTAC
GTCTTCCGCA ACGAGAAGCC GGGGCCGGGC CGTTTCCGCC AGTTCATGCA GTTCGATGCC
GATACGGTGG GTGCCGCCGG CGTGCAGGCC GATGCCGAAA TGTGCATGAT GATGGCCGAC
ACGATGGAGG CGCTCGGCAT TGCCCGCGGC GACTACGTGA TCCGCGTCAA CAACCGCAAG
GTTCTCGACG GCGTTCTGGA GGCGATAGGT CTCGGCGGCG GCGAGCAGAT GAATACGCGC
CTGACGGTGC TGCGTGCCAT CGACAAGCTC GACAAATTCG GCCCGGAAGG CGTGCGGCTG
TTGCTGGGCG AAGGACGCAA GGACGAGAGC GGCGACTTCA CCAAGGGCGC CGGGCTCGAC
GATGAGCAGA TCGGCAAGAT CCTGTTCTTT GTCGGCATCA CCGACTATGC CGAGAGCGCG
GACGCGTTGG CGGCACTCGT CGCCGGGACG GCCCGGGGCG CCGAGGGCGT CAATGAGTTG
AACACGATCC GCAGCCTTGT GCTCAGCGCC GGCTACGAGG CGGACCGCAT CAAGATCGAC
CCCTCGGTCG TGCGTGGTCT GGAATATTAT ACGGGTCCCG TTTTCGAGGC GGAGTTGCAG
TTCGCCGTCA CCAACGAAAA GGGGGAGAAG GTCGTCTTCG GCTCCGTCGG CGGAGGCGGC
CGCTATGACG GTCTCGTGTC GCGCTTCATG GGACAGCCGG TGCCGGCGAC CGGCTTTTCG
ATCGGCGTGT CTCGGCTGAT GACGGCGCTC AAGAATCTGG GCAAACTGGG CGTAGAACAG
GTCACCGCGC CGGTCGTCGT TTGCGTCATG GACCGCGACA TCGAAAGCAT GGGGCGCTAC
CAGCGCTTTG TGCAGGATCT GCGTCATGCC GGCATCCGCG CCGAAATGTA CCAGGGCAAC
AAGAAGAACT TCGGCGACCA GCTCAAATAT GCCGATCGTC GCGGTTCGCC GATCGCTGTC
ATTCAGGGCG GCGACGAGCG CGCTTCCGGC GTCGTACAGA TCAAGGACCT GATCGAAGGC
AAGCGGCTCT CTGGCGAAAT CGAAGATAAT GTCGCTTGGC GCGAGGCGCG TGCCGCTCAG
ATTTCCGTCC CGGAAGGCGA ACTCGTCGTT AAGATTCGCG AAATTCTCGC GGACCAGGCC
GAGGACCGGA AAAGGGCGGG TTGA
 
Protein sequence
MLSMNEKQKK TQKLRARLPR GFVDRPAADI RAVDEMVSKI REVYERYGFD PVETPLFEYT 
DALGKFLPDS DRPNEGVFSL TDDDDQWLSL RYDLTAPLAR HVAENFNEIQ LPYRTYRAGY
VFRNEKPGPG RFRQFMQFDA DTVGAAGVQA DAEMCMMMAD TMEALGIARG DYVIRVNNRK
VLDGVLEAIG LGGGEQMNTR LTVLRAIDKL DKFGPEGVRL LLGEGRKDES GDFTKGAGLD
DEQIGKILFF VGITDYAESA DALAALVAGT ARGAEGVNEL NTIRSLVLSA GYEADRIKID
PSVVRGLEYY TGPVFEAELQ FAVTNEKGEK VVFGSVGGGG RYDGLVSRFM GQPVPATGFS
IGVSRLMTAL KNLGKLGVEQ VTAPVVVCVM DRDIESMGRY QRFVQDLRHA GIRAEMYQGN
KKNFGDQLKY ADRRGSPIAV IQGGDERASG VVQIKDLIEG KRLSGEIEDN VAWREARAAQ
ISVPEGELVV KIREILADQA EDRKRAG
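
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocked gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. The helper names are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in which codons may serve as start codons). Only the first line of the gene sequence is used here to keep the example short.

```python
# Codon table built from the NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGCTTTCCA TGAATGAAAA ACAGAAGAAA ACGCAGAAGC TCAGGGCGCG CTTGCCGCGT"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MLSMNEKQKKTQKLRARLPR
```

The 20 residues produced from the first 60 bases match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.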