Gene RSc1216 details

Gene Information

Locus tag: RSc1216
Symbol: hisS
ID: 1220038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia solanacearum GMI1000
Kingdom: Bacteria
Replicon accession: NC_003295
Strand:
Start bp: 1285118
End bp: 1286416
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 65%
IMG OID: 637237592
Product: histidyl-tRNA synthetase
Protein accession: NP_519337
Protein GI: 17545935
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
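
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1285118, 1286416)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.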


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.420305
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGAAGA TCGCGGGCGT CAAGGGCATG AATGACCTGC TGCCCGGCGA TGCCCCGCTG 
TGGGAGCATT TCGACAATGC CGTGCGCAGC ATGCTGCGTG CCTACGGCTA CCAGCAGATC
CGCACGCCGA TCGTCGAGCA GACGCAGCTG TTCGTGCGCG GCATCGGCGA GGTGACGGAC
ATCGTCGAGA AGGAGATGTA CTCCTTCACC GACGCCCTGA ACGGCGAGCA GCTGACGATG
CGTCCGGAAG GCACTGCGGC CGCCGTGCGC GCCGTGATCG AGCACAACCT GCTGTACGAC
GGCCCGAAGC GTCTGTGGTA CACCGGCCCG ATGTTCCGCC ACGAGAAACC GCAGCGCGGC
CGTTATCGCC AGTTCCACCA GGTGGGCGTG GAAGCGCTGG GCTTTGCCGG CCCGGACATC
GATGCGGAAG TCATCCTGAT GTGCCAGCGC CTGTGGGATG ACCTCGGCTT GGTCGGTCTG
AAGCTGGAGC TGAACTCGCT CGGCCAGGCG GAGGAGCGTG CAGCGCACCG CGCCGATCTG
ATCAAGTATC TGGAAGGCTT CCAGGACATC CTGGACGAAG ACAGCAAGCG GCGCCTGTAC
ACCAATCCGC TGCGTGTGCT CGACACCAAG AACCCGGCCC TGCAGGAGAT GGCTGCCGGC
GCGCCCAAGC TGATCGACTA CCTGGGTGAA GAGTCGTGCG CGCACTTCGA GGGCGTGCAG
AAGCTGCTCA AGGCCAACAA CATTCCGTTC ACCATCAACC CGCGTCTGGT GCGTGGCCTG
GACTACTACA ACCTGACCGT GTTCGAATGG ACGACCGACA AGCTCGGCGC GCAGGGCACG
GTGGCCGGCG GCGGCCGCTA TGACCCGCTG ATCGAACAGA TCGGCGGCAA GCCGGCGCCG
GCTTGCGGTT GGGCGATGGG CGTGGAGCGC ATCATCGAAC TGCTGCGCGA AGAGAATCTG
GCACCGGAGC CGCAGGGCAG CGACGTCTAC ATCGTCCACC AGGGTGATGA GGCACAGGTG
CAGGCGCTGG TGGCGGCCGA GCGCCTGCGC GACGCGGGGC TGGACGTGAT CCTCCACGCC
TCGGCCGAAG GGCGCAACGG CAGCTTCAAA TCGCAGTTCA AGCGCGCTGA CGCAAGTGGC
GCAGCCTATG CGGTTATCAT TGGCGACGAC GAAGTCGCCT CCGGCGTGGT GCAGATCAAG
CCCCTGCGCG GTGATCCGAA TGCGGACGCC CAGCAGACCG TGCCGTCCGA CCAACTGGTC
GATCGCCTGA TTGATGCCAT GGTGGCAAAT AGCGACTGA
 
Protein sequence
MQKIAGVKGM NDLLPGDAPL WEHFDNAVRS MLRAYGYQQI RTPIVEQTQL FVRGIGEVTD 
IVEKEMYSFT DALNGEQLTM RPEGTAAAVR AVIEHNLLYD GPKRLWYTGP MFRHEKPQRG
RYRQFHQVGV EALGFAGPDI DAEVILMCQR LWDDLGLVGL KLELNSLGQA EERAAHRADL
IKYLEGFQDI LDEDSKRRLY TNPLRVLDTK NPALQEMAAG APKLIDYLGE ESCAHFEGVQ
KLLKANNIPF TINPRLVRGL DYYNLTVFEW TTDKLGAQGT VAGGGRYDPL IEQIGGKPAP
ACGWAMGVER IIELLREENL APEPQGSDVY IVHQGDEAQV QALVAAERLR DAGLDVILHA
SAEGRNGSFK SQFKRADASG AAYAVIIGDD EVASGVVQIK PLRGDPNADA QQTVPSDQLV
DRLIDAMVAN SD
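
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons, and it assumes the first codon is a valid start codon that is read as methionine (GTG is an accepted initiation codon in table 11). The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Assumption: the first codon is a start codon and is emitted as 'M'
    regardless of its usual meaning (e.g. GTG, normally valine).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append("M" if i == 0 else residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGCAGAAGA TCGCGGGCGT CAAGGGCATG"))
```

For the first 30 bases this yields MQKIAGVKGM, which matches the first ten residues of the listed protein sequence.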