Gene Sare_1809 details

Gene Information

Locus tag: Sare_1809
Symbol: hisS
ID: 5707112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2083708
End bp: 2085030
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 70%
IMG OID: 641271311
Product: histidyl-tRNA synthetase
Protein accession: YP_001536686
Protein GI: 159037433
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
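
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 2083708
end_bp = 2085030

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1323 0 440
```

Under these assumptions the computed values agree with the listed gene length of 1323 bp and protein length of 440 aa.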


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.796045
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000022389
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCAAGC CCACGCCCAT CTCCGGCTTT CCGGAGTGGA CGCCCGACCA GCGAATGATC 
GAGCAGTACG TCCTGGACCG GATCCGGAGC ACCTTCGAGC GGTACGGGTT CGCGCCGTTG
GAGACTCGCG CGGTCGAGCC CCTCGACCAG TTGTTGCGCA AGGGAGAGAC CTCCAAGGAG
GTCTACCTGC TCCGGCGGTT GCAGGCCGAC GTCGACGGAC CGGCCGGTGA CGACGCCCTC
GGACTGCACT TCGACCTGAC CGTGCCGTTC GCCCGGTACG TGCTGGAGAA CGCCGGCAAG
CTTCAGTTCC CGTTCCGCCG CTACCAGATC CAGAAGGTGT GGCGGGGCGA GCGACCGCAG
GAGGGTCGCT ACCGCGAGTT CCTGCAGGCC GACATCGACA TCGTCGACCG GGACACGCTG
GCTCCACACC ACGAGGCCGA GATGCCGCTG GTGATCGGAG ACGCGCTGCG CTCGCTGCCG
ATCCCGCCGG TGCGGATCCA GGTCAACAAC CGCAAGATCT GCGAGGGCTT CTACCGGGGG
CTCGGGCTCA CCGACCCGGA GGCGGCGCTG CGCGCGGTCG ACAAGCTCGA CAAGATCGGT
CCGGTGCGGG TGGCCGAGTT GCTGATCGCC GCGGCCGGGG CGACCGAGGC GCAGGCCAAG
GCCGTGCTGG CGTTGGCGGA GATCTCGGCG CCGGACGCGT CGTTCGCGGA CGCGGTGAAT
GCGCTCGGGG TGAGTCACCC GCTGCTCGAC GAGGGCGTCG CGGAGCTGGT CCAGGTGATG
CAGACGGCCG CCGAGCACGC TCCCGGCCTC TGCGTGGCCG ACCTGCGTAT CGCCCGGGGC
CTGGACTACT ACACCGGGAC CGTCTACGAG ACGCAGCTGG TCGGGTACGA GCGGTTCGGC
TCGATCTGTT CCGGTGGTCG GTACGACAAC CTGGCCAGCG CCGGTGCCGT CGCGTTCCCG
GGAGTGGGGA TCTCGATCGG CGTGACCCGG CTACTCGGCC TGCTCTTCGG CGCCGGGGCG
TTGACCATCT CCCGTCAGGT CCCCACCTGT GTGGTGGTGG CGGTGGCCAG CGAGGCGCAG
CGGGCGGTGA GTAATCGGGT TGCCGAGACG CTGCGCGCCC GGGGGATCTC CACCGAGGTC
GCGCCGAGTG CGGCGAAGTT CGGCAAGCAG ATTCGGTACG CCGAGCGGCG TGGCATCCCG
TACGTGTGGT TCCCGGGTGC GGACGAGGAC GAGGTCAAGG ACATTCGTAC CGGTGCGCAG
GTGCCGGCGC GGGCCGAGGT GTGGGAGCCG CCGGCGGCAG ACCTGCGGCC GATGGTGAGC
TGA
 
Protein sequence
MSKPTPISGF PEWTPDQRMI EQYVLDRIRS TFERYGFAPL ETRAVEPLDQ LLRKGETSKE 
VYLLRRLQAD VDGPAGDDAL GLHFDLTVPF ARYVLENAGK LQFPFRRYQI QKVWRGERPQ
EGRYREFLQA DIDIVDRDTL APHHEAEMPL VIGDALRSLP IPPVRIQVNN RKICEGFYRG
LGLTDPEAAL RAVDKLDKIG PVRVAELLIA AAGATEAQAK AVLALAEISA PDASFADAVN
ALGVSHPLLD EGVAELVQVM QTAAEHAPGL CVADLRIARG LDYYTGTVYE TQLVGYERFG
SICSGGRYDN LASAGAVAFP GVGISIGVTR LLGLLFGAGA LTISRQVPTC VVVAVASEAQ
RAVSNRVAET LRARGISTEV APSAAKFGKQ IRYAERRGIP YVWFPGADED EVKDIRTGAQ
VPARAEVWEP PAADLRPMVS
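
The protein sequence is the translation of the gene sequence, so the two can be compared directly. The following is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to non-start codons.

```python
from itertools import product

# Standard codon table in TCAG order. Assumption: translation table 11
# uses the same amino acid assignments for non-start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGAGCAAGC CCACGCCCAT CTCCGGCTTT"
print(translate(first_blocks))  # MSKPTPISGF
```

The result matches the first ten residues of the listed protein sequence. Passing the complete gene sequence to the same function would extend the comparison to the whole protein.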