Gene Information |
Locus tag | Sare_1809 |
Symbol | hisS |
ID | 5707112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2083708 |
End bp | 2085030 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641271311 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_001536686 |
Protein GI | 159037433 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
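
The coordinate and length fields above agree with one another, and the check can be sketched in a few lines. This is only an illustrative sketch: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the CDS is unspliced and ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2083708, 2085030)
print(length, protein_length(length))  # prints "1323 440"
```

The result matches the Gene Length (1323 bp) and Protein Length (440 aa) values in the record.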
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.796045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000022389 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAAGC CCACGCCCAT CTCCGGCTTT CCGGAGTGGA CGCCCGACCA GCGAATGATC GAGCAGTACG TCCTGGACCG GATCCGGAGC ACCTTCGAGC GGTACGGGTT CGCGCCGTTG GAGACTCGCG CGGTCGAGCC CCTCGACCAG TTGTTGCGCA AGGGAGAGAC CTCCAAGGAG GTCTACCTGC TCCGGCGGTT GCAGGCCGAC GTCGACGGAC CGGCCGGTGA CGACGCCCTC GGACTGCACT TCGACCTGAC CGTGCCGTTC GCCCGGTACG TGCTGGAGAA CGCCGGCAAG CTTCAGTTCC CGTTCCGCCG CTACCAGATC CAGAAGGTGT GGCGGGGCGA GCGACCGCAG GAGGGTCGCT ACCGCGAGTT CCTGCAGGCC GACATCGACA TCGTCGACCG GGACACGCTG GCTCCACACC ACGAGGCCGA GATGCCGCTG GTGATCGGAG ACGCGCTGCG CTCGCTGCCG ATCCCGCCGG TGCGGATCCA GGTCAACAAC CGCAAGATCT GCGAGGGCTT CTACCGGGGG CTCGGGCTCA CCGACCCGGA GGCGGCGCTG CGCGCGGTCG ACAAGCTCGA CAAGATCGGT CCGGTGCGGG TGGCCGAGTT GCTGATCGCC GCGGCCGGGG CGACCGAGGC GCAGGCCAAG GCCGTGCTGG CGTTGGCGGA GATCTCGGCG CCGGACGCGT CGTTCGCGGA CGCGGTGAAT GCGCTCGGGG TGAGTCACCC GCTGCTCGAC GAGGGCGTCG CGGAGCTGGT CCAGGTGATG CAGACGGCCG CCGAGCACGC TCCCGGCCTC TGCGTGGCCG ACCTGCGTAT CGCCCGGGGC CTGGACTACT ACACCGGGAC CGTCTACGAG ACGCAGCTGG TCGGGTACGA GCGGTTCGGC TCGATCTGTT CCGGTGGTCG GTACGACAAC CTGGCCAGCG CCGGTGCCGT CGCGTTCCCG GGAGTGGGGA TCTCGATCGG CGTGACCCGG CTACTCGGCC TGCTCTTCGG CGCCGGGGCG TTGACCATCT CCCGTCAGGT CCCCACCTGT GTGGTGGTGG CGGTGGCCAG CGAGGCGCAG CGGGCGGTGA GTAATCGGGT TGCCGAGACG CTGCGCGCCC GGGGGATCTC CACCGAGGTC GCGCCGAGTG CGGCGAAGTT CGGCAAGCAG ATTCGGTACG CCGAGCGGCG TGGCATCCCG TACGTGTGGT TCCCGGGTGC GGACGAGGAC GAGGTCAAGG ACATTCGTAC CGGTGCGCAG GTGCCGGCGC GGGCCGAGGT GTGGGAGCCG CCGGCGGCAG ACCTGCGGCC GATGGTGAGC TGA
|
Protein sequence | MSKPTPISGF PEWTPDQRMI EQYVLDRIRS TFERYGFAPL ETRAVEPLDQ LLRKGETSKE VYLLRRLQAD VDGPAGDDAL GLHFDLTVPF ARYVLENAGK LQFPFRRYQI QKVWRGERPQ EGRYREFLQA DIDIVDRDTL APHHEAEMPL VIGDALRSLP IPPVRIQVNN RKICEGFYRG LGLTDPEAAL RAVDKLDKIG PVRVAELLIA AAGATEAQAK AVLALAEISA PDASFADAVN ALGVSHPLLD EGVAELVQVM QTAAEHAPGL CVADLRIARG LDYYTGTVYE TQLVGYERFG SICSGGRYDN LASAGAVAFP GVGISIGVTR LLGLLFGAGA LTISRQVPTC VVVAVASEAQ RAVSNRVAET LRARGISTEV APSAAKFGKQ IRYAERRGIP YVWFPGADED EVKDIRTGAQ VPARAEVWEP PAADLRPMVS
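
The gene and protein sequences above are related by translation table 11, and the relationship can be spot-checked with a short script. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is built from the standard codon assignments (which table 11 shares for non-initiator codons), and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# '*' marks a stop codon. Table 11 uses the same assignments for
# codons that are not initiators.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown in the record.
print(translate("ATGAGCAAGC CCACGCCCAT CTCCGGCTTT"))  # prints "MSKPTPISGF"
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.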