Gene Saro_0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0289 
SymbolhisS 
ID3916226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp310668 
End bp311927 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content68% 
IMG OID640443018 
Producthistidyl-tRNA synthetase 
Protein accessionYP_495571 
Protein GI87198314 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGC AAACGCCCCA GCCCATCCGC GGCACCCAGG ACATCTTCGG CCCCGACGCG 
GAAGCCTTCG CCTTCGTCGT CGAGACCTTC GAGCGCGTGC GCAAGCTCTA CCGCTTCCGC
CGCGTCGAGA TGCCGGTGTT CGAAAAGACC GCCGTGTTCA GCCGCTCGCT GGGCGAGACG
ACCGACGTGG TCTCCAAGGA AATGTACTCG TTCGAGGATC GCGGCGGGGA ATCGCTGACG
CTGCGCCCCG AATTCACCGC CGGCATCGCC CGCGCCTACC TCACCGACGG CTGGCAGCAG
TACGCGCCGC TCAAGGTCGC GACGCATGGG CCGCTGTTCC GCTACGAACG CCCGCAGAAG
GGCCGCTACC GCCAGTTCCA CCAGATCGAC GCCGAGATCA TCGGCGCGGG CGAACCGCAG
GCGGACGTCG AACTCCTCGT CATGGCCGAC CAGCTCCTCA AGGAACTGGG CATTGGCGCC
AATGATCCCG GCGCGGTCAC GCTCCAGCTC AACACCCTTG GCGACGGCGC CAGCCGCGAG
GCCTGGCGGG CCGCGCTGGT CGAATACTTC CGCGCCCACA AGGCCGAGCT TTCCGAGGAT
TCGCAGGACC GGCTCGAGCG CAACCCCTTG CGCATCCTCG ACAGCAAGGA CCCGCGCGAC
AAGCCCTTCA CCGCCGACGC GCCCAGGATC GACGATTTCC TGAGCGCCGA GGCACAGGAC
TTCTTCGGCA AGGTCACCTC GGGCCTCGAC GCTGCCGGCG TCGAATGGAC TCGCGCCCCC
GCCCTGGTGC GTGGCCTCGA CTACTACCGC CACACCGCCT TCGAATTCGT CACGGACCGC
CTCGGCGCGC AAGGCACCGT GCTCGGCGGC GGCCGCTATG ACGGCCTGAT GGAAGCGCTG
GGCGGCGCGG CAACCCCTGC GGTCGGCTGG GCGGCGGGGA TCGAGCGGCT GGCGATGCTG
GTTGGCGAAA AGGGCGAAGC CAGGACCGAT GTAATCGTGG TGGTCGAGGA CGACGCTCTT
CTCACCAGCG GCATCGAACA GGTCTCCCGC TTGCGCCGCG AGGGCGTTTC AGCGGAACTC
GTCGCTTCCG GTTCGGCGCG CAAGCGGTTC GACAAGGCGG TCAAGATGGG AGCGAAGGCC
ATCCTTGCGC TGGCCATGCG CGATGGCCAG CCGGCTGCAC GCTTTCGTGT CGAGGACGAC
GCGGCAACTG CGCTTCGCGG ACAGCTCGAA GCAGCAGTTG CAAAGTATGG CGCTCAGTGA
 
Protein sequence
MSTQTPQPIR GTQDIFGPDA EAFAFVVETF ERVRKLYRFR RVEMPVFEKT AVFSRSLGET 
TDVVSKEMYS FEDRGGESLT LRPEFTAGIA RAYLTDGWQQ YAPLKVATHG PLFRYERPQK
GRYRQFHQID AEIIGAGEPQ ADVELLVMAD QLLKELGIGA NDPGAVTLQL NTLGDGASRE
AWRAALVEYF RAHKAELSED SQDRLERNPL RILDSKDPRD KPFTADAPRI DDFLSAEAQD
FFGKVTSGLD AAGVEWTRAP ALVRGLDYYR HTAFEFVTDR LGAQGTVLGG GRYDGLMEAL
GGAATPAVGW AAGIERLAML VGEKGEARTD VIVVVEDDAL LTSGIEQVSR LRREGVSAEL
VASGSARKRF DKAVKMGAKA ILALAMRDGQ PAARFRVEDD AATALRGQLE AAVAKYGAQ