Gene Clim_0295 details


Gene Information

Locus tag: Clim_0295
Symbol: hisS
ID: 6353812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 331922
End bp: 333214
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 53%
IMG OID: 642667924
Product: histidyl-tRNA synthetase
Protein accession: YP_001942368
Protein GI: 189345839
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
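
The coordinate and length fields in this record are related by simple arithmetic, which makes for a quick consistency check. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 331922, End bp 333214.
length = gene_length(331922, 333214)
print(length, protein_length(length))  # 1293 430
```

Both results agree with the Gene Length (1293 bp) and Protein Length (430 aa) fields listed in the record.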


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000190773
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCACT ATCAGGTAGT CAAAGGCGCG AGAGATATTT TCCCCGATGA AATATTGCAA 
TGGAAACATG TCGAAGGGGT TATTCACAGG CTTGCCGCAC TATACGGTTT CAATGAAATC
AGAACCCCGG TTTTTGAATA CACCGAGCTT TTTCAGCGCA GTATCGGTTC CTCCACCGAT
ATTGTCGGTA AGGAGATGTT CTCTTTTCTG CCTGACCCCT CCGGGCGCTC GATCACCCTT
CGTCCTGAAA TGACCGCAGG CGTTATGAGG GCTGCATTGC AGAAAAATCT GCTTGCGGCT
GCTCCGGTGC AGAAGCTCTA CTATATCAGC GAACTTTTCC GCAAGGAACG GCCTCAGGCC
GGCCGGCAAA GGCAGTTCTC CCAGTTTGGA GCGGAACTGC TCGGGGTCTC TTCTCCGGCA
GCCGTAGCTG AAGTGATAAC GTTCATGATG CATGTTTTCG AAGATCTTGG TCTGCAGGGG
CTGAAGCTCA GGATCAATAC TCTCGGCAAT ATGGACGACC GCAAACGCTA TCGGGATGCT
CTCCGGAATT ATCTGGCGCC ATGTTATGAG CAGCTCGATG ATGCATCGAA AGAGCGCTTC
GAGAAAAATC CGCTACGGAT ACTCGACTCG AAAAATCCGG AAATGCAGCA GATCGTGAAG
GACGCTCCGA AACTGTACGA TTATCTGGGG CGCGAGGCTC TGGATGATTT TGAAAAAGTG
CTTTTTTATC TTTCAGCCCG GGGGATACCC TTTCAGATCG ATCACAGACT GGTTCGGGGT
CTCGACTATT ACAGCTACAC AGCATTCGAA GTGACCAGTT CGGCGCTTGG TGCACAGGAC
GCTCTTGGCG GCGGGGGACG CTATGACTCG CTTGCCGTCG AGCTGGGGAG TTCCGGTGAA
GTGCCTGCAT CCGGTTTTGC CGTCGGGATG GAACGACTCC TGATCGCCAT GCAGAAACAG
GGTTTGTTTT CAGATCTCGA TGCTGCGGCG CCATCTGTTT TTGTTATCGT TCAGCAGGAG
GAGCTTTTCG ATCAGGCGCT TGAGATAGTC ACCACTCTTC GCCGGGCGGG TATCAGTGCG
GTGATCGATC TTGCCGGGCG AAGCATGAAA GCCCAATTGC GGGAAGCGAA CAGGATGAAT
GCTGCAAACG CTCTTTTTGT AGGCAGCGAT GAGCTCGCAT CGGGAAAATG CACGATGAAA
GATCTCCGGT CGTCACTGCA GGATGAGTAT TTCCTTGAAG AGATAATCGA CAAGTTCCGG
AAGCCCGAAC CGCTTAACCG GTTACGTTCA TGA
 
Protein sequence
MPHYQVVKGA RDIFPDEILQ WKHVEGVIHR LAALYGFNEI RTPVFEYTEL FQRSIGSSTD 
IVGKEMFSFL PDPSGRSITL RPEMTAGVMR AALQKNLLAA APVQKLYYIS ELFRKERPQA
GRQRQFSQFG AELLGVSSPA AVAEVITFMM HVFEDLGLQG LKLRINTLGN MDDRKRYRDA
LRNYLAPCYE QLDDASKERF EKNPLRILDS KNPEMQQIVK DAPKLYDYLG REALDDFEKV
LFYLSARGIP FQIDHRLVRG LDYYSYTAFE VTSSALGAQD ALGGGGRYDS LAVELGSSGE
VPASGFAVGM ERLLIAMQKQ GLFSDLDAAA PSVFVIVQQE ELFDQALEIV TTLRRAGISA
VIDLAGRSMK AQLREANRMN AANALFVGSD ELASGKCTMK DLRSSLQDEY FLEEIIDKFR
KPEPLNRLRS
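
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction helper for nucleotide input; the helper names are hypothetical, and the demo uses only the first printed line of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks across lines."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short demo input.
first_line = "ATGCCGCACT ATCAGGTAGT CAAAGGCGCG AGAGATATTT TCCCCGATGA AATATTGCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.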