Gene Acid345_1205 details

Gene Information

Locus tag: Acid345_1205
Symbol: hisS
ID: 4068546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1488985
End bp: 1490310
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 62%
IMG OID: 637983215
Product: histidyl-tRNA synthetase
Protein accession: YP_590282
Protein GI: 94968234
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
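
The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; this record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1488985, 1490310)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1326 441
```

Under these assumptions the result matches the listed gene length (1326 bp) and protein length (441 aa).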


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.105107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAGG CAGTTCGAGG AACTCGCGAC CTTCTTCCCC CCGACACGGA GCTATGGAAC 
CGCGTCGAGG CGACGGTGCG CGACGTGTTC CAGCGGTATA ACTTCCACGA GATTCGCACG
CCGATCTTCG AAGACACCGC GCTGTTTGCG CGCGGCGTGG GCGAAGAGAC GGACATCGTT
TCGAAAGAGA TGTTCACGTG GGAGGACAAG GCCCGCGCCC AGAGTGAGAA GTCGCAACAA
TTGACGCTGC GTCCGGAGAA CACGGCGGGC GTGGTGCGCG CGTACATCGA GCACCAGATG
GCGAAGTCGG GCGGGCTGCA GAAGCTGTAT TACATTGGGC CGCAGTTCCG GCGCGAGCGT
CCGCAGAAAG GGCGCTACCG GCAGTTCTTC CAGATCGGGG CGGAAGTGAT CGGGCCGCCG
CCGTCAGGGA GCGAATCTCC GGCGCGCGAT GCCGAAGTGA TCGAGATGCT GGCGACGCTG
CTGGAAGAAG TGGGGCTGAC GGGATGGACG CTGTATCTGA ATTCCGTGGG CGATGCGAAT
TGCCGTCCGG CGTACAACGA GGCGTTGCGG CAAGCGCTCG CGAGCGTGAA AGACAAGATG
TGCGGCGACT GCCAACGGCG CGCGGAGACC AATCCGCTGC GCGTGCTCGA CTGCAAGGTT
CCGGAAGACC AGCCGATTAT CGAGACACTG CCGAAGATCG GCGACTATCT CGACGATGCG
TGCAAGGCGC ACTTCGCGGC GGTGCGTGCG ATGCTCGACA AAGTTGGCGT GCCGTACACG
GTCAATCCGC GGATGGTGCG TGGACTCGAC TACTACACGC GCACAACGTT TGAATTCACC
CACGGCGAAC TCGGCGCGCA GAGTGCGGTG CTAGGCGGTG GTCGCTACGA CGGATTGAGC
GAAGCGCTCG ACGGGCCTAA AGCTCCGGGC ATCGGCTTCG CAATCGGTGA AGACCGGTTG
GTCATGGCGC TGCAGGCGGC GAAACCGGCG GAGCCGATGA CGATCGACGC CTATGTTGCA
CCGCTCGGAG CGGGCACGAA CGCGGAGGCG CTGGGCATTT GCCGCAAGTT GCGCCGCCAA
GGACTAAGAG TGGAGCTTGG CGATGAAAGC TTCCGGTTGA AGAAGAGCTT TGAGGCTGCT
GAACGTGCGG GCGCGAAGTA CATCGTCATC TGCGGCGAGA ACGAAGTGGC GAACAACGAG
TTCTCGGTGA AGGAACTCGC GACCGGCAAG CAAGAGAGCG TGGCACGCGC GGAACTGGCG
TTCTACATCC GCAACCACGG CGCCAGGAAG ACTGGCGGCG AAAACATTGA ACCCGACGAG
CAATAG
 
Protein sequence
MIKAVRGTRD LLPPDTELWN RVEATVRDVF QRYNFHEIRT PIFEDTALFA RGVGEETDIV 
SKEMFTWEDK ARAQSEKSQQ LTLRPENTAG VVRAYIEHQM AKSGGLQKLY YIGPQFRRER
PQKGRYRQFF QIGAEVIGPP PSGSESPARD AEVIEMLATL LEEVGLTGWT LYLNSVGDAN
CRPAYNEALR QALASVKDKM CGDCQRRAET NPLRVLDCKV PEDQPIIETL PKIGDYLDDA
CKAHFAAVRA MLDKVGVPYT VNPRMVRGLD YYTRTTFEFT HGELGAQSAV LGGGRYDGLS
EALDGPKAPG IGFAIGEDRL VMALQAAKPA EPMTIDAYVA PLGAGTNAEA LGICRKLRRQ
GLRVELGDES FRLKKSFEAA ERAGAKYIVI CGENEVANNE FSVKELATGK QESVARAELA
FYIRNHGARK TGGENIEPDE Q
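
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for removing that grouping and computing GC content follows. The helper names are hypothetical, and the short input string is an illustrative placeholder rather than the full gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Placeholder input laid out like the blocks on this page (assumption: not the full gene).
example = "ATGATCAAGG CAGTTCGAGG\nAACTCGCGAC"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the cleaned length should equal the listed gene length of 1326 bp, and the GC percentage can be compared against the listed GC content.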