Gene Cag_1467 details


Gene Information

Locus tag: Cag_1467
Symbol: hisS
ID: 3746436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1932884
End bp: 1934161
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 48%
IMG OID: 637774001
Product: histidyl-tRNA synthetase
Protein accession: YP_379766
Protein GI: 78189428
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
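
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS of this length, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1932884, 1934161)
print(length, protein_length_aa(length))
```

Under those assumptions, the listed coordinates, gene length, and protein length agree with one another.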


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.162943
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAGTT TTCAGTGCGT AAAAGGTACT CGCGATATTT TGCCCGACGA GAGCCTTCTT 
TGGTCGTTTG TGTCATCCCA TTTTCATCAT GTAGCGTCGC TTTACGGATT TCGTGAAATT
CGAACGCCAA TGTTTGAATA CACCGATCTG TTCCAGCGAG GTATTGGTGC CACCACCGAT
ATTGTGGGCA AGGAGATGTT TTCATTTCAG CCCGATCCAG CAGGGCGCTC TATAACGCTT
CGTCCTGAAA TGACGGCAGG CGTTATGCGT GCTGCTCTGC AAAACAATTT GTTGGCACAA
GCTCCTCTCC ACAAGCTTTT TTATATAGGG GAACTGTTTC GCAAAGAGCG TCCACAAGCA
GGACGCCAAC GGCAGTTTAA CCAATGTGGC GCTGAGCTGC TTGGTGTTTC ATCGCCTGCG
GCAGTAGCTG AGGTGATGTC GCTGATGATG CACTTTTTTG GCGCACTTGG CTTAACGGGT
TTAACGCTCA AGGTTAATAC GCTTGGCAAT GCCGAAGAGC GACTTGCTTA TCGTGAAGCC
TTGCAAGCCT ACTTTGCACC TCATCGCGCA ATGCTTGATG CATCATCGCA AGAGCGGCTC
GAAAAAAATC CTTTGCGTAT TCTTGATTCT AAAAATCCTG CTTTACAAGA GCTGATTGCG
GCTGCTCCTC GTTTGTACGA TTATTTGCAA GAGGCGTCGT TGCGTGATTT TGAAAAGGTG
CTTTTTTATT TAACCGAGCG AAGAATTTCT TACACGATTG ATTACCGCTT AGTGCGCGGT
CTTGATTATT ACTGCCATAC TGCGTTTGAA GTTACCAGCA ATGAGCTTGG TGCACAAGAT
GCCATTGGCG GTGGTGGTCG TTACGATGCG TTAGCGCGTG AGCTTGGCAG TGCAACTGAT
ATTCCAGCCG TTGGTTTTGC TGTTGGCATG GAGCGGTTGC TTATTGTGTT GGAAAAGCAA
GGATTGCTGG GTAATCGCCA TGCGCGTCCA CCTCGCTTGT ATGTGGTGGT TCAGCAGCAA
GAGATGCTCG ATCACGCTTT GCAGCTTGTG TGGCGTTTGC GCAACGGTGG TATTCGTAGT
GAGCTTGATT TAGCGGGACG TAGCATGAAA GCGCAAATGC GTGAAGCCAA TAAGCTTGGC
GCTCTGTATG CGCTTTTTGT AGGTGCTTCG GAATGTGCAA GTGGCAAATA TGGCTTAAAA
AATCTTGCAA CATCGGAGCA AACCGATCTC TCCATAGAGG CAGTTATGCA GTTGCTGCAC
GATCATGTAA CCGAGTAA
 
Protein sequence
MSSFQCVKGT RDILPDESLL WSFVSSHFHH VASLYGFREI RTPMFEYTDL FQRGIGATTD 
IVGKEMFSFQ PDPAGRSITL RPEMTAGVMR AALQNNLLAQ APLHKLFYIG ELFRKERPQA
GRQRQFNQCG AELLGVSSPA AVAEVMSLMM HFFGALGLTG LTLKVNTLGN AEERLAYREA
LQAYFAPHRA MLDASSQERL EKNPLRILDS KNPALQELIA AAPRLYDYLQ EASLRDFEKV
LFYLTERRIS YTIDYRLVRG LDYYCHTAFE VTSNELGAQD AIGGGGRYDA LARELGSATD
IPAVGFAVGM ERLLIVLEKQ GLLGNRHARP PRLYVVVQQQ EMLDHALQLV WRLRNGGIRS
ELDLAGRSMK AQMREANKLG ALYALFVGAS ECASGKYGLK NLATSEQTDL SIEAVMQLLH
DHVTE
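
The gene and protein sequences above are related by translation table 11, which assigns the same amino acids to codons as the standard genetic code (it differs in which codons may act as starts). The following is a minimal sketch of how a reader might re-derive the protein and the GC percentage from the nucleotide text as printed, including its space-separated blocks of ten. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon-to-residue assignments of the standard code, in TCAG order.
# Translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a sequence block and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last stretches of the gene sequence listed above.
print(translate("ATGTCAAGTT TTCAGTGCGT AAAAGGTACT"))
print(translate("GATCATGTAA CCGAGTAA"))
```

Passing the full gene sequence to `translate` and `gc_percent` should allow a comparison against the listed protein sequence and the reported GC content.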