Gene Cphamn1_0397 details


Gene Information

Locus tag: Cphamn1_0397
Symbol: hisS
ID: 6374059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 421919
End bp: 423223
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 50%
IMG OID: 642682914
Product: histidyl-tRNA synthetase
Protein accession: YP_001958843
Protein GI: 189499373
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
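
The start, end, and length fields in this record agree with one another if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the record above.
start_bp = 421919
end_bp = 423223

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not
# counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1305 434
```

Both results match the Gene Length and Protein Length fields listed in the record.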


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00425435
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.748396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGAGT ACAGGGCGGT AAAGGGAACA AAAGATATCT TTCCTGATGA GATCACCTCA 
TGGAAATATA TTGAGGGTGT CATTCACAGA GTTGTCGGGC TCTATGGCTT TCAGGAAATC
CGCACACCCG TATTTGAATA TACAGATCTG TTTCAACGCA GTATCGGCTC AACAACCGAC
ATTGTGGGCA AAGAGATGTT TTCCTTCAGA CCTGAGCCTG ACGGCCGTTC GGTGACTTTA
CGTCCCGAAA TGACGGCTGG TGTCATGCGT GCGTTTCTCC AGGCGAATCT TTCTTCTGCT
TCTCCGGTTC ATAAGCTGTA TTACATAGCA GAACTGTTTC GAAAGGAACG CCCTCAGGCA
GGGCGCCAGC GTCAGTTTTC ACAATTCGGG GCTGAAATGC TGGGAGCTTC CTCCCCTGAG
GCTGTCGCTG AAGTGATAGA TATGATGATG CAGGTGTTTA CCTCTCTGGG GGTATCCGGG
CTCAGGCTGA GGATTAACAC GCTTGGTGAT CTGGATGATC GGGTTCGATA CAGAGATGCA
TTGCGAGCCT ATCTTGAACC CCATAGCGGG CTTCTTGACG CGCCGTCAAG AGAGCGTCTT
GAAAAAAACC CTCTTCGTAT TCTGGATTCA AAAAATCCCG ATATACAGTC AGTCATTGCC
GATGCTCCGA AACTGCATGA TTTTCTCAAT CCTTCTGCAA GAGCGGAGTT TGATCAGGTC
TTGCTCTATC TCGATCAGAA ATCCATAGAG TATGTTATCG ATCCTTTGCT TGTCAGGGGA
TTGGATTATT ACTGTCATAC AGCGTTTGAA GTTGTCAGCC CTGAGCTTGG AGCACAGGAT
GCAATTGGAG GGGGCGGTCG TTATGACGGT CTTGCAAGAG AACTTGGCAG TAAATCCGAT
ATTCCTGCTG TCGGTTTTGC CGTTGGTATG GAGCGGTTAT TGATTACCAT GGAAAAGCAG
GGATTGCTTC GGCATATCGT GCCGTCAGGT CCCCGGGTCT ATATTGTACT CCAGAATGAG
GAGCTGAAAA CCCATGCTCT CTCTGCCTGT GACCTGTTGC GAAGATCAGG GATACGAACT
GAAATGGATC TTTGCGGAAG GAGCATGAAG GCGCAGATGC GCGAGGCCAA CAGGCAGCAT
GCCGACTATG CTCTGTTTGT AGGGAAGAGC GAGGTGGAGT CGCAAGCCTA TGGGTTAAAA
AATCTCAGGA CATCCGAACA GGATTTTCTC TCCATCCGGG AGATGATCGC AAGGCTTGCT
TCATCAACGA AGCACGTTGA AGTCCCGGAT GGCGGCCCCG ATTGA
 
Protein sequence
MSEYRAVKGT KDIFPDEITS WKYIEGVIHR VVGLYGFQEI RTPVFEYTDL FQRSIGSTTD 
IVGKEMFSFR PEPDGRSVTL RPEMTAGVMR AFLQANLSSA SPVHKLYYIA ELFRKERPQA
GRQRQFSQFG AEMLGASSPE AVAEVIDMMM QVFTSLGVSG LRLRINTLGD LDDRVRYRDA
LRAYLEPHSG LLDAPSRERL EKNPLRILDS KNPDIQSVIA DAPKLHDFLN PSARAEFDQV
LLYLDQKSIE YVIDPLLVRG LDYYCHTAFE VVSPELGAQD AIGGGGRYDG LARELGSKSD
IPAVGFAVGM ERLLITMEKQ GLLRHIVPSG PRVYIVLQNE ELKTHALSAC DLLRRSGIRT
EMDLCGRSMK AQMREANRQH ADYALFVGKS EVESQAYGLK NLRTSEQDFL SIREMIARLA
SSTKHVEVPD GGPD
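
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The sketch below is one possible way to do that in plain Python, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper name `translate` and the two sample lines are illustrative; the samples are the first and last lines of the gene sequence, copied with their spacing.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G). The amino-acid
# string is shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    Codons with ambiguous bases raise KeyError in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# Each full line of the record holds 60 bases (20 codons), so every
# line starts in frame and can be translated on its own.
first_line = "ATGTCGGAGT ACAGGGCGGT AAAGGGAACA AAAGATATCT TTCCTGATGA GATCACCTCA"
last_line = "TCATCAACGA AGCACGTTGA AGTCCCGGAT GGCGGCCCCG ATTGA"

print(translate(first_line))  # MSEYRAVKGTKDIFPDEITS
print(translate(last_line))   # SSTKHVEVPDGGPD
```

The two printed fragments correspond to the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` would apply the same check to the whole record.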