Gene Cpha266_0364 details


Gene Information

Locus tag: Cpha266_0364
Symbol: hisS
ID: 4569342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 407766
End bp: 409055
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 50%
IMG OID: 639764962
Product: histidyl-tRNA synthetase
Protein accession: YP_910847
Protein GI: 119356203
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
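
The coordinate and length fields in this record are related by simple arithmetic, which a short Python sketch can check. It assumes the start and end coordinates are 1-based and inclusive and that the gene length includes one stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(407766, 409055)
print(length)                  # 1290
print(protein_length(length))  # 429
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.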


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00780967
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCAGT ATCAGGTTGT CAAGGGTGCC AGGGATATTT TTCCGGATGA AATAGTCCGC 
TGGCACTATG TCGAGGACGT TGTTCATCGT CTTGCGTCTC TTTATGGATA TAGTGAAATT
CGTACTCCTG TTTTTGAATA TACGGAGCTT TTTCAACGTA GTATCGGGAC TACAACGGAT
ATTGTGGGCA AGGAGATGTT TTCCTTTCTT CCGGATCCTC AGGGTCGATC CATTACCTTG
CGTCCAGAGA TGACTGCGGG AGTTATGCGG GCGGTTTTGC AGAAAAATCT GCTTTCGACG
GCACCGATAC ACAAGCTTTT TTATCTTTCA GAGCTTTTTC GAAAAGAGCG TCCGCAGGCC
GGACGCCAGC GACAGTTTTC ACAGTTTGGC GCCGAATTGC TCGGGGTTTC CTCGCCTGCT
GCCGTTGCCG AGGTCATAAC CTTTATGATG CAGGTGTTCG AAACTCTCGG AATACGAGGT
TTGAAGCTTC GGATCAATAC CCTTGGCGAC AGCAGCGATC GAGCAAGGTA TCGCGAAATA
CTCAGAGCGT ATCTCGCGCC TTTTTATGAC AGGCTTGATC TGGCATCCCG GGAGCGGTTT
GAAAAAAATC CTCTGAGAAT TCTCGATTCG AAAAATCCTG ATATGCAGGA GATCATTGAA
GGAGCTCCAA CGCTGCATGA TTCTCTTTCT CATGAAGCTT TGGAAGATTT TGAGAAAGTG
CGTTTCTATC TTGACAGTCG GAGTATAGCT TACGATATTG ACTATCGTCT TGTTCGCGGC
CTCGATTACT ACTGCCATAC CGCATTTGAG GTGACCAGTC CGGAACTTGG TGCACAGGAT
GCTATTGGCG GGGGAGGCAG ATATGACGGT CTTGCGAAAG AGTTGGGAAG TTCCGGAGAT
GTTCCTGCAT CAGGTTTTGC CGCAGGGATG GAAAGAGTGC TGATCACGAT GGAAAAGCAG
GGTTTATTCG CCGCCCTGCG TCCTTCTGGT CCGAAGGTCT ATGTTGTTGC CCAGCAGCAC
GCCCTGCTTG ACCATGCCTT GCAGGTGGCT TATCGTTTGA GGCGCGAGGG GATCAGCACT
GAAGTTGATC TTGCCGGAAG AAGCATGAAA GCCCAGATGA GAGATGCCAA CAGGATGCGC
GCCTGCTTTG CGCTTTTTAT CGGCGAAGAT GAGGTGGTTT CCGGCTCGTA TGCGCTGAAA
AATCTTGTTA CTGCCGACCA GACGGCACAA TCGATTGAAA CCATTATTGA AATGCTCAAT
CAATATTCGG GAGCGGAGCA GGGATCATGA
 
Protein sequence
MSQYQVVKGA RDIFPDEIVR WHYVEDVVHR LASLYGYSEI RTPVFEYTEL FQRSIGTTTD 
IVGKEMFSFL PDPQGRSITL RPEMTAGVMR AVLQKNLLST APIHKLFYLS ELFRKERPQA
GRQRQFSQFG AELLGVSSPA AVAEVITFMM QVFETLGIRG LKLRINTLGD SSDRARYREI
LRAYLAPFYD RLDLASRERF EKNPLRILDS KNPDMQEIIE GAPTLHDSLS HEALEDFEKV
RFYLDSRSIA YDIDYRLVRG LDYYCHTAFE VTSPELGAQD AIGGGGRYDG LAKELGSSGD
VPASGFAAGM ERVLITMEKQ GLFAALRPSG PKVYVVAQQH ALLDHALQVA YRLRREGIST
EVDLAGRSMK AQMRDANRMR ACFALFIGED EVVSGSYALK NLVTADQTAQ SIETIIEMLN
QYSGAEQGS
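
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal Python sketch, not the database's own tooling, that strips the spacing, computes GC fraction, and translates a coding sequence. It assumes the sense-codon assignments of translation table 11 match the standard code, and it ignores table 11's alternative start codons, which does not matter here because the gene begins with ATG. All names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGTCGCAGT ATCAGGTTGT CAAGGGTGCC"))  # MSQYQVVKGA
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the protein sequence and the GC content field of the record.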