Gene BCZK4146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK4146 
SymbolhisS 
ID3024351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp4252939 
End bp4254210 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content40% 
IMG OID637548360 
Producthistidyl-tRNA synthetase 
Protein accessionYP_085725 
Protein GI52141104 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0483726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTATTC AAATCCCACG CGGAACGCAA GATATTCTTC CAGGCACTGT TGAGTTATGG 
CAGTATATCG AAGGGCAAGC ACGCGAAATT TGCCGTCGTT ACAATTATAA AGAAATTCGT
ACACCAATTT TTGAACACAC TGAGCTATTT TTACGTGGTG TTGGTGATAC GACAGATATC
GTGCAAAAAG AAATGTACTC ATTCCAAGAT CGTGGAGAGC GTAGTTTAAC ATTACGTCCA
GAAGGCACGG CACCTGTTGT ACGTTCTTAC GTTGAAAACA AAATGTTCGG TGACGCAACA
CAACCAACGA AATTATATTA TATCGGACAA ATGTTCCGTT ACGAAAGACC ACAAGCAGGT
CGCTATCGTC AATTCGTACA ATTCGGTATT GAAGCAATCG GTAGTAACGA TCCTGCAATT
GATGCGGAAG TAATTGCACT TGCTGTAGAG TTTTACCGAG GCATGGGCTT AAAAAATATT
AAAGTTGTAT TAAACAGCTT AGGTGATGCA GCGAGCCGTC AAGCGCACCG TGATGCATTA
ATCGCACACT TTGAGCCACG TATCGGTGAG TTCTGTTCTG ACTGTCAATC TCGTTTAGAA
AAGAACCCTC TTCGTATTTT AGATTGTAAG AAGGACCGTA ACCATGAATT AATGGGAACA
GCACCATCTA TTACAGAATA CTTAAACGAA GATTCAGCAG TATACTACGA CAAAGTTCAA
GAACTATTAA CGATGATGGA TGTTCCATTT GAAAAAGATC CGAACTTAGT ACGTGGTTTA
GACTACTACC AGCACACTGT TTTTGAAATT ATGAGTGAAG CAGAAGGTTT CGGTGCGATC
ACTACACTAA GCGGTGGTGG CCGTTATAAC GGACTTGTAC AAGAAATCGG TGGACCAGAA
ATGCCAGGTA TCGGTTTTGC GATGAGTATT GAACGTTTAA TTATGGCGCT AAAAGCTGAA
AACATCGAAT TACCAATTGA ACATAGTATC GATTGCTATG TTGTAGCGCT TGGTGAAAAA
GCAAAAGACC ATGCTGCAAA AGTTGCGTTT GATCTTCGGA AAGCTGGATT AGCAGTTGAA
AAAGATTATT TAGATCGCAA AATGAAAGCA CAATTTAAAT CAGCAGATCG TCTAAAAGCG
AAATTCGTAG CTGTACTAGG GGAAGATGAG CTAGATAAAG GCATCATTAA CTTAAAAGAT
ATGGCAACAG GCGAACAAGA AGAAGTAGCA TTAGATGTGT TTGCTTCATA CGTAGCAGAG
AAATTAATAT AG
 
Protein sequence
MSIQIPRGTQ DILPGTVELW QYIEGQAREI CRRYNYKEIR TPIFEHTELF LRGVGDTTDI 
VQKEMYSFQD RGERSLTLRP EGTAPVVRSY VENKMFGDAT QPTKLYYIGQ MFRYERPQAG
RYRQFVQFGI EAIGSNDPAI DAEVIALAVE FYRGMGLKNI KVVLNSLGDA ASRQAHRDAL
IAHFEPRIGE FCSDCQSRLE KNPLRILDCK KDRNHELMGT APSITEYLNE DSAVYYDKVQ
ELLTMMDVPF EKDPNLVRGL DYYQHTVFEI MSEAEGFGAI TTLSGGGRYN GLVQEIGGPE
MPGIGFAMSI ERLIMALKAE NIELPIEHSI DCYVVALGEK AKDHAAKVAF DLRKAGLAVE
KDYLDRKMKA QFKSADRLKA KFVAVLGEDE LDKGIINLKD MATGEQEEVA LDVFASYVAE
KLI