Gene BAS4298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4298 
SymbolhisS 
ID2850534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4209867 
End bp4211138 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content40% 
IMG OID637507534 
Producthistidyl-tRNA synthetase 
Protein accessionYP_030546 
Protein GI49187294 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.408041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTATTC AAATCCCACG CGGAACGCAA GATATTCTTC CAGGCACTGT TGAGTTATGG 
CAGTATATCG AAGGGCAAGC ACGCGAAATT TGCCGTCGTT ACAATTATAA AGAAATTCGT
ACACCGATCT TTGAACACAC TGAGTTATTT TTACGTGGTG TTGGTGATAC GACAGATATC
GTGCAAAAAG AAATGTACTC ATTCCAAGAT CGTGGAGAGC GTAGTTTAAC ATTACGTCCA
GAAGGCACTG CACCTGTTGT ACGTTCTTAC GTTGAAAACA AAATGTTCGG TGACGCAACA
CAACCAACGA AATTATATTA TATCGGTCAA ATGTTCCGTT ATGAAAGACC ACAAGCAGGT
CGCTATCGTC AATTCGTACA ATTCGGTATT GAAGCAATCG GTAGTAACGA TCCTGCAATT
GATGCGGAAG TAATTGCACT TGCTGTAGAG TTTTACCGCG GCATGGGCTT AAAAAATATT
AAAGTTGTAT TAAACAGCTT AGGTGATGCG GCGAGCCGTC AAGCGCACCG TGATGCGTTA
ATTGCACACT TTGAGCCACG TATCGGTGAG TTCTGTTCTG ACTGTCAATC TCGTTTAGAA
AAGAACCCTC TTCGTATTTT AGATTGTAAG AAGGACCGTA ACCATGAATT AATGGGAACA
GCACCATCTA TTACAGAATA CTTAAACGAA GATTCAGCAG TATACTACGA CAAAGTTCAA
GAACTATTAA CGATGATGGA TGTTCCATTT GAAAAAGATC CGAACTTAGT ACGTGGTTTA
GACTACTACC AGCACACTGT TTTTGAAATT ATGAGTGAAG CAGAAGGTTT CGGTGCGATC
ACTACACTAA GCGGTGGTGG CCGTTATAAC GGACTTGTAC AAGAAATCGG TGGACCAGAA
ATGCCAGGTA TCGGTTTTGC GATGAGTATT GAACGTTTAA TTATGGCGCT AAAAGCTGAA
AACATCGAAT TACCAATTGA ACATAGTATC GATTGTTACG TTGTAGCGCT TGGTGAAAAA
GCGAAAGACC ATGCTGCAAA AGTTGCGTTT GATCTTCGTA AAGCTGGATT AGCAGTTGAA
AAAGATTATT TAGATCGCAA AATGAAAGCA CAATTTAAAT CAGCAGATCG TCTAAAAGCG
AAATTCGTAG CTGTACTAGG GGAAGATGAG TTAGATAAAG GCATCATTAA CTTAAAAGAT
ATGGCAACAG GCGAACAAGA AGAAGTAGCA TTAGATGTGT TTGCTTCATA CGTAGCAGAG
AAATTAATAT AG
 
Protein sequence
MSIQIPRGTQ DILPGTVELW QYIEGQAREI CRRYNYKEIR TPIFEHTELF LRGVGDTTDI 
VQKEMYSFQD RGERSLTLRP EGTAPVVRSY VENKMFGDAT QPTKLYYIGQ MFRYERPQAG
RYRQFVQFGI EAIGSNDPAI DAEVIALAVE FYRGMGLKNI KVVLNSLGDA ASRQAHRDAL
IAHFEPRIGE FCSDCQSRLE KNPLRILDCK KDRNHELMGT APSITEYLNE DSAVYYDKVQ
ELLTMMDVPF EKDPNLVRGL DYYQHTVFEI MSEAEGFGAI TTLSGGGRYN GLVQEIGGPE
MPGIGFAMSI ERLIMALKAE NIELPIEHSI DCYVVALGEK AKDHAAKVAF DLRKAGLAVE
KDYLDRKMKA QFKSADRLKA KFVAVLGEDE LDKGIINLKD MATGEQEEVA LDVFASYVAE
KLI