Gene BCAH820_3351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_3351 
SymbolhisS1 
ID7189902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp3198121 
End bp3199401 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content38% 
IMG OID643556762 
Producthistidyl-tRNA synthetase 
Protein accessionYP_002452301 
Protein GI218904467 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones205 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAAA TGAGAAATGT AAAAGGAACG AAAGACTATT TACCAGAAGA ACAAGTGCTG 
CGAAATAAAA TTAAAAGAGC GTGCGAGGAT ACGTTTGAAC GTTATGGCTG TAAACCGTTA
GAAACGCCGA CATTAAATAT GTATGAGCTT ATGTCTTACA AGTACGGAGG TGGCGATGAA
ATATTAAAAG AAATATATAC GCTTCAAGAT CAAGGAAAAC GCGCACTTGC CTTACGTTAC
GATTTGACTA TTCCATTCGC AAAAGTTGTG GCAATGAATC CGAACATCCG CCTTCCTTTT
AAACGGTATG AAATTGGGAA AGTATTTCGA GATGGCCCTA TTAAACAAGG GAGATTTCGT
GAATTCATAC AATGTGACGT TGATATTGTC GGTGTTGAGT CTGTCATGGC AGAAGCTGAA
CTTATGAGCA TGGCGTTTGA ACTGTTCCGA ACGTTAAACT TAGAAGTAAC GATCCAATAT
AATAACCGAA AATTGTTAAA CGGTATTCTT CAGGCCATTA ACATCCCTAC TGAGTTAACG
AGTGACGTCA TTTTATCATT AGACAAAATC GAAAAAATTG GGATTGATGG TGTAAGAAAA
GATGTATTAG AGCGCGGAAT TACTGAAGAA ATGGCTGATA CGATATGTAA TACTGTTTTA
TCTTGTCTAA AGCTTACAAT TGCTGACTTT AAAGAAGCTT TCAATAATCC ACTCGTTGCC
GATGGAGTAA ACGAATTACA ACAATTACAG CAATATTTAA TCGCTCTTGG AATAAATGAA
AATGCTATAT TCAACCCATT TTTAGCACGA GGACTCACAA TGTATACAGG TACCGTATAT
GAAATCTTTT TAAAAGATGG CTCGATTACA TCTAGCATCG GTAGCGGTGG TCGTTACGAT
AATATTATTG GCGCATTCCG CGGTGATAAT ATGAACTATC CAACAGTCGG TATTTCATTC
GGCTTAGACG TTATTTATAC AGCACTATCA CAGAAAGAAA CGATATCATC TACAGCGGAT
GTATTTATCA TCCCGCTTGG GACAGAATTA CAATGCTTAC AACTTGCCCA GCAATTACGT
TCTACCACTT CCTTAAAAGT CGAACTTGAA CTAGCAGGAC GCAAATTAAA ACGCGCCCTT
AATTATGCAA ATAAAGAGAA TATCCCTTAT GTGCTTATTA TTGGGGAAGA AGAACTTAGT
ACAGAAACCG CTATGCTGCG GAATATGAAG GAAGGTAGTG AGGTGAAGGT TCCGCTTTCT
TCTTTAAGTA ATTATTTGTA A
 
Protein sequence
MMEMRNVKGT KDYLPEEQVL RNKIKRACED TFERYGCKPL ETPTLNMYEL MSYKYGGGDE 
ILKEIYTLQD QGKRALALRY DLTIPFAKVV AMNPNIRLPF KRYEIGKVFR DGPIKQGRFR
EFIQCDVDIV GVESVMAEAE LMSMAFELFR TLNLEVTIQY NNRKLLNGIL QAINIPTELT
SDVILSLDKI EKIGIDGVRK DVLERGITEE MADTICNTVL SCLKLTIADF KEAFNNPLVA
DGVNELQQLQ QYLIALGINE NAIFNPFLAR GLTMYTGTVY EIFLKDGSIT SSIGSGGRYD
NIIGAFRGDN MNYPTVGISF GLDVIYTALS QKETISSTAD VFIIPLGTEL QCLQLAQQLR
STTSLKVELE LAGRKLKRAL NYANKENIPY VLIIGEEELS TETAMLRNMK EGSEVKVPLS
SLSNYL