Gene BCG9842_B1607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B1607 
SymbolhtrA 
ID7184924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp3543014 
End bp3544255 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content37% 
IMG OID643551434 
Productserine protease HtrA 
Protein accessionYP_002447104 
Protein GI218898693 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000329826 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.000000000234039 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGATATT ACGACGGACC AAATTTAAAT GAAGAGCATA GTGAAACGAG AGAAGTGAGA 
AAATCAGGTA GTAAAAAAGG CTATTTCTTC ACAGGTTTAG TGGGAGCTGT AGTCGGAGCG
GTTTCGATTA GTTTTGCAGC ACCATATATG CCATGGGCTC AAAATAATGG AGCACCAGTA
TCATCATTTA GTTCGGATTC AAAAGTAGAA GGTACTGTAG TTCCTGTTGT AAATAAAGCG
AAAAATGAAA CTGATTTACC TGGTATGATT GAAGGAGCGA AAGATGTTGT TGTAGGCGTT
ATTAATATGC AACAAAGCGT TGATCCATTT GCAATGCAAC CGACAGGTCA AGAACAACAA
GCTGGTTCAG GATCAGGTGT TATTTATAAA AAAGCAGGAA ATAAAGCATA TATTGTAACA
AACAACCACG TAGTAGATGG AGCGAATAAA CTTGCTGTAA AGCTAAGTGA TGGTAAAAAG
GTAGATGCAA AGTTAGTAGG GAAAGATCCT TGGTTAGACT TAGCTGTTGT TGAAATTGAT
GGGGCTAATG TAAATAAAGT TGCAACTTTA GGTGACTCAA GTAAACTTCG TGCGGGTGAA
AAAGCGATTG CAATCGGTAA CCCACTTGGA TTTGACGGAA GTGTAACGGA AGGTATAATC
AGTAGTAAAG AACGCGAAAT CCCAGTTGAT ATTGATGGGG ATAAACGCCC AGATTGGCAA
GCACAAGTTA TTCAAACAGA TGCAGCGATT AATCCTGGTA ACAGTGGTGG TGCATTATTT
AACCAAAACG GTGAAATAAT TGGGATTAAT TCAAGTAAAA TTGCACAACA AGAAGTTGAA
GGAATTGGAT TTGCTATTCC AATTAATATC GCAAAGCCAG TTATTGAATC ACTTGAAAAA
GACGGAGTAG TAAAACGTCC AGCTCTTGGA GTAGGTGTCG TTTCGTTAGA AGATGTGCAA
GCTTATGCAG TCAATCAATT GAAAGTACCG AAAGAAGTAA CTAATGGTGT TGTATTAGGT
AAAATTTACC CAATATCACC GGCAGAAAAA GCTGGTTTAG AGCAATATGA TATTGTCGTA
GCATTAGATG ATCAAAAAGT AGAAAATTCA CTTCAATTCC GTAAATATTT ATATGAAAAG
AAAAAAGTAG GCGAGAAAGT AGAAGTCACA TTCTACCGTA ACGGTCAAAA AATGACGAAA
ACAGCTACTT TAGCAGATAA TTCAGCTACA AAGAATCAAT AA
 
Protein sequence
MGYYDGPNLN EEHSETREVR KSGSKKGYFF TGLVGAVVGA VSISFAAPYM PWAQNNGAPV 
SSFSSDSKVE GTVVPVVNKA KNETDLPGMI EGAKDVVVGV INMQQSVDPF AMQPTGQEQQ
AGSGSGVIYK KAGNKAYIVT NNHVVDGANK LAVKLSDGKK VDAKLVGKDP WLDLAVVEID
GANVNKVATL GDSSKLRAGE KAIAIGNPLG FDGSVTEGII SSKEREIPVD IDGDKRPDWQ
AQVIQTDAAI NPGNSGGALF NQNGEIIGIN SSKIAQQEVE GIGFAIPINI AKPVIESLEK
DGVVKRPALG VGVVSLEDVQ AYAVNQLKVP KEVTNGVVLG KIYPISPAEK AGLEQYDIVV
ALDDQKVENS LQFRKYLYEK KKVGEKVEVT FYRNGQKMTK TATLADNSAT KNQ