Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B1607 |
Symbol | htrA |
ID | 7184924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3543014 |
End bp | 3544255 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643551434 |
Product | serine protease HtrA |
Protein accession | YP_002447104 |
Protein GI | 218898693 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000329826 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.000000000234039 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGATATT ACGACGGACC AAATTTAAAT GAAGAGCATA GTGAAACGAG AGAAGTGAGA AAATCAGGTA GTAAAAAAGG CTATTTCTTC ACAGGTTTAG TGGGAGCTGT AGTCGGAGCG GTTTCGATTA GTTTTGCAGC ACCATATATG CCATGGGCTC AAAATAATGG AGCACCAGTA TCATCATTTA GTTCGGATTC AAAAGTAGAA GGTACTGTAG TTCCTGTTGT AAATAAAGCG AAAAATGAAA CTGATTTACC TGGTATGATT GAAGGAGCGA AAGATGTTGT TGTAGGCGTT ATTAATATGC AACAAAGCGT TGATCCATTT GCAATGCAAC CGACAGGTCA AGAACAACAA GCTGGTTCAG GATCAGGTGT TATTTATAAA AAAGCAGGAA ATAAAGCATA TATTGTAACA AACAACCACG TAGTAGATGG AGCGAATAAA CTTGCTGTAA AGCTAAGTGA TGGTAAAAAG GTAGATGCAA AGTTAGTAGG GAAAGATCCT TGGTTAGACT TAGCTGTTGT TGAAATTGAT GGGGCTAATG TAAATAAAGT TGCAACTTTA GGTGACTCAA GTAAACTTCG TGCGGGTGAA AAAGCGATTG CAATCGGTAA CCCACTTGGA TTTGACGGAA GTGTAACGGA AGGTATAATC AGTAGTAAAG AACGCGAAAT CCCAGTTGAT ATTGATGGGG ATAAACGCCC AGATTGGCAA GCACAAGTTA TTCAAACAGA TGCAGCGATT AATCCTGGTA ACAGTGGTGG TGCATTATTT AACCAAAACG GTGAAATAAT TGGGATTAAT TCAAGTAAAA TTGCACAACA AGAAGTTGAA GGAATTGGAT TTGCTATTCC AATTAATATC GCAAAGCCAG TTATTGAATC ACTTGAAAAA GACGGAGTAG TAAAACGTCC AGCTCTTGGA GTAGGTGTCG TTTCGTTAGA AGATGTGCAA GCTTATGCAG TCAATCAATT GAAAGTACCG AAAGAAGTAA CTAATGGTGT TGTATTAGGT AAAATTTACC CAATATCACC GGCAGAAAAA GCTGGTTTAG AGCAATATGA TATTGTCGTA GCATTAGATG ATCAAAAAGT AGAAAATTCA CTTCAATTCC GTAAATATTT ATATGAAAAG AAAAAAGTAG GCGAGAAAGT AGAAGTCACA TTCTACCGTA ACGGTCAAAA AATGACGAAA ACAGCTACTT TAGCAGATAA TTCAGCTACA AAGAATCAAT AA
|
Protein sequence | MGYYDGPNLN EEHSETREVR KSGSKKGYFF TGLVGAVVGA VSISFAAPYM PWAQNNGAPV SSFSSDSKVE GTVVPVVNKA KNETDLPGMI EGAKDVVVGV INMQQSVDPF AMQPTGQEQQ AGSGSGVIYK KAGNKAYIVT NNHVVDGANK LAVKLSDGKK VDAKLVGKDP WLDLAVVEID GANVNKVATL GDSSKLRAGE KAIAIGNPLG FDGSVTEGII SSKEREIPVD IDGDKRPDWQ AQVIQTDAAI NPGNSGGALF NQNGEIIGIN SSKIAQQEVE GIGFAIPINI AKPVIESLEK DGVVKRPALG VGVVSLEDVQ AYAVNQLKVP KEVTNGVVLG KIYPISPAEK AGLEQYDIVV ALDDQKVENS LQFRKYLYEK KKVGEKVEVT FYRNGQKMTK TATLADNSAT KNQ
|
| |