Gene Information |
Locus tag | BT9727_3357 |
Symbol | htrA |
ID | 2857007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | - |
Start bp | 3437563 |
End bp | 3438804 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637514777 |
Product | serine protease |
Protein accession | YP_037679 |
Protein GI | 49478290 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
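The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3437563
end_bp = 3438804
reported_gene_length = 1242     # bp
reported_protein_length = 413   # aa

# Assumption: 1-based inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)
```

Under these assumptions the computed values agree with the reported gene length and protein length, and the gene length is a whole number of codons.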
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.000249985 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGATATT ACGACGGACC AAATTTAAAT GAAGAGCATA GTGAAACGAG AGAAGTGAGA AAATCGGGCA GTAAAAAAGG CTATTTTTTC ACAGGTTTAG TCGGAGCTGT AGTTGGGGCT GTTTCAATTA GTTTTGCGGC ACCATATATG CCATGGGCTC AAAATAATGG AGCGGCTGTA TCATCTTTTA GTTCAGATTC AAAAGTTGAA GGTACTGTAG TTCCTGTTGT CAATAAAGCA AAAAATGAAA CGGATTTACC TGGTATGATT GAAGGCGCGA AAGATGTTGT TGTTGGTGTT ATTAATATGC AACAAAGCAT TGATCCATTT GCAATGCAAC CGACAGGCCA AGAACAACAA GCTGGTTCAG GATCAGGTGT TATTTATAAA AAAGCAGGAA ATAAAGCATA TATTGTAACG AACAATCATG TAGTAGATGG TGCAAATAAA CTTGCTGTAA AACTGAGCGA TGGCAAGAAG GTAGATGCAA AGCTTGTAGG GAAAGACCCT TGGTTAGATT TAGCTGTTGT TGAAATTGAT GGTGCTAATG TTAATAAAGT TGCCACTTTA GGTGATTCAA GTAAACTTCG TGCGGGTGAA AAAGCAATTG CAATCGGTAA CCCATTAGGA TTTGACGGAA GTGTAACGGA AGGTATTATT AGTAGTAAAG AACGTGAAAT TCCAGTAGAT ATCGATGGCG ATAAGCGTGC AGATTGGAAT GCTCAAGTTA TTCAAACAGA TGCAGCAATT AACCCTGGGA ACAGTGGTGG TGCGTTATTT AACCAAAACG GAGAAATAAT TGGGATTAAT TCAAGTAAAA TTGCACAACA AGAAGTTGAA GGAATTGGAT TTGCTATTCC AATTAATATC GCAAAACCAG TTATTGAATC ACTTGAAAAA GACGGAGTAG TGAAACGTCC AGCTCTTGGA GTAGGTGTCG TTTCATTAGA AGATGTGCAA GCTTATGCAG TAAATCAATT GAAAGTGCCA AAAGAAGTAA CAAACGGCGT TGTATTAGGT AAAATTTACC CAATATCACC TGCAGAAAAA GCTGGTTTAG AGCAATATGA TATTGTAGTA GCATTAGATA ATCAAAAAGT AGAAAACTCA CTTCAATTCC GTAAATATTT ATATGAGAAG AAAAAAGTAG GCGAGAAAGT GGAAGTTACA TTCTACCGTA ACGGTCAAAA AATGACGAAA ACAGCTACTT TAGCAGATAA CTCAGCTACA AAGAATCAAT AA
Protein sequence | MGYYDGPNLN EEHSETREVR KSGSKKGYFF TGLVGAVVGA VSISFAAPYM PWAQNNGAAV SSFSSDSKVE GTVVPVVNKA KNETDLPGMI EGAKDVVVGV INMQQSIDPF AMQPTGQEQQ AGSGSGVIYK KAGNKAYIVT NNHVVDGANK LAVKLSDGKK VDAKLVGKDP WLDLAVVEID GANVNKVATL GDSSKLRAGE KAIAIGNPLG FDGSVTEGII SSKEREIPVD IDGDKRADWN AQVIQTDAAI NPGNSGGALF NQNGEIIGIN SSKIAQQEVE GIGFAIPINI AKPVIESLEK DGVVKRPALG VGVVSLEDVQ AYAVNQLKVP KEVTNGVVLG KIYPISPAEK AGLEQYDIVV ALDNQKVENS LQFRKYLYEK KKVGEKVEVT FYRNGQKMTK TATLADNSAT KNQ
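The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons, and this gene starts with ATG). The `translate` function and its whitespace handling are hypothetical helpers written for this example.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
head = translate("ATGGGATATT ACGACGGACC AAATTTAAAT")
tail = translate("AAGAATCAAT AA")
print(head, tail)
```

The two fragments translate to the first ten residues (MGYYDGPNLN) and the last three residues (KNQ) of the protein sequence listed above. Passing the full gene sequence to `translate` works the same way.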