Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA2343 |
Symbol | htrA |
ID | 3105048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 2534666 |
End bp | 2536063 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637171486 |
Product | protease DO |
Protein accession | YP_114759 |
Protein GI | 53803452 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family [TIGR02038] periplasmic serine pepetdase DegS |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGACCCTC TTGCTATGCG GTTTAGAAAG ATAATCGATA GAGGTAATTC CATGCGCATG AAATCCATCG GCGCGCTGTT GCTACTGACC GCCAGCGGCC TGTCCCTCTG CCCCCCGGTG TGGGCCGATT TGCCCGCCAG CGTCAACGGT CTGCCACTGC CCAGCCTCGC GCCCGTCTTG AAGAAGGCCA TGCCCGCGGT GGTCAACATC TCGACGAAGA CCCAGATCGA AATCGCCGAG AATCCCCTGA TGCAAGACCC CTTCTTCCGA CATTTCTTCG GTATTCCGAA TCAGCCGCGG CGCCGTGAGA GCTCCAGCCT CGGCTCCGGG GTGATCGTCG ACGCCCGCCG AGGCTACATC CTCACCAACA ACCATGTGAT CGACAAGGCG GACGAGATAA GCGTGACCTT GAGAGACGGA CGCCAGTTGA GCGCAAAACT GGTCGGCGCC GATCCGGAGT CGGATTTGGC CGTCATCAAG GTCGAACCCA AGAATCTGAC GGAACTGCCC ATCGGTGATT CGAGTCAGCT CGAAGTCGGC GACTTCGTGG TAGCCATCGG CAATCCCTTT GGCCTGGGGC AGACAGTGAC CTCCGGCATC GTCAGTGCCC TGGGGCGATC CGGACTCGGC ATTGAGGGGT ACGAGGATTT CATCCAGACC GACGCCTCGA TCAACCCCGG CAACTCGGGA GGCGCCTTGA TCAATCTCCG TGGCGAGCTG GTCGGCGTGA ATACGGCCAT CATCGCTCCC ACCGGCGGAA ACGTGGGCAT CGGTTTCGCC ATCCCGTCCA ACATGGCCGC CAGCATCATG ACCCAACTGG TCGAAAAGGG CGAAATCCGC CGCGGCCAGA TCGGCATCAC CATCCAAGAC CTGACGCCGG ATCTCGCTCA GGCCTTTGGC CTGAAGCAGA GCCAGGGCGC GGTGATCACC GGCGTCCAAA AGGATTCCCC GGCCGCATCT TCGGGCCTGG AAGCCGGCGA CGTCGTCGTC AGCGTCAATG ACCGCCCGGT CAAAAACAGC GCGGACGTCC GCAACACCAT CGGCCTCCTG CCCATAGGCG AAGAAGTCCG GGTCGAAGTG ATGCACAAGG GGGAGAGAGT GGTACGCGAG GTGGTGATCC GCGCCCCCAA ACTGGTCCAG GAAGAGGGCA ATAAAATCCA TCCCAGACTG TCCGGTGTCA TACTTAAGAA CAACGAGGAG GGCGGTGTCC AGGTGGAAAA AATCCACACG AGTTCTTACG CCTTCCAGGC CGGCCTGCGC CCCGGCGACG TGATCGTGAT GGCGAACCGC GAGGAAATCG AAACGCTCGA TGACCTGAAG CGCGCCACCA AGGGCCGCTC GGAGCTGCTC CTCAGCGTCC AGCGGGGCAG CGGCTCGTTC TTCTTGATGC TGAAGTAG
|
Protein sequence | MDPLAMRFRK IIDRGNSMRM KSIGALLLLT ASGLSLCPPV WADLPASVNG LPLPSLAPVL KKAMPAVVNI STKTQIEIAE NPLMQDPFFR HFFGIPNQPR RRESSSLGSG VIVDARRGYI LTNNHVIDKA DEISVTLRDG RQLSAKLVGA DPESDLAVIK VEPKNLTELP IGDSSQLEVG DFVVAIGNPF GLGQTVTSGI VSALGRSGLG IEGYEDFIQT DASINPGNSG GALINLRGEL VGVNTAIIAP TGGNVGIGFA IPSNMAASIM TQLVEKGEIR RGQIGITIQD LTPDLAQAFG LKQSQGAVIT GVQKDSPAAS SGLEAGDVVV SVNDRPVKNS ADVRNTIGLL PIGEEVRVEV MHKGERVVRE VVIRAPKLVQ EEGNKIHPRL SGVILKNNEE GGVQVEKIHT SSYAFQAGLR PGDVIVMANR EEIETLDDLK RATKGRSELL LSVQRGSGSF FLMLK
|
| |