Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TBFG_11247 |
Symbol | |
ID | 5221924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 1369641 |
End bp | 1371227 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640606001 |
Product | serine protease htrA |
Protein accession | YP_001287192 |
Protein GI | 148822438 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 257 |
Plasmid unclonability p-value | 0.039312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 167 |
Fosmid unclonability p-value | 0.00292251 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATACTA GGGTGGACAC GGACAACGCG ATGCCTGCAC GTTTTAGCGC CCAGATTCAG AATGAGGATG AGGTGACCTC CGACCAAGGC AACAACGGCG GCCCGAACGG CGGAGGCCGC CTGGCGCCGC GCCCGGTTTT TCGGCCACCG GTCGACCCGG CGTCGCGTCA AGCGTTCGGG CGTCCGTCCG GGGTCCAAGG GTCCTTTGTG GCCGAGCGTG TGCGCCCGCA GAAGTACCAG GACCAGTCTG ACTTCACACC GAACGATCAG CTTGCTGACC CGGTGCTTCA GGAGGCGTTC GGTCGTCCGT TCGCGGGCGC CGAATCGCTG CAGCGCCATC CCATCGATGC CGGAGCGCTG GCAGCTGAGA AAGACGGTGC CGGCCCCGAC GAGCCCGACG ATCCGTGGCG CGACCCCGCG GCCGCGGCCG CGCTGGGGAC GCCAGCGCTA GCCGCGCCGG CACCGCACGG TGCGCTGGCC GGCAGCGGCA AGCTGGGTGT GCGCGACGTG CTGTTTGGCG GCAAGGTGTC CTACTTGGCG CTGGGCATCT TGGTCGCTAT CGCACTGGTG ATCGGCGGCA TCGGCGGTGT CATCGGCCGC AAGACCGCGG AAGTAGTCGA TGCGTTCACC ACGTCGAAGG TGACCCTGTC GACCACTGGC AATGCCCAGG AACCGGCCGG CCGGTTCACC AAGGTGGCGG CCGCCGTGGC CGATTCGGTG GTGACCATTG AGTCGGTCAG CGACCAGGAG GGCATGCAAG GTTCCGGCGT CATCGTCGAT GGCCGCGGCT ACATCGTCAC CAACAATCAC GTGATCTCTG AGGCGGCCAA CAATCCCAGC CAGTTCAAGA CGACCGTGGT GTTCAACGAC GGCAAGGAGG TGCCCGCCAA TCTGGTGGGT CGTGACCCCA AGACCGACTT GGCCGTCCTC AAGGTCGACA ACGTCGACAA TCTGACCGTG GCCCGGCTCG GTGATTCCAG CAAGGTACGG GTCGGTGACG AAGTCCTCGC GGTCGGCGCG CCCCTGGGGC TGCGCAGTAC GGTGACCCAG GGCATTGTCA GCGCGCTACA CCGCCCCGTT CCGTTGTCGG GCGAGGGCTC TGACACCGAC ACCGTCATTG ACGCAATTCA GACCGACGCC TCGATCAACC ACGGTAACTC CGGCGGTCCG CTAATCGACA TGGATGCCCA GGTGATTGGC ATCAACACCG CCGGTAAGTC ACTGTCGGAT AGCGCCAGCG GGCTGGGCTT TGCGATCCCG GTCAACGAGA TGAAATTGGT GGCAAATTCT CTGATCAAAG ACGGAAAGAT CGTGCATCCG ACGTTGGGCA TCAGCACCCG GTCAGTAAGC AACGCGATCG CGTCGGGCGC GCAGGTGGCC AATGTAAAGG CGGGAAGTCC CGCGCAGAAG GGCGGGATCT TGGAGAACGA TGTGATCGTC AAGGTCGGTA ACCGCGCGGT CGCCGACTCC GACGAGTTCG TCGTCGCCGT GCGCCAGTTG GCTATCGGCC AGGACGCTCC GATAGAGGTG GTCCGCGAGG GTCGGCATGT GACGCTGACG GTGAAACCGG ACCCCGATAG CACCTAG
|
Protein sequence | MDTRVDTDNA MPARFSAQIQ NEDEVTSDQG NNGGPNGGGR LAPRPVFRPP VDPASRQAFG RPSGVQGSFV AERVRPQKYQ DQSDFTPNDQ LADPVLQEAF GRPFAGAESL QRHPIDAGAL AAEKDGAGPD EPDDPWRDPA AAAALGTPAL AAPAPHGALA GSGKLGVRDV LFGGKVSYLA LGILVAIALV IGGIGGVIGR KTAEVVDAFT TSKVTLSTTG NAQEPAGRFT KVAAAVADSV VTIESVSDQE GMQGSGVIVD GRGYIVTNNH VISEAANNPS QFKTTVVFND GKEVPANLVG RDPKTDLAVL KVDNVDNLTV ARLGDSSKVR VGDEVLAVGA PLGLRSTVTQ GIVSALHRPV PLSGEGSDTD TVIDAIQTDA SINHGNSGGP LIDMDAQVIG INTAGKSLSD SASGLGFAIP VNEMKLVANS LIKDGKIVHP TLGISTRSVS NAIASGAQVA NVKAGSPAQK GGILENDVIV KVGNRAVADS DEFVVAVRQL AIGQDAPIEV VREGRHVTLT VKPDPDST
|
| |