Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_2753 |
Symbol | htrA |
ID | 5149369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 2850037 |
End bp | 2851074 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640557642 |
Product | serine protease DO-like precursor |
Protein accession | YP_001238796 |
Protein GI | 148254211 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATT TCACTTCCGA TGTCGCCGGC GACGCACTGT CGCCGGCGCA AGCGTCTGGT GCTGCCCCCG CCGATGATCG GGCGCTGCTC GACGCCTATT CCAATGCTGT CATCGACGTC ACCGAGCGCG TCGGCCCGGC CGTCGTCCGC GTCGAGACCG GCCCGAAGGT CGGCTCCCGC GGCGAGCGCG GCGGCCTCGG CTCCGGCATC GTGATCTCGC CGGATGGTCT CGTGCTGACC AACAGCCATG TGGTCGGCAG CTCCAAGACC ATCCGGTTGC GCGATGTCGA GGGTGTCGTC ACCGACGCCC AGGTGCTCGG TGTCGATCCC GACACCGACC TCGCGCTGCT GCGCGCGAAC CATGCGCGCG ATTTGCGTTA CGCCGCGCTC GGCAACTCCA AGAGCCTGCG GCGCGGCCAA CTCGTGGTCG CGATCGGCAA TCCGCTCGGC TTCGAGTCGA CGGTGACCGC CGGCGTCGTC TCGGCGCTGG GCCGCTCGAT CCGCTCGGTC TCGGGCCGGA TGATCGAGGA CGTTATCCAG ACCGATGCCG CGCTCAACCC CGGCAATTCG GGTGGGGCGC TGGTGTCGTC GGCCGCCGAG GTGATCGGCA TCAACACCGC CATCATCCAA GGCGCGCAGG GCATCTGCTT TGCGGTTGCC AGCAACACCG CGCAATTCGT GCTGTCGGAG ATCATCCGCC ACGGCTATGT CCGCCGCGCC TATGTCGGCG TCTCCGGACA GACCGCGCCG GTGCCGCGGC GCCACGCCGT GCTGGCCGGC GTCGAGAACA AGATGGGCGC GTTGTTGATG CAGATCGAGC CGGACGGGCC GGCCGCGCGT GCCGGGCTGT TGCCGGGCGA TGTCGTGATC AGGCTCGACG GTGTCGACAT CAACGGCGTC GACGACCTGA TCCGCGTGCT CGACCGCGAC CGCATCGGCC GCACCGTGGC GATGGACGTG CTGCGGCTGG GACGGCTGCG CGGCATCGAC ATTCATCCCG TCGAGCGCAA GCCGGCGTCG CGGCAGCCGG CGACGTAG
|
Protein sequence | MLDFTSDVAG DALSPAQASG AAPADDRALL DAYSNAVIDV TERVGPAVVR VETGPKVGSR GERGGLGSGI VISPDGLVLT NSHVVGSSKT IRLRDVEGVV TDAQVLGVDP DTDLALLRAN HARDLRYAAL GNSKSLRRGQ LVVAIGNPLG FESTVTAGVV SALGRSIRSV SGRMIEDVIQ TDAALNPGNS GGALVSSAAE VIGINTAIIQ GAQGICFAVA SNTAQFVLSE IIRHGYVRRA YVGVSGQTAP VPRRHAVLAG VENKMGALLM QIEPDGPAAR AGLLPGDVVI RLDGVDINGV DDLIRVLDRD RIGRTVAMDV LRLGRLRGID IHPVERKPAS RQPAT
|
| |