Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_5038 |
Symbol | |
ID | 5150468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 5266565 |
End bp | 5267959 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640559816 |
Product | serine protease |
Protein accession | YP_001240945 |
Protein GI | 148256360 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.431803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTGA TTCGCTCCAC CGTGGTTCTG CTGCTGTCGC TGCTGGCCGC CACTCCGCTC GCGGCGCAGG AGCGGCGCGT ACCGCAATCC CCGGCCGAGC TGCGGCTGTC CTATGCGCCG ATCGTGCAGC GCGTGCAGCC GGCGGTCGTC AACGTCTACG CCGCCAAGGT GGTGCAGAAC CGCAATCCGT TCCTCGACGA TCCGATCTTC CGCCGCTTCT TCGGCCTGCA GGGCGGGCCG CAGGAGCAGA TGCAGCGCTC GCTCGGCTCG GGCGTGATGG TCGATGCGTC GGGCCTGGTG GTCACCAACG TGCACGTCAT CGAGGGCGCC GACGAGGTGA AGGTGTCGCT GTCGGACAAG CGCGAGTTCG AGGCCGAGAT CGTGCTGAAG GACAGCCGTA CCGATCTTGC CGTGCTCAGG CTCAAAGGCA CGCGCGAGAC CTTTCCGACG CTCGACCTCG CCAATTCCGA TGACCTCCTG GTCGGCGACG TCGTGCTCGC AATCGGCAAT CCGTTCGGCG TCGGCCAGAC CGTCACGCAT GGCATCGTCT CGGCGCTGGC GCGCACCCAG GTCGGCATCA CCGACTATCA ATTCTTCATC CAGACCGATG CCGCGATCAA TCCCGGCAAT TCCGGTGGCG CGCTGGTCGA CATGACCGGC CGTCTGGTCG GTATCAACAC GGCGATCTAT TCCAAGTCCG GCGGTTCGCA GGGAATCGGC TTCGCGATTC CCGCCAACAT GGTGCGCGTC GTCGTGGCCT CGGCCAAGAG TGGCGGCAAG GCGGTGAAGC GTCCGTGGCT CGGCGCGCGG CTGCAGGCGG TGACGCCGGA GATCGCCGAG AGCCTGGGTC TGCGCTCACC CACCGGCGCG CTGGTCGCCT CCGTCGTTCC GAACAGCCCG GCGGCGAAAG CGGGCATCAA ATCGTCGGAC CTGATCGTGT CGATCGACGG CCAGACCGTC GATGATCCGA ACGCCTTCGA CTATCGCTTC GCGACCCGTC CGCTCGGCGG CAACGCGCAG ATCGAGGTGC AGCGGAGTGG CAAGCCGGTG AAGGTCGCGG TGGCGCTGGA GACGGCGCCT GACACCGGGC GCAACGAGAT GGTCATCAAC GGCCGCTCGC CGTTCCAGGG CGCCAAGGTG GCCAACATCT CGCCCGCGGT GGCCGATGAG CTGCATCTCG ACGCCGACAC CGAAGGCGTC GTCGTGCTCG AGCCGGGCGA CGGCACCACG GCGGCCAATG TCGGCTTCCA GAAGGGCGAC GTGATCATGG CCGTCAACAA CCAGAAGATT GCCAAGACCG CCGATCTCGA CAAGGCCTCG CGCGAGTCTG CGCGCATCTG GCGTATCACC GTGCTGCGCG GTGGCCAGCA GATCAACGTC ACGCTCGGCG GATGA
|
Protein sequence | MTLIRSTVVL LLSLLAATPL AAQERRVPQS PAELRLSYAP IVQRVQPAVV NVYAAKVVQN RNPFLDDPIF RRFFGLQGGP QEQMQRSLGS GVMVDASGLV VTNVHVIEGA DEVKVSLSDK REFEAEIVLK DSRTDLAVLR LKGTRETFPT LDLANSDDLL VGDVVLAIGN PFGVGQTVTH GIVSALARTQ VGITDYQFFI QTDAAINPGN SGGALVDMTG RLVGINTAIY SKSGGSQGIG FAIPANMVRV VVASAKSGGK AVKRPWLGAR LQAVTPEIAE SLGLRSPTGA LVASVVPNSP AAKAGIKSSD LIVSIDGQTV DDPNAFDYRF ATRPLGGNAQ IEVQRSGKPV KVAVALETAP DTGRNEMVIN GRSPFQGAKV ANISPAVADE LHLDADTEGV VVLEPGDGTT AANVGFQKGD VIMAVNNQKI AKTADLDKAS RESARIWRIT VLRGGQQINV TLGG
|
| |