Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_1107 |
Symbol | |
ID | 5152322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 1179069 |
End bp | 1180040 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640556096 |
Product | putative serine protease |
Protein accession | YP_001237263 |
Protein GI | 148252678 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0917977 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCTT TGACCGAATG GAAAGTGCCG TTCGCCTTCC AGCCTCGCGC CGAAGATTAC CAATATGATC TTGATCACGC CTTGTCCTCG ATCGTGGGCC TGCACGCGAT CATTCCCGCC GACGCGTTCA GCGCGGAGAC GCTCGGCACC GAACGCGCCG GCAATGCGGT CGTGATCGAC GAAGGCCTGG TGCTCACGAT CGGTTATCTC ATCACCGAGG CCGAATCCGT CTGGCTGCAC CTCAACGATG GCCGGGTGGT CGAAGGCCAC GTGCTGGGGT TCGATTTCGC CACCGGCTTC GGTCTGGTGC AGGCGCTCGG GCAGCTTGAC CTGCCGGCAT TGCCGCTCGG CTCCTCGGAG TCTGCCAAGA TTGGCGATCA GGTGGTGCTC GGTGGCGCCG GCGGCCGCAC GCGATCCGTC GCCAGCCAGA TCATCGCCAA GCAGGAATTC GCCGGCTATT GGGAATATCT GCTGGACGAG GCGATCTTCA CCCATCCGGC GCATCCGAAT TGGGGCGGCA CTGCCCTGCT CTCGGCCAAG GGCGAGCTGA TCGGCATCGG GTCGCTGCAG CTCGAACGTG AGCGCGACAA CAAATCTGAG CACGTCAACA TGATCGTGCC GATCGATCTG CTGAAACCGA TTATCGACGA TCTCAAGCGG TTCGGCCGCG TCAACAAGCC GGCCCGGCCC TGGCTCGGCC TCTACGCCAC CGAGGTCGAC GATCGCGTCG TGGTGATCGG CGTCTCGAAC AACGGCCCCG CCGCCCGCGC CGAATTGAAG GCCGGCGACG TCATCCTCGG CATCAATGGC GACAAGGTGA CGAGCCAGAG CGAATTCTAC CGCAAGCTCT GGGCGCTCGG CGACGCCGGC GTGGACGTGC CACTCACGGT GCATCACGAG GGCGTCACCT TCGACGTCAC CGTGGCCTCG ACCGATCGCG CCAAGCTGCT CAAGGCGCCG CGGCTGCACT GA
|
Protein sequence | MPSLTEWKVP FAFQPRAEDY QYDLDHALSS IVGLHAIIPA DAFSAETLGT ERAGNAVVID EGLVLTIGYL ITEAESVWLH LNDGRVVEGH VLGFDFATGF GLVQALGQLD LPALPLGSSE SAKIGDQVVL GGAGGRTRSV ASQIIAKQEF AGYWEYLLDE AIFTHPAHPN WGGTALLSAK GELIGIGSLQ LERERDNKSE HVNMIVPIDL LKPIIDDLKR FGRVNKPARP WLGLYATEVD DRVVVIGVSN NGPAARAELK AGDVILGING DKVTSQSEFY RKLWALGDAG VDVPLTVHHE GVTFDVTVAS TDRAKLLKAP RLH
|
| |