Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_4914 |
Symbol | degP |
ID | 5156174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 5155929 |
End bp | 5157053 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640559712 |
Product | serine protease |
Protein accession | YP_001240842 |
Protein GI | 148256257 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.471766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAT GGAGCTTTGC CGCGCGCCTG TTGGCGGTCG CGGCGACGGC GCTGCTGCTC ATGATGGCGT GGCAGACTTT CCCGCTGATC CAGGCGGAGA TCCTGGGCCT GCGCGCCAAG CCCCGCGAGA TCACGGCGCG CGGCGACCTC GCCGCCGACG AAAAGAGCAC GATTGCGCTG TTCGAAAGCC GCAGCGGCTC GGTGGTCTTC ATCACGACCG TTCAACAATC AGTCAATGCC TGGACAGGCG ATGCGCAGCA GGAGCGCTCC GGCACCGGCT CCGGCTTCGT CTGGGACGAT CTTGGCCATG TCGTCACCAA TTATCACGTC ATCGAGGGCG CGACTGAAGC ACTGGTCAGC CTGACCGATG GCCGCTCGTT CCGCGCGGCC CTGGTCGGCG CCAACCCGGA GAACGATCTC GCGGTGCTGC TGATCGGCGT CGGCACCGAC CGGCCGAAGC CGTTGCCGAT CGGGACCAGC GCCGATCTCA AAGTGGGGCA GAAAGTGTTC GCGATCGGCA ATCCGTTCGG CCTCAGCAGT ACGCTGACCA CGGGCATCGT CTCGGCGCTC AACCGCAACC TGCAGGTCAC GCAGGAGCGC ACCCTCAACG GCTTGATCCA GACCGATGCC GCGATCAATC CCGGCAATTC CGGCGGGCCG CTGCTCGACA GCGCCGGACG GCTGATCGGG GTCAATACCG CGATCTACAG CCCGTCCGGG GCGTCGGCCG GGATCGGCTT CGCCGTGCCG GTCGACAAGG TCAACCGCAT CGTGCCGCGG CTGATCGCGA GCGGCCGCTA TGTCAGCCCG AGTCTCGGGA TCCGGACCGA TCCAAAAGCC AATGAGGCGC TGTCGGCCCG CCTCAATATG TCGGGCGTGT TCGTGCTCGA TGTCGAGCCG GATTCGGCGG CGGAGAAGGC GGGGCTGATC CCGGCGCGCC TGACCCGCGA CGGCGGCTTC GCGCTTGGCG ACGTGCTGCT GGCCATCGAC GGACAGGTGG TGGATTCGCC CGACGACATG ACACGGGCGT TGGAGACCAA GACTCCCGGC GACCGCGTCG TGCTGCGGGT CAGGCGCGCC GGCAAGACGA TCGAGGTCCG GGTGACGCTC GACGTCGCGC GGTGA
|
Protein sequence | MSRWSFAARL LAVAATALLL MMAWQTFPLI QAEILGLRAK PREITARGDL AADEKSTIAL FESRSGSVVF ITTVQQSVNA WTGDAQQERS GTGSGFVWDD LGHVVTNYHV IEGATEALVS LTDGRSFRAA LVGANPENDL AVLLIGVGTD RPKPLPIGTS ADLKVGQKVF AIGNPFGLSS TLTTGIVSAL NRNLQVTQER TLNGLIQTDA AINPGNSGGP LLDSAGRLIG VNTAIYSPSG ASAGIGFAVP VDKVNRIVPR LIASGRYVSP SLGIRTDPKA NEALSARLNM SGVFVLDVEP DSAAEKAGLI PARLTRDGGF ALGDVLLAID GQVVDSPDDM TRALETKTPG DRVVLRVRRA GKTIEVRVTL DVAR
|
| |