Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_1086 |
Symbol | |
ID | 6131616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 1210776 |
End bp | 1212281 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641641376 |
Product | protease Do |
Protein accession | YP_001768048 |
Protein GI | 170739393 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0760064 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCTG CCGAGTCCGC GCCCGCGCGC CGATCCGAAG CCTTCGCGAA ACGGCGGCTG CCGGCGCTCG CGGGCGCCCT CCTGGCGCTC TCGGTCGGGA CCGCCGCGCT CCCCTCCGCG GCCCTCGCCA GGGGCCCCGA ATCCCTCGCC GACCTCACCG AGCAGGTCAC CGACGCGGTG GTGAACATCT CGGCCTCGAC GACGGTGGAG ACCCGCGGCC GCACCCTGCC GCAGCTGCCC CCCGGCACCC CCTTCGAGGA CCTGTTCGAG GATTTCTTCA ACCGGCGGGG CGGCGGCGAT CAGCCGCGCC AGCCGCGCAA GTCGAACTCG CTCGGCTCCG GCTTCATCAT CGACGCCTCC GGCATCGTGG TGACGAACAA CCACGTGATC GGCGACGCCA ACGACATCCA GGTCATCCTG CACGACGGCC GCAAGCTGAA GGCGGAGATC GTCGGCAAGG ATTCCAAGAC CGACATCGCG GTGCTGCGGG TCAAGCCGGA GGCGGACCGG CCGCTCAAGG CGGTGCCGCT CGGCGATTCC GAGAAGATGC GGCCGGGCGA CTGGGTGATC GCGATCGGCA ACCCGTTCGG CCTCGGCGGC TCGGTCTCGG CCGGCATCGT CTCGGCGCGC GGCCGCAACA TCGATTCGGG GCCCTACGAC AACTACATCC AGACCGACGC GGCCATCAAC AAGGGCAATT CGGGCGGTCC GCTGTTCAAC ATGAGCGGCG AGGTGATCGG CATCAACACG GCGATCCTGT CGCCGACCGG CGGCTCGGTC GGCATCGGCT TCGCGGTCCC GACCGCGACG GCGGCCCCGG TGATCGAGCA GTTGCGCCAG TACGGCGAGA CCCGTCGCGG CTGGCTCGGC GTGCGGATCC AGAACGTCGA CGACACCACC GCCGAGGCGC TCGGCCTCAA GGGCGGCGCC CGCGGCGCGC TGATCGCCGG CATCGACGAG AAGGGCCCGG CCAAGACCGC CGGCTTCGAG GTCGGCGACG TGATCGTGAA GTTCAACGGC GTCGAGGTGA AGTCGTCGAG CGACCTGCCC CGCATCGTGG CGACGACGCC GGTCGGCAAG ACCGTGGACG TGCTCACGAT CCGCAAGGGC GCGGAGCAGA CGCGGCCGGT CACCCTCGGG CGGCTGGAGG ACAACGACAA GCCCCAGCCC GCCGCCCTCA ACCGGCCCCA GCCCGAGGCC GACGTGACGC GCCAGGCCCT CGGCCTCAAC CTGACCGGCC TCTCCGAGGA GGCGCGGCGG CGCTTCAACA TCAAGGACGG GCTGAAGGGG GTGGTAGTCA CCCGCGTCGA CCCGAACTCG AACGCGGCCG ACAAGCGCAT CCAGGCCGGC GACCTCATCG TCGAGGTCGG CCAGGAGCCG GTGAACTCAC CCTCGGACGT CACCCGCCGC CTGGATCAGA TCAAGAAGGA GGGCCGCAAA TCCGCCCTGC TGCTGGTCTC GAACGCCCAG GGCGAGGTGC GGTTCGTGGC GCTGAGCCTC GAATAG
|
Protein sequence | MPAAESAPAR RSEAFAKRRL PALAGALLAL SVGTAALPSA ALARGPESLA DLTEQVTDAV VNISASTTVE TRGRTLPQLP PGTPFEDLFE DFFNRRGGGD QPRQPRKSNS LGSGFIIDAS GIVVTNNHVI GDANDIQVIL HDGRKLKAEI VGKDSKTDIA VLRVKPEADR PLKAVPLGDS EKMRPGDWVI AIGNPFGLGG SVSAGIVSAR GRNIDSGPYD NYIQTDAAIN KGNSGGPLFN MSGEVIGINT AILSPTGGSV GIGFAVPTAT AAPVIEQLRQ YGETRRGWLG VRIQNVDDTT AEALGLKGGA RGALIAGIDE KGPAKTAGFE VGDVIVKFNG VEVKSSSDLP RIVATTPVGK TVDVLTIRKG AEQTRPVTLG RLEDNDKPQP AALNRPQPEA DVTRQALGLN LTGLSEEARR RFNIKDGLKG VVVTRVDPNS NAADKRIQAG DLIVEVGQEP VNSPSDVTRR LDQIKKEGRK SALLLVSNAQ GEVRFVALSL E
|
| |