Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpop_2778 |
Symbol | |
ID | 6314143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium populi BJ001 |
Kingdom | Bacteria |
Replicon accession | NC_010725 |
Strand | - |
Start bp | 2980006 |
End bp | 2981541 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642651502 |
Product | protease Do |
Protein accession | YP_001925468 |
Protein GI | 188582023 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.725165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.242961 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTCG CCGCGAACGC CGTCCGGGGC CGAACGCCGT CGTTTGCCCG GCGCGCCTCA TCGGCCCTGG CCGCCGCGGT ACTGGGCGTC ACCGTCACGG TGACGGCCCT GCCGATGCCA GCCTTCGCCC GGGGTCCGGA ATCGCTTGCC GACCTTGCCG AGAAGGTGAC GGATGCGGTG GTGAACATCT CGGCTTCGAC CACGGTCGAG GCCAGCAATC GCGGTGGCCG CAACCTGCCG CAGCTGCCTC AGGGCACGCC CTTCGAGGAC CTGTTCGAGG AATTCTTCAA GCGGCGCGGC CAGGGCGCGC CGAAGGGCGA CGAGGACAGC CCGCGCGGTC CCACGCGCAA GTCGAACTCG CTGGGCTCCG GCTTCATCAT CGACGCCTCG GGCATCGTCG TGACGAACAA CCACGTCATC GGCGACGCCA ACGACATCCA GGTGATCCTG CACGACGGCA CCAAGCTGAA GGCGGAGATC ATCGGCAAGG ATTCGAAGAT CGACCTCGCC CTGCTCCGGG TGAAGCCCAC TGCCGATCGC CCGCTCAAGG CGGTGCCGTT CGGCGATTCC GACAAGATGC GCCCGGGCGA CTGGGTGATG GCGATCGGTA ACCCGTTCGG CCTCGGCGGC TCGGTCTCCG CCGGCATCGT CTCGGCGCGC GGTCGCAACA TCGAATCCGG CCCCTACGAC AACTACATCC AGACGGATGC GGCCATCAAC AAGGGCAATT CGGGCGGCCC GCTGTTCAAC ATGGACGGCG AGGTGATCGG CATCAACACC GCGATCCTCT CGCCCTCGGG CGGCTCCGTC GGCATCGGCT TCGCGGTTCC GTCGGGGACG GCCAGCCAAG TCGTCGACCA GCTTCGGCAG TTCGGCGAGG TTCGCCGCGG CTGGCTCGGC GTGCGTATCC AGAACGTCGA CGAGGCCACC GCCGAGGCGC TCGGCCTCAA GGGCGGCGCC AAGGGCGCGC TGGTGGCCGG CGTCGACGAG AAGGGCCCCG CCAAGACCGC CGGGCTCGAG GTCGGCGACG TCATCGTCAA GTTCAACGGC GTGCCGGTGA AGTCGTCGAG CGAACTGCCG CGCATCGTCG CCGCGACCCC GGTGGGCAAG ACCGTGGACG TTCAGATCGT GCGCAAGGGC GAGGAGCAGA CGAAGTCCGT CCTGCTCGGC CGCCTCGAGG ACGGCGAGAA GACCCAGGTC GCCAACGTCA AGCAGCCGGA AGCGGAATCG GTCAATCGCC AGATCCTCGG CCTCAACCTC TCCGGTCTCA ACGACGAGGC GCGCCGCCGC TTCGGCATCA AAGAGAGCGT GAAGAGCGGC GTGGTCGTCA CCAAGGTCGA TCCGAACTCG ACCGCCGCCG ACAAGCGCAT CCAGCCCGGC GAGGTCATCG TCGAGGTCGG CCAGGAAGCG GTCGCGAACC CGGCCGACGT GACGAAGCGC GTCGAGGCGC TCAAGAAGCA AGGCCGCAAG TCGGTGCTGC TGCTGGTCGC CAGCGCCAGC GGCGACGTGC GCTTCGTGGC AATCGGGCTG GACTAG
|
Protein sequence | MRVAANAVRG RTPSFARRAS SALAAAVLGV TVTVTALPMP AFARGPESLA DLAEKVTDAV VNISASTTVE ASNRGGRNLP QLPQGTPFED LFEEFFKRRG QGAPKGDEDS PRGPTRKSNS LGSGFIIDAS GIVVTNNHVI GDANDIQVIL HDGTKLKAEI IGKDSKIDLA LLRVKPTADR PLKAVPFGDS DKMRPGDWVM AIGNPFGLGG SVSAGIVSAR GRNIESGPYD NYIQTDAAIN KGNSGGPLFN MDGEVIGINT AILSPSGGSV GIGFAVPSGT ASQVVDQLRQ FGEVRRGWLG VRIQNVDEAT AEALGLKGGA KGALVAGVDE KGPAKTAGLE VGDVIVKFNG VPVKSSSELP RIVAATPVGK TVDVQIVRKG EEQTKSVLLG RLEDGEKTQV ANVKQPEAES VNRQILGLNL SGLNDEARRR FGIKESVKSG VVVTKVDPNS TAADKRIQPG EVIVEVGQEA VANPADVTKR VEALKKQGRK SVLLLVASAS GDVRFVAIGL D
|
| |