Gene Information |
Locus tag | Mpop_5124 |
Symbol | |
ID | 6312451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium populi BJ001 |
Kingdom | Bacteria |
Replicon accession | NC_010725 |
Strand | + |
Start bp | 5486981 |
End bp | 5488471 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642653806 |
Product | protease Do |
Protein accession | YP_001927755 |
Protein GI | 188584310 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
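The coordinate and length fields above agree with each other, and a short sketch can check this. It assumes the Start bp and End bp values are 1-based and inclusive (which matches End - Start + 1 = 1491) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Mpop_5124.
length = gene_length(5486981, 5488471)
print(length, protein_length(length))  # 1491 496
```

Both results match the Gene Length (1491 bp) and Protein Length (496 aa) rows.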
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.38057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGA CCGTTCGCCG CCGCGTCACC GCCTCCGTCG CTGCGGCCGC CCTCGTTGCG GGCGGCGCAG CCGGGTTCGG CCTCACCGAA TCCGCCATGC CGGCCTACGC CCAGGCCCTG CCCAAGACCC CGATCGAGGC TCCCGAGCAT CCGCCGGGCT CGTTTGCCAA CGTCGTCGAC AAGGTGAAGC CGGGCGTCGT TGCCGTGAAG GTCAAGCTCG ACAGCGGCCT CGACGACGAT GACGACGGCC CTGGCGGCCC CAACATGCAG CAGGTGCCGC CGCAACTGCG CGAGTTCTTC AGGCGCTTCG GCCAAGGTGG TCCCGGCGGG CGCGGCATGC CGCAGCGCGG CGGGGTCGGC TCCGGCTTCA TCATCTCGGC GGACGGCTAC GTGGTGACCA ACAACCACGT CGTCGATCAT GCCAAGACCG TGCAGGTCAC CCTCGACGAC GGCCGGACCC TCGACGCCAA GGTCATCGGC AAGGACCCGA AGACCGACAT CGCGCTCCTG AAGATCACCG AGAGCGGCAG CTACCCCTAC GTCCAGTTCG GCAAGGGCGC GCCGCGGGTC GGCGACTGGG TTTTGGCTAT CGGCAACCCG TTCGGCCTCG GCGGCACGGT CACGGCGGGC ATCGTCTCGG CCCGCGGCCG CGACATCGGT GCCGGCCCCT ACGACGACTT CCTGCAGATC GATGCGCCGA TCAACAAGGG CAATTCCGGC GGCCCGACCT TCAACGTCAA CGGCGAGGTC GTGGGCGTGA ACACGGCGAT CGCCTCCCCG TCCGGCGGCT CGGTCGGCCT CGGCTTCGCG ATCCCCGCCG AGACGGTGCA GACGGTGGTC GATCAGCTCC GCACCGACGG CAAGGTGGTG CGCGGCTATC TCGGCGTGCA GGTCCAGCCG GTGACGAAGG ACATCGCCGA GGGGCTCGGC CTCGACAAGG CCAAGGGTGC GCTCGTCAAT GACGCCGAGA GCGGCACGCC GGCCGCCAAG GCCGGGTTGA AGCCCGGCGA CGTGATCGAG TCGGTCAACG GCGTCCCGAT CGACAACGCG CGCGACCTCT CGCGGTTGAT CGCCGGCCTC AAGCCCGGCA CCGAGGTGAA GCTCACCTAT CGGCGCGGCG GCAAGAGCGA CACCGCGACC GTCGAACTCG GTACCTTGCC GGGCGATGGC AAAGTGGTGA GCCGCGGCGA CGACGCGCCG AGCGGTCAGG TCCGGCTCGG CCTCAGCCTG GCCCCCGCCA GCGAGGTTGG CCTCGGCGAC GAGGGCGTGG CGGTGATGGA TGTCGATCCG ACCGGCCCGG CGGCGGCCAG GGGCATCTCG CAAGGCGATG TGATCCTAGA TGTCGGCGGC ACCAGCGTCG CGAAGCCCTC CGATGTCCAG GCGCAGATCC GCGCCGCGGA ATCGAGCGGC CGCAAGGCGG TGCTGATGCG GGTGAAGGGC GCGAGGGGGC AGACCCGCTT CGTCGCCGTC GCGCTCAACA AGGAAGGATG A
|
Protein sequence | MALTVRRRVT ASVAAAALVA GGAAGFGLTE SAMPAYAQAL PKTPIEAPEH PPGSFANVVD KVKPGVVAVK VKLDSGLDDD DDGPGGPNMQ QVPPQLREFF RRFGQGGPGG RGMPQRGGVG SGFIISADGY VVTNNHVVDH AKTVQVTLDD GRTLDAKVIG KDPKTDIALL KITESGSYPY VQFGKGAPRV GDWVLAIGNP FGLGGTVTAG IVSARGRDIG AGPYDDFLQI DAPINKGNSG GPTFNVNGEV VGVNTAIASP SGGSVGLGFA IPAETVQTVV DQLRTDGKVV RGYLGVQVQP VTKDIAEGLG LDKAKGALVN DAESGTPAAK AGLKPGDVIE SVNGVPIDNA RDLSRLIAGL KPGTEVKLTY RRGGKSDTAT VELGTLPGDG KVVSRGDDAP SGQVRLGLSL APASEVGLGD EGVAVMDVDP TGPAAARGIS QGDVILDVGG TSVAKPSDVQ AQIRAAESSG RKAVLMRVKG ARGQTRFVAV ALNKEG
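The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal sketch. It builds the codon table from the NCBI amino-acid string for translation table 11, whose codon-to-residue assignments are the same as the standard code, and translates the first and last blocks of the gene sequence shown above. The helper name `translate` is hypothetical. The spaces between 10-base blocks are treated as display formatting and removed.

```python
from itertools import product

# NCBI amino-acid string for translation table 11, codons ordered
# with bases T, C, A, G at each position ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks of the gene sequence (30 bp, 10 codons).
print(translate("ATGGCCCTGA CCGTTCGCCG CCGCGTCACC"))  # MALTVRRRVT
# Final 21 bp of the gene sequence, ending in the TGA stop codon.
print(translate("GCGCTCAACA AGGAAGGATG A"))  # ALNKEG*
```

The two outputs match the start (MALTVRRRVT) and end (ALNKEG) of the protein sequence listed above, with the trailing `*` marking the stop codon that is not part of the 496-residue product.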
|
| |