Gene Information |
Locus tag | Mpop_5235 |
Symbol | |
ID | 6309243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium populi BJ001 |
Kingdom | Bacteria |
Replicon accession | NC_010725 |
Strand | - |
Start bp | 5602904 |
End bp | 5604409 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642653916 |
Product | protease Do |
Protein accession | YP_001927864 |
Protein GI | 188584419 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
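The coordinate, gene length, and protein length fields above can be cross-checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table.
length = gene_length_bp(5602904, 5604409)
protein = protein_length_aa(length)
print(length, protein)
```

With the table's values this gives 1506 bp and 501 aa, matching the Gene Length and Protein Length rows.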
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATGA CTGTCCGCCG CCGCGCCTTC GCTTCCGTTG CCGCGGCCGC CCTCGTTGCG GGCGGTGCAG CCGGGTTCGG CCTCACCGAA TCCGCGATGC CGGCCTACGC TCAGGCCCTG CCCAAGACCC CGATCGAGGC CCCCGAGCAT CCGCCGGGCT CGTTTGCCAA CGTCGTCGAC AAGGTGAAGC CGGGCGTCGT CGCCGTGAAG GTCAAGCTCG ACAACAGCGC CGGCGACGAT GACGACAGCT CCGGCAACCC GAACCTCCAG CAGGTGCCGC CGCAGCTGCG CGAGTTCTTC AAGCGCTTCG GCCAAGGTGG TCCCGGCGGA CGCGGCATGC CGCAGCAGCG CGGCGAGCGC GGCGCGGTCG GCTCCGGCTT CATCATCTCG GCGGACGGCT ACGTCGTGAC CAACAACCAC GTCGTCGATC ACGCCAAGAC CGTGCAGGTC ACCCTCGACG ACGGCCGGAC CCTCGACGCC AAGGTCATCG GCAAGGACCC GAAGACCGAC ATCGCGCTCC TGAAGATCAC CGAGAGCGGC AGCTACCCCT ACGTCCAGTT CGGCAAGGGC GCGCCGCGGG TCGGCGACTG GGTCGTCGCC ATCGGCAACC CGTTCGGCCT CGGCGGCACG GTGACGGCGG GCATCGTGTC GGCCCGCGGC CGCGACATCG GCGCCGGCCC CTACGACGAC TTCCTGCAGA TCGATGCGCC GATCAACAAG GGTAATTCCG GCGGCCCGAC CTTCAACGTC AACGGCGAGG TCGTGGGCGT GAACACGGCG ATCGCCTCCC CGTCCGGCGG CTCGGTCGGC CTCGCCTTCG CGATCCCCGC CGAGACGGTG CAGACGGTGG TCGATCAGCT CCGCACCGAC GGCAAGGTGG TGCGCGGCTA TCTCGGCGTG CAGGTGCAGC CGGTGACGAA GGACATCGCC GAGGGGCTCG GTCTCGACAA GGCCAAGGGA GCCCTGGTCG ATCACGCCGA GAACGGCACG CCGGCCGCCA AGGCCGGGTT GAAGTCGGGC GACGTGATCG AGTCGGTCAA CGGCGCGCCG GTCAACGATG CCCGCGACCT GTCACGCCGC ATCGCCGGCC TCAAGCCCGG CACCGAGGTG AAGCTCGCCT ATCTGCGCGG CGGCAAGAGC GACATCGCGA CGGTCGAACT CGGCACCCTG CCGACGGACG GCAAGGTGGC CAGCCTCGGC GACGGCGCCT CGGGCGGTCA ACCGCGCCTT GGCCTGAGCC TTGCACCGGC GAACGATGTC GGCCTCGGCG ACGAGGGCGT GGCGGTGATG GATGTCGATC CCGACGGCCC GGCCGCGGCC AAGGGCATCG CCCAGGGTGA CGTGATCCTG GACGTTGCAG GAACCAGCGT CGCGAAGCCC TCCGATGTCC AGGCGCAGAT CCGCGCCGCG GAGTCGAATG GCCGCAAGGC TGTGCTGATG CGCGTGAAGA GTTCCAAGGG GCAGACCCGC TTCGTCGCCG TCGCGCTCGG CAAGAAGGAG GGCTGA
|
Protein sequence | MTMTVRRRAF ASVAAAALVA GGAAGFGLTE SAMPAYAQAL PKTPIEAPEH PPGSFANVVD KVKPGVVAVK VKLDNSAGDD DDSSGNPNLQ QVPPQLREFF KRFGQGGPGG RGMPQQRGER GAVGSGFIIS ADGYVVTNNH VVDHAKTVQV TLDDGRTLDA KVIGKDPKTD IALLKITESG SYPYVQFGKG APRVGDWVVA IGNPFGLGGT VTAGIVSARG RDIGAGPYDD FLQIDAPINK GNSGGPTFNV NGEVVGVNTA IASPSGGSVG LAFAIPAETV QTVVDQLRTD GKVVRGYLGV QVQPVTKDIA EGLGLDKAKG ALVDHAENGT PAAKAGLKSG DVIESVNGAP VNDARDLSRR IAGLKPGTEV KLAYLRGGKS DIATVELGTL PTDGKVASLG DGASGGQPRL GLSLAPANDV GLGDEGVAVM DVDPDGPAAA KGIAQGDVIL DVAGTSVAKP SDVQAQIRAA ESNGRKAVLM RVKSSKGQTR FVAVALGKKE G
|
| |
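The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand of the replicon) and uses the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing in the permitted start codons. The helper names are hypothetical, and only the first and last codons of the record are used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence above.
prefix = clean_sequence("ATGACCATGA CTGTCCGCCG CCGCGCCTTC")
print(translate(prefix))
```

Applied to the first thirty bases, this yields MTMTVRRRAF, the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give the value to compare against the GC content row.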