Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4692 |
Symbol | |
ID | 5832140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 5246147 |
End bp | 5247658 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641370487 |
Product | protease Do |
Protein accession | YP_001642131 |
Protein GI | 163854088 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.601387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATGA CTGTCCGCCG CCGCGCCTTC GCCTCCGTCG CCGCAGCCGC CCTCGTCGCG GGCGGCGCGG CCGGGTTCGG CCTGACCGAG CCCATGACCC CGGCTTACGC CCAGGCCCTG CCCAAGACAC CGATCGAAGC GCCCGAGCAC CCGCCGGGCT CGTTTGCCAA CGTCGTCGAC AAGGTGAAGC CGGGCGTCGT CGCCGTGAAG GTGAAGCTCG ACAACAGCGC CGACGATGAC GACGACAGCG CGGGCGGCCC CAACCTGCAG CAGGTGCCGC CGCAACTGCG CGAATTCTTC AAGCGCTTCG GCCAGGGTGG GCCGGGCGGT CAGGGTGGGC GCGGCATGCC GCAGCGCGGC GAGCGCGGCG CGGTCGGCTC GGGCTTCATC ATCTCGGCGG ACGGCTACGT CGTCACCAAC AACCACGTCG TCGACAAGGC CAAGACCGTG CAGGTCACGC TCGACGACAA CCGCACCCTC GATGCCAAGG TGATCGGCAA GGATCCGAAG ACCGACATCG CGCTGCTCAA GATCACCGAG AGCGGCAGCT ATCCCTACGT CCAGTTCGGC AAGAGCGCCC CGCGCGTTGG CGACTGGGTC GTCGCCATCG GCAACCCGTT CGGCCTCGGC GGTACGGTGA CAGCGGGCAT CGTCTCGGCC CGCGGTCGCG ACATCGGCGC CGGCCCCTAC GACGACTTCC TGCAGATCGA CGCGCCGATC AACAAGGGCA ATTCCGGCGG CCCGACCTTC AACGTCAACG GCGAAGTCGT GGGCGTGAAC ACGGCGATCG CCTCGCCGTC CGGCGGCTCG GTCGGCCTCG CCTTCGCGAT CCCCGCCGAG ACGGTGCAGA CGGTGGTCGA TCAGCTCCGC ACCGACGGCA AGGTGGTGCG CGGCTATCTC GGCGTGCAGG TCCAGCCGGT GACGAAGGAC ATCGCCGACG GGCTCGGCCT CGACAAGGCC AAGGGCGCGC TGGTCGATCA CGCCGAGAAC GGCACGCCCG CGGCCAAGGC CGGCCTGAAA TCGGGCGACG TGATCGAGTC GGTCAACGGC GCCCCGGTCA ACGATGCCCG CGACCTCTCG CGCCGCATCG CCGGCCTCAA GCCCGGTACC GAGGTGAAGC TCGCCTATCT GCGGGGCGGC AAGAGCGACG TCGCGACGGT CGAACTCGGC ACGCAGCCGA CCGACGCCAA GGTCGCGAGC CGCAGTGATA GCTCGTCTGG TGGCCAGGCG CGCCTCGGCC TCAGCCTGGC CCCTGCCAGC GAGATCGGCC TCGGCGACGA AGGCGTGGCG GTGATGGATG TCGATCCCGA CGGTCCGGCC GCGGCCAAGG GCATCGCCCA GGGCGACGTG ATCCTGGATG TCGCCGGCAC CAGTGTCTCG AAGCCCTCCG AGGTGCAGGC GCAGATTCGC GCCGCAGAAT CGAACGGCCG CAAGGCGGTG CTGATGCGGG TGAAGAGCGC CAAGGGCCAG ACCCGCTTCG TCGCCGTGGC CCTCGGCAAG AAGGAGGGCT GA
|
Protein sequence | MTMTVRRRAF ASVAAAALVA GGAAGFGLTE PMTPAYAQAL PKTPIEAPEH PPGSFANVVD KVKPGVVAVK VKLDNSADDD DDSAGGPNLQ QVPPQLREFF KRFGQGGPGG QGGRGMPQRG ERGAVGSGFI ISADGYVVTN NHVVDKAKTV QVTLDDNRTL DAKVIGKDPK TDIALLKITE SGSYPYVQFG KSAPRVGDWV VAIGNPFGLG GTVTAGIVSA RGRDIGAGPY DDFLQIDAPI NKGNSGGPTF NVNGEVVGVN TAIASPSGGS VGLAFAIPAE TVQTVVDQLR TDGKVVRGYL GVQVQPVTKD IADGLGLDKA KGALVDHAEN GTPAAKAGLK SGDVIESVNG APVNDARDLS RRIAGLKPGT EVKLAYLRGG KSDVATVELG TQPTDAKVAS RSDSSSGGQA RLGLSLAPAS EIGLGDEGVA VMDVDPDGPA AAKGIAQGDV ILDVAGTSVS KPSEVQAQIR AAESNGRKAV LMRVKSAKGQ TRFVAVALGK KEG
|
| |