Gene Information |
Locus tag | Mext_2656 |
Symbol | |
ID | 5831060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2971027 |
End bp | 2972562 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641368457 |
Product | protease Do |
Protein accession | YP_001640119 |
Protein GI | 163852076 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
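The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2971027
END_BP = 2972562
GENE_LENGTH_BP = 1536
PROTEIN_LENGTH_AA = 511


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1536
    print(expected_protein_length(GENE_LENGTH_BP))  # 511
```

Under these assumptions the span is 2972562 - 2971027 + 1 = 1536 bp, and 1536 / 3 - 1 = 511 aa, which agrees with the listed Gene Length and Protein Length.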
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.419635 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTCG CCGCGAACGC CGTCCGGGGT CGAACGCCGT CGTTCGCCCG GCGCGCCTCA TCGGCCCTGG CCGCTGCGGT GCTGGGCGTC ACCGTCACGG TCACCGCCCT GCCGCTCCCC GCCTTCGCCC GTGGCCCGGA ATCGCTCGCC GACCTTGCCG ACAAGGTGAC GGATGCGGTG GTGAACATCT CGGCCTCGAC AACGGTCGAA GCCAGCAACC GCGGCGGCCG GACCATGCCG CAACTGCCTC AGGGCACACC CTTCGAGGAT CTCTTCGAGG AGTTCTTCAA GCGGCGCGGC CAGGGCGCCC CGAAGGGTGA CGACGAAAGC CCGCGCGGAC CGACGCGCAA GTCGAACTCG CTCGGCTCCG GCTTCATCAT CGACGCCTCG GGCATCGTGG TGACGAACAA CCACGTCATC GGCGACGCCA ACGACATTCA GGTCATCCTG AGCGACGGCA CCAAGCTCAA GGCAGAGATC ATCGGCAAGG ATTCGAAGAT CGACCTCGCC CTGCTTCGGG TGAAGCCGAC GGCCGAGCGC CCTCTCAAGG CCGTGCCCTT CGGCGATTCC GACAAGATGC GCCCGGGCGA CTGGGTGATG GCGATCGGCA ACCCGTTCGG CCTCGGCGGC TCGGTCTCCG CCGGCATCGT CTCGGCGCGG GGCCGCAACA TCGAGTCCGG ACCCTACGAC AACTACATCC AGACCGACGC GGCCATCAAC AAGGGCAATT CCGGCGGTCC GCTGTTCAAC ATGGACGGAG AGGTGATCGG CATCAACACC GCGATCCTTT CCCCCTCGGG CGGCTCGGTC GGCATCGGCT TCGCGGTGCC GTCGGCAACC GCCGGTCAGG TCGTCGATCA GCTCCGCCAG TTCGGCGAGG TCCGCCGCGG CTGGATCGGC GTGCGCATCC AGAACGTCGA TGAGGCCACC GCCGAGGCGC TCGGCCTGAA GGGCGGCGCT AAGGGTGCGC TGGTGGCCGG CGTCGACGAG AAGGGCCCGG CCAAGACCGC GGGGCTCGAG GTCGGCGACG TCATCGTCAA GTTCAACGGT GTGCCGGTGA AATCCTCCAG CGAGTTGCCG CGCATCGTCG CCGCGACCCC GGTGGGCAAG TCCGTGGACG TCCAAGTCGT ACGCAAGGGC GAGGAGCAGA CGAAATCTGT CGTGCTCGGT CGCCTCGAGG ACGGCGAGAA GGCTCAGGTC GCCAACCTCA AGCAGCCGGA GGCGGAATCG GTCAATCGCC AGGTCCTCGG CCTCAACCTC TCCGGCCTCA ACGACGAGGT GCGGCGCCGC TACGGCATCA AGGAGAGCGT CAAGACCGGC GTGGTCGTCA CCAAGGTCGA TCCCAACTCG ACCGCCGCCG ACAAGCGCAT CCAGCCGGGC GAGGTCATCG TCGAGGTTGG CCAGGAGGCG ATCTCGAACC CGGCCGACGT GACGAAGCGC GTCGAGGCGC TCAAGAAGGA GGGCCGCAAG TCGGTGCTGC TGCTGGTGGC CAGCGCGAGC GGCGACGTGC GCTTCGTCGC GATCGGGTTG GAGTAA
|
Protein sequence | MRLAANAVRG RTPSFARRAS SALAAAVLGV TVTVTALPLP AFARGPESLA DLADKVTDAV VNISASTTVE ASNRGGRTMP QLPQGTPFED LFEEFFKRRG QGAPKGDDES PRGPTRKSNS LGSGFIIDAS GIVVTNNHVI GDANDIQVIL SDGTKLKAEI IGKDSKIDLA LLRVKPTAER PLKAVPFGDS DKMRPGDWVM AIGNPFGLGG SVSAGIVSAR GRNIESGPYD NYIQTDAAIN KGNSGGPLFN MDGEVIGINT AILSPSGGSV GIGFAVPSAT AGQVVDQLRQ FGEVRRGWIG VRIQNVDEAT AEALGLKGGA KGALVAGVDE KGPAKTAGLE VGDVIVKFNG VPVKSSSELP RIVAATPVGK SVDVQVVRKG EEQTKSVVLG RLEDGEKAQV ANLKQPEAES VNRQVLGLNL SGLNDEVRRR YGIKESVKTG VVVTKVDPNS TAADKRIQPG EVIVEVGQEA ISNPADVTKR VEALKKEGRK SVLLLVASAS GDVRFVAIGL E
|
| |
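The relationship between the gene sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only, with hypothetical helper names. It assumes the gene sequence is shown in coding orientation (it begins with ATG although the gene lies on the minus strand), so no reverse complement is applied. Translation table 11 uses the standard codon assignments for elongation; its alternative start codons are not handled here, which is acceptable for this record because the gene starts with ATG.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-residue blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence above.
    print(translate("ATGAGACTCG CCGCGAACGC CGTCCGGGGT"))  # MRLAANAVRG
    print(translate("GCGATCGGGTTGGAGTAA"))                # AIGLE
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives an end-to-end consistency check, and `gc_percent` on the full sequence can be compared with the listed GC content.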