Gene Information |
Locus tag | Mext_2221 |
Symbol | |
ID | 5832696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2465343 |
End bp | 2466809 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641368020 |
Product | protease Do |
Protein accession | YP_001639687 |
Protein GI | 163851644 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
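The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS includes its stop codon. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length(2465343, 2466809)
print(length)                  # 1467
print(protein_length(length))  # 488
```

Both results agree with the Gene Length (1467 bp) and Protein Length (488 aa) fields in the record.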
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.370789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCCT CGCCGCTCCT GCCGGCCTGC CTCGTATCCG TGCTTCTCGC CGCCGCGCCT GCGAGCGCGC AGATGGCGAA CACGCCCGGA AAGCCCGCCG CGGAGAAGGC CTCCCCGGAC AAGGCGGTGC CCCTGTCAAA GGGCGAGATC CAGCTCTCCT TCGCTCCCGT GGTGAAGCGG GCGGCGCCCT CCGTGGTCAA CGTCTATGCC TCCCATGTCG AGAAGCGCTC CGCACGCTCC AACGCCATGG AAGAGTTCAT GCGCCGCTTC TTCGGCGAGG ACCGTCCGGG CCGCGGCCCC AGCGGCCTGC CCGGCGAGCG GGCGCAGCGC TCCCTCGGCT CGGGCGTGAT CGTCGATGGC TCGGGCCTCG TCATCACCAA CAACCACGTC ATCGAGAACA TGAACGAGGT GAAGGTGGCG CTCGCCGACA AGCGCGAGTT CGAGGCGCAG ATCGTGCTGC GCGACCCCCG CACCGACCTC GCGGTGCTCA AGATCAAGGG CCCGGCCGAC ATCGCCTCGA TGCCGATCGG CGATTCCGAC CACTTGGAGG TCGGCGATTT CGTCATGGCA ATCGGCAACC CGTTCGGCGT CGGGCAGACC GTGACGCAGG GCATCGTCTC GGCGCTGGCC CGCACCCAGG TCGGATCGTC GGACTACCAG TTCTTCATCC AGACCGATGC GGCGATCAAT CCGGGCAATT CCGGCGGCGC GCTGGTGGAC CTGAAGGGGC ATCTCGTCGG CATCAACACC GCGATCTATT CGCAGTCCGG CGGCAGCCAC GGCATCGGCT TCGCCATTCC CGCGAGCATG GTCCGCGCCG TGGTGGAGAC CGCCAAGAGC GGCGGCAGCC TCGTGCGCCG GCCCTGGCTC GGGGCGCGGG TGCAGGGCGT AACCCCGGAT ATCGCCGAGA GCGTCGGGCT TGACCGGCCG ACCGGTGTGC TGGTGGCGAG CATGCAGGCC AAGAGCCCGG CCGAGGAAGC CGGTCTCAAG CGCGGCGACG TGATCCTCAC GGTCGATGGA CAGACCGTCG AAGATCCGGA AGCCTTCGGC TACCGCTACG CCCTCAAGGG CATTTCCGGC ACAGCCGATT TCGGCATCCT GCGCGGCACC AAGCGGCAGA CGGTCCAGAT CAAGCTCGGA CCGGCGCCGG AGACGCGGCC CCGCGACAGC CTTAAGGTCC GCACCCGCAC GCCGTTCGCG GGCGCGACCT TCGTCAACAC CTCGCCCGCG GTGGGCGAGG AGCTTCAGGC GGACCTGCCG GACGAGGGCG TGGCGGTGAC CACCGTCGAA GACGGCTCGC TCGCCGGCCG GGCGGGCTTC CGCAAGGGTG ACGTGATCGT GGCGATCAAC GGCATGCCGA TCGCCTCGAC GAAGGATCTG GAGCGGGTGA CGCAGCGCAA TCTCGGCCTG TGGGAGGTCG CGATCAACCG CGGCGGTGAG GTTCTGACCT CGGTGTTCGG CGGGTAG
|
Protein sequence | MPSSPLLPAC LVSVLLAAAP ASAQMANTPG KPAAEKASPD KAVPLSKGEI QLSFAPVVKR AAPSVVNVYA SHVEKRSARS NAMEEFMRRF FGEDRPGRGP SGLPGERAQR SLGSGVIVDG SGLVITNNHV IENMNEVKVA LADKREFEAQ IVLRDPRTDL AVLKIKGPAD IASMPIGDSD HLEVGDFVMA IGNPFGVGQT VTQGIVSALA RTQVGSSDYQ FFIQTDAAIN PGNSGGALVD LKGHLVGINT AIYSQSGGSH GIGFAIPASM VRAVVETAKS GGSLVRRPWL GARVQGVTPD IAESVGLDRP TGVLVASMQA KSPAEEAGLK RGDVILTVDG QTVEDPEAFG YRYALKGISG TADFGILRGT KRQTVQIKLG PAPETRPRDS LKVRTRTPFA GATFVNTSPA VGEELQADLP DEGVAVTTVE DGSLAGRAGF RKGDVIVAIN GMPIASTKDL ERVTQRNLGL WEVAINRGGE VLTSVFGG
|
| |
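The relationship between the gene sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, not the pipeline that produced the record: it assumes the gene sequence is given on the coding strand (it starts with ATG and ends with TAG, even though the gene lies on the minus strand of the replicon), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. Table 11 differs only in the start codons it permits, which does not matter here because the start codon is ATG. The names `BASES`, `AMINO_ACIDS`, and `translate` are hypothetical.

```python
# Codon table in NCBI order: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Characters other than A, C, G, T raise ValueError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGCCCTCCT CGCCGCTCCT GCCGGCCTGC"))  # MPSSPLLPAC
print(translate("GGCGGGTAG"))                         # GG
```

The two outputs match the first ten residues (MPSSPLLPAC) and the final two residues (GG) of the protein sequence listed in the record.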