Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_2883 |
Symbol | |
ID | 7115690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 3031229 |
End bp | 3032764 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643525633 |
Product | protease Do |
Protein accession | YP_002421650 |
Protein GI | 218530834 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTCG CCGCGAACGC CGTCCGGGGT CGAACGCCGT CGTTCGCCCG GCGCGCCTCA TCGGCCCTGG CCGCTGCGGT GCTGGGCGTC ACCGTCACGG TCACCGCCCT GCCGCTCCCC GCCTTCGCCC GTGGCCCGGA ATCGCTCGCC GACCTTGCCG ACAAGGTGAC GGATGCGGTG GTGAACATCT CGGCCTCGAC GACGGTCGAA GCCAGCAACC GCGGCGGCCG GACCATGCCG CAACTGCCGC AGGGCACACC GTTCGAGGAT CTCTTCGAGG AGTTCTTCAA GCGGCGCGGC CAGGGCGCGC CGAAGGGTGA CGACGAAAGC CCGCGCGGAC CGACGCGCAA GTCGAACTCG CTCGGCTCCG GCTTCATCAT CGACGCCTCG GGCATCGTGG TGACGAACAA CCACGTCATC GGCGACGCCA ACGACATTCA GGTCATCCTG AGCGACGGCA CCAAGCTGAA GGCGGAGATC ATCGGCAAGG ATTCGAAGAT CGACCTCGCC CTGCTTCGGG TGAAGCCGAC CGCCGAGCGC CCGCTTAAGG CCGTGCCCTT CGGCGATTCC GACAAGATGC GCCCGGGCGA CTGGGTGATG GCGATCGGCA ACCCGTTCGG CCTCGGCGGC TCGGTCTCCG CCGGCATCGT CTCGGCGCGG GGCCGCAACA TCGAGTCCGG TCCCTACGAC AACTACATCC AGACGGATGC GGCCATCAAC AAGGGCAATT CCGGCGGCCC GCTGTTCAAC ATGGACGGCG AGGTGATCGG CATCAACACC GCCATCCTTT CGCCCTCCGG CGGTTCGGTG GGCATCGGCT TCGCGGTGCC GTCGGCAACC GCCGGTCAGG TCGTCGATCA GCTCCGCCAG TTCGGCGAGG TCCGCCGCGG CTGGATCGGC GTGCGCATCC AGAACGTCGA TGAGGCCACC GCCGAGGCGC TCGGTCTGAA GGGCGGCGCC AAGGGTGCGC TAGTGGCCGG CGTCGATGAG AAGGGCCCGG CCAAGACCGC GGGGCTCGAG GTCGGCGACG TCATCGTCAA GTTCAACGGT GTGCCGGTGA AATCCTCCAG CGAGCTGCCG CGGATCGTCG CCGCGACCCC CGTGGGCAAG TCCGTGGACG TTCAGGTCGT GCGCAAGGGC GAGGAGCAGA CGAAATCCGT CGTGCTCGGT CGCCTCGAGG ACGGCGAGAA GGCTCAGGTC GCCAACCTCA AGCAGCCGGA GGCGGACTCG GTCAATCGCC AGGTCCTCGG CCTCAACCTC TCCGGCCTCA ACGACGAGGT GCGGCGCCGG TATGGCATCA AGGAGAGCGT CAAGACCGGC GTGGTCGTGA CCAAGGTCGA TCCCAACTCG ACCGCGGCCG ACAAGCGCAT CCAGCCGGGC GAGGTCATCG TCGAGGTCGG CCAAGAGGCG ATCTCGAACC CGGCCGACGT GACGAAGCGC GTCGAGGCGC TCAAGAAGGA GGGCCGCAAG TCGGTCCTGT TGCTGGTGGC CAGCACCAGC GGCGACGTGC GCTTCGTGGC GATCGGGTTG GAGTAA
|
Protein sequence | MRLAANAVRG RTPSFARRAS SALAAAVLGV TVTVTALPLP AFARGPESLA DLADKVTDAV VNISASTTVE ASNRGGRTMP QLPQGTPFED LFEEFFKRRG QGAPKGDDES PRGPTRKSNS LGSGFIIDAS GIVVTNNHVI GDANDIQVIL SDGTKLKAEI IGKDSKIDLA LLRVKPTAER PLKAVPFGDS DKMRPGDWVM AIGNPFGLGG SVSAGIVSAR GRNIESGPYD NYIQTDAAIN KGNSGGPLFN MDGEVIGINT AILSPSGGSV GIGFAVPSAT AGQVVDQLRQ FGEVRRGWIG VRIQNVDEAT AEALGLKGGA KGALVAGVDE KGPAKTAGLE VGDVIVKFNG VPVKSSSELP RIVAATPVGK SVDVQVVRKG EEQTKSVVLG RLEDGEKAQV ANLKQPEADS VNRQVLGLNL SGLNDEVRRR YGIKESVKTG VVVTKVDPNS TAADKRIQPG EVIVEVGQEA ISNPADVTKR VEALKKEGRK SVLLLVASTS GDVRFVAIGL E
|
| |