Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_5651 |
Symbol | |
ID | 7119187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011758 |
Strand | + |
Start bp | 272002 |
End bp | 273888 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643528306 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002424302 |
Protein GI | 218533487 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.460503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGTTT CGCTTCCGCG TTGGCTTGTC GACTTCATCC TCCTCGGCCC AGCGAACGAC CGTCGGCAGT TGCAGGAGTC GCCCGTCCTC GGGGACGTTT GGACCGCGTT CGCAACGGAT CCGGCTGGGA CGCAGGACCT GCTGATCGAG CCCTATCGAA CCCAGACCGC AGGCCGGGTG GCAGCCGCCT TGGCGGATCG GTTTGCCCGT ATCGATCGGC ACGGACACCT CGCGGCGGGG CGGGTGCCGG GCGCGAGGCA GCCTCGGTCC GCCCGCAACA TCGCCTACCT ACAGGGCATC GTGGCCGCGC GGCTATCCTT CGAGGACCTG CTCTGCGCTG TCGTGCCGAT GAGCGATTGG TGGGCGAGGC GAGAAACCCG TGACCGGATC GCGCAGGTCA GTCCTGAGCG GATCGCCGCG ACCGTCGATG CGCTGATCGC TTCTGCTCGA GACTTCGAGG CAGGGAAGCC GGAGCCCGCC ACACCAAACA TCACGCCCTT GGAGAGATTC GTCGCCTTGA TGGGCGTCGT AGTCTGGGCT GCCGACGCGC CGTCGAGGAT TGGGCAGGGT CGAGGCACTC GGACGCTTCG AAAGCTCGGC GACATACTGA AAGTGCTGAC GGGCACCGCC GTCACCGAGC GATTGCTACC CATCCTGCAT GGGATGCTCG AAGCCGGCGA AGCGGGACCG ATGGTGTGGC AGGTCTCGCT GAACCGACAG GCCGCACCCG CGCTTGAGAA GTCGGTGCCC GCCGTCAAGG CGGATGCGGC CCGTACCTTG TTCGGGGTGA GGTGCGACGC GATCACCTGG GCCATCCTGG ACTCCGGGAT CGACGCCGAT CATCCGGTGT TCAACGGACC CGACGGTTGC CGGGTCCGGA AGACGCTGGA CTTCAGCCGC ATGCGCCAGA TCGTCAGCCT CGACAACACG GACGAGGCCA CCCTTGAGGC GCGCGTGGAC GAATTGCTGC AGCGCGACCT CGCTACAGAG ATGACCAGGG AGGAGGCGAA GGCCGACCTG CGGGAACTCG CCTCGGACGC TCGCAAGGGG CGCCCCGTCC AGTGGGAGAT CGTCGAGAGA CTCGTCACGC TCCGTTGCGG CACGCCGCCA TATGGGGGAC ACGGCACACA CGTCGCCGGC ATCATCGGGG GGAGGGCGCC CGAGCCGGAC AGTTCCTTCG CGGACGGCAT GTGCCCGGAG ATCAATCTCC TGGACGTCCG CGTCCTTTCA CGCTCCGCCG CCGACACGGA GTTCGCGGTC ATCGCCGCGC TCCAATACCT GCGCGACCTG AACGAACGGC ACAGCTTCAT CACAGTGCAC GGGGCGAACC TGAGCCTCTC GATCCCGCAC GACGTCCGAA ATTTCGCGTG CGGGCGCACG CCGGTCTGCA ACGAGTGCGA GCGCCTGATC GAGAGCGGCG TCGTGGTCGT CGCCGCCGCC GGCAACCGGG GTTACCACAG CTACGAGACC CGGGACGGCG CCTACGAAGG TTACGCCGCC TTTAGCATCA CCGATCCCGG CAACGCGGAT GGGGTCATCA CCGTCGGCGC GACCCACCGC TACTGGCCGC ACACCTATGG GGTCAGCTTC TTCTCCAGCC GCGGCCCGAC GGGCGACGGC CGCATGAAGC CGGACCTCGT AGCGCCGGGT GAGCGCGTCC AGGGCCCGCT CCCCGAGGAG GGCTGGGGCC AGCAGGACGG CACGAGCATG GCAGCGCCGC ACGTCAGCGG GGCGGCGGCG ATGCTGATGG CCAGGTATTC TGAGCTGATC GGGCAGCCGC ATCGCATCAA GCAGGTGATC TGTGCGAGCG CGACCGACCT GGGCCGCGAG CGCAGCTTCC AGGGCGGTGG GATGCTCGAC GTCCTGCGGG CGTTCCAGAG CGTGTGA
|
Protein sequence | MPVSLPRWLV DFILLGPAND RRQLQESPVL GDVWTAFATD PAGTQDLLIE PYRTQTAGRV AAALADRFAR IDRHGHLAAG RVPGARQPRS ARNIAYLQGI VAARLSFEDL LCAVVPMSDW WARRETRDRI AQVSPERIAA TVDALIASAR DFEAGKPEPA TPNITPLERF VALMGVVVWA ADAPSRIGQG RGTRTLRKLG DILKVLTGTA VTERLLPILH GMLEAGEAGP MVWQVSLNRQ AAPALEKSVP AVKADAARTL FGVRCDAITW AILDSGIDAD HPVFNGPDGC RVRKTLDFSR MRQIVSLDNT DEATLEARVD ELLQRDLATE MTREEAKADL RELASDARKG RPVQWEIVER LVTLRCGTPP YGGHGTHVAG IIGGRAPEPD SSFADGMCPE INLLDVRVLS RSAADTEFAV IAALQYLRDL NERHSFITVH GANLSLSIPH DVRNFACGRT PVCNECERLI ESGVVVVAAA GNRGYHSYET RDGAYEGYAA FSITDPGNAD GVITVGATHR YWPHTYGVSF FSSRGPTGDG RMKPDLVAPG ERVQGPLPEE GWGQQDGTSM AAPHVSGAAA MLMARYSELI GQPHRIKQVI CASATDLGRE RSFQGGGMLD VLRAFQSV
|
| |