Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_5887 |
Symbol | |
ID | 6133125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 6475852 |
End bp | 6477366 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641645996 |
Product | protease Do |
Protein accession | YP_001772608 |
Protein GI | 170743953 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.685816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGC CCGTTCCCGC CTCCCGTCTC CATCGCAAGG CGCTCGCGTC CGTGGCCGCC GTGACGCTCG TGGCGACGGG CGCGCTCGGC TCCGCCTTCC TGCCGCCGAG CACGCCGGTC CTCGCCCAGG CCCTGCCGCA GACCCCGATC ACCGCCCCCG AGCACCCGCC GGGCAGCTTC GCGCCGATCG TCAACCGGGT GAAGCCCGGC GTCGTCTCGG TGAAGGTCAA GCTCAAGGAC GACGCGGCGG ATGACGAGGA GGGCGGCCCC GGCGGCCAGA ACGTGCCGCC GCAGCTGCGC GAGTTCTTCC GCCGCTTCGG CGAGAACGGC ATGCCCAACC GGCCGCACCG CAACGGCGGC CGCGCCGCGC AGGGCTCGGG CTTCTTCATC TCGGCGGACG GCTACGTCGT CACCAACAAC CACGTGGTCG AGAACGCCAA GTCGGTGGAG GTCACCCTCG ACGACGGCCG CACCCTCGAC GCCAAGGTGG TCGGCACCGA CCCGAAGACC GACCTCGCGC TGCTCAAGGT CACGGAGGGC AACGGCTCCT TCCCCTACGT GAGGCTCGCG CACGGCGCGC CGCAGGTCGG CGACTGGGTG GTGGCGATCG GCAACCCCTT CGGCCTCGGC GGCACCGTCA CGGCCGGCAT CGTCTCGGCC CGCGGCCGCG ACATCGGCGC CGGGCCCTAC GACGACTTCC TGCAGATCGA TGCGCCGATC AACAAGGGCA ATTCGGGCGG CCCGACCTTC AACGTGTCGG GCGAGGTCGT CGGCGTGAAC ACTGCCATCG CCTCGCCGTC GGGCGGCAAT GTGGGCCTCG CCTTCGCCAT CCCCTCCGAG ACGGTGCAGG CGGTCGTCGA CCAGCTGCGG ACGGACGGCA AGGTCGCCCG CGGCTATCTG GGCCTCCAGA TCCAGCCCGT CACCAAGGAC ATCGCCGAGG GGCTCGGCCT CGACAAGGCG AAGGGCGCGC TCGTCACCAG CGCCCAGGAC GGCACGCCGG CCGCCAAGGC GGGCCTGAAG TCCGGCGACG TGGTCCAGGC GGTGAACGGG GATCCGGTCG GCGACGCGCG CGAATTGTCG CGCCGGATCG CCTCGATGAA GCCGGGCACC AAGGTCCAGC TGTCCTACCT GCGCGGCGGC AAGACCGACA CCGCGACGGT CGAACTCGCG ACCCTGCCGA ACGACACCCG GGTCGCGGCC CGGGAGGAGC GCGGGCGGGG CTCGGACGCG CAGCCGCGGC TCGGCCTGAG CCTCGCGCCC GCCGACGCGG TGGGCGCCGG CCAGGAGGGC GTGGCGGTGG TGAACGTCGA TCCGGACGGC CCGGCGGCGG CCAAGGGCAT CGAGCCCGGC GACGTCATCC TCGACGTCGG CGGCCAGCCG GTCTCCTCGG TCTCCGACGT GCAGGGCCGG ATCCGGGCCG CCGAGCGCGA CGGCCGCAAG GCCGTGCTGA TGCGGGTGAA GAGCGACAAG GGCACGCGCT TCGTCGCCAT CGCCCTCCAG AACCGCAACG GCTGA
|
Protein sequence | MSEPVPASRL HRKALASVAA VTLVATGALG SAFLPPSTPV LAQALPQTPI TAPEHPPGSF APIVNRVKPG VVSVKVKLKD DAADDEEGGP GGQNVPPQLR EFFRRFGENG MPNRPHRNGG RAAQGSGFFI SADGYVVTNN HVVENAKSVE VTLDDGRTLD AKVVGTDPKT DLALLKVTEG NGSFPYVRLA HGAPQVGDWV VAIGNPFGLG GTVTAGIVSA RGRDIGAGPY DDFLQIDAPI NKGNSGGPTF NVSGEVVGVN TAIASPSGGN VGLAFAIPSE TVQAVVDQLR TDGKVARGYL GLQIQPVTKD IAEGLGLDKA KGALVTSAQD GTPAAKAGLK SGDVVQAVNG DPVGDARELS RRIASMKPGT KVQLSYLRGG KTDTATVELA TLPNDTRVAA REERGRGSDA QPRLGLSLAP ADAVGAGQEG VAVVNVDPDG PAAAKGIEPG DVILDVGGQP VSSVSDVQGR IRAAERDGRK AVLMRVKSDK GTRFVAIALQ NRNG
|
| |