Gene M446_1718 details


Gene Information

Locus tag: M446_1718
Symbol:
ID: 6135399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1930229
End bp: 1931644
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 73%
IMG OID: 641641976
Product: protease Do
Protein accession: YP_001768645
Protein GI: 170739990
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
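
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene span includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose span includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1930229, 1931644)
print(length)                     # 1416
print(protein_length_aa(length))  # 471
```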


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0759509
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTGTC CGATGCGCCT TGCCCTGGTC GCCGCGCTCG CCCTCGGCCC CGCCGCGGCC 
CTGGCCGACC CCGCCCCGCC GACCCGCGCC CTGCCCGAGA GCCGCGCGCA GGTGCAGCTC
TCCTTCGCGC CGATCGTGCG CAAGGCGGCG CCCTCGGTGG TCAACGTCTA CGGCGCCCAC
GTCGAGAAGC GCTCGGCCAA CCGCAACGCC ATGGACGAGT TCCTGCGCCG CTTCTTCGGG
GAATCCGGCC CCGGCGGCAC GCAGGAGCGG GCGCAGCGCT CGCTGGGCTC GGGCGTGATC
GTCGACGCCT CCGGGCTGAT CGTCACCAAC AACCACGTCG TCGAGAACAT GAACGAGGTG
AAGGTGGCGC TGACCGACCG GCGGGAATTC CCGGCCGAGA TCGTGCTGCG CGACCCCCGC
ACCGACCTCG CGGTCCTGCG CATCAAGGCG CCGGGCGGGA TCGCCGCGAT GGAGTTCGGC
GATTCCGAGG CGCTGCAGGT CGGCGACTTC GTGATCGCCA TCGGCAACCC CTTCGGGGTC
GGCCAGACGG TGACGCAGGG CATCGTCTCG GCCCTCGCCC GCACGCAGGT GGGCTCGGCC
GATTACCAGT TCTTCATCCA GACGGACGCG GCCATCAACC CGGGCAATTC GGGGGGCGCC
CTGGTCGACC TGTCGGGCGC GCTGGTCGGC ATCAACACGG CGATCTTCTC GCAATCGGGC
GGCAGCCACG GCATCGGCTT CGCCATCCCC GCCAGCATGG TGCGGGCGGT GGTGGAGACG
GCGCGGGGCG GCGGGCGGAT CGTGCGCCGG CCCTGGCTCG GGGCGCGGCT GCAGAACGTG
ACGCCCGACA TCGCCGACAG CGTCGGCCTC GACCACCCGA CCGGGGTGCT GGTGGCCGGC
ATGCTGGGCA AGAGTCCGGC GGAGGAATCC GGCCTCAAGC GCGGCGACGT GATCCTGAGC
GTGGACGGCC AGCCGGTCGA CGACCCGGAG GCCTTCGGCT ACCGCTTCGC CCTCAAGGGC
ATCAGCGGCG AGACGAAGCT CGCGGTGCTG CGCGGGTCGA ACCGCATCAC GCTGCCGGTG
CGCCTCGCCC CGGCGCCCGA GACGCGCCCG CGCGACACGC TCAAGATCCG CACCCGCTCG
CCCTTCCTGG GGGCGACGGC GGTCAACCTC TCGCCGGCCG TCGCCGAGGA GCTGCAGCTC
GACCTCCCGG CGGACGGGGT GGTGATCGCC GAGGTCGACG GCGGCAGCAT CGCGGCCCGG
GCCGGGCTGC AGAAGGGCGA CGTGATCGTG GCGGTGAACG GGGCCTCGGT CGCCAGCACC
AGGGACCTCG ACCGCATCAC CCGCAACAGC CTCTCGGCCT GGGAGGTGAC GATCAACCGC
GGCGGCCAGC AGCTCACCTC GCTGTTCAGC GGCTGA
 
Protein sequence
MRCPMRLALV AALALGPAAA LADPAPPTRA LPESRAQVQL SFAPIVRKAA PSVVNVYGAH 
VEKRSANRNA MDEFLRRFFG ESGPGGTQER AQRSLGSGVI VDASGLIVTN NHVVENMNEV
KVALTDRREF PAEIVLRDPR TDLAVLRIKA PGGIAAMEFG DSEALQVGDF VIAIGNPFGV
GQTVTQGIVS ALARTQVGSA DYQFFIQTDA AINPGNSGGA LVDLSGALVG INTAIFSQSG
GSHGIGFAIP ASMVRAVVET ARGGGRIVRR PWLGARLQNV TPDIADSVGL DHPTGVLVAG
MLGKSPAEES GLKRGDVILS VDGQPVDDPE AFGYRFALKG ISGETKLAVL RGSNRITLPV
RLAPAPETRP RDTLKIRTRS PFLGATAVNL SPAVAEELQL DLPADGVVIA EVDGGSIAAR
AGLQKGDVIV AVNGASVAST RDLDRITRNS LSAWEVTINR GGQQLTSLFS G
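
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch of how the two relate, the snippet below strips that layout and translates the opening and closing blocks of the gene sequence. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the tables differ in permitted start codons); the helper names `clean` and `translate` are illustrative, not part of any library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those of the standard code, assumed identical for table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last line of the gene sequence in this record.
print(translate("ATGCGCTGTC CGATGCGCCT TGCCCTGGTC"))           # MRCPMRLALV
print(translate("GGCGGCCAGC AGCTCACCTC GCTGTTCAGC GGCTGA"))    # GGQQLTSLFSG
```

Both results match the start and end of the protein sequence listed in the record.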