Gene M446_1086 details


Gene Information

Locus tag: M446_1086
Symbol:
ID: 6131616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1210776
End bp: 1212281
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 71%
IMG OID: 641641376
Product: protease Do
Protein accession: YP_001768048
Protein GI: 170739393
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
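
The length fields above are consistent with the coordinates, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1210776
end_bp = 1212281

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1506
print(protein_length)  # 501
```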


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0760064
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCTG CCGAGTCCGC GCCCGCGCGC CGATCCGAAG CCTTCGCGAA ACGGCGGCTG 
CCGGCGCTCG CGGGCGCCCT CCTGGCGCTC TCGGTCGGGA CCGCCGCGCT CCCCTCCGCG
GCCCTCGCCA GGGGCCCCGA ATCCCTCGCC GACCTCACCG AGCAGGTCAC CGACGCGGTG
GTGAACATCT CGGCCTCGAC GACGGTGGAG ACCCGCGGCC GCACCCTGCC GCAGCTGCCC
CCCGGCACCC CCTTCGAGGA CCTGTTCGAG GATTTCTTCA ACCGGCGGGG CGGCGGCGAT
CAGCCGCGCC AGCCGCGCAA GTCGAACTCG CTCGGCTCCG GCTTCATCAT CGACGCCTCC
GGCATCGTGG TGACGAACAA CCACGTGATC GGCGACGCCA ACGACATCCA GGTCATCCTG
CACGACGGCC GCAAGCTGAA GGCGGAGATC GTCGGCAAGG ATTCCAAGAC CGACATCGCG
GTGCTGCGGG TCAAGCCGGA GGCGGACCGG CCGCTCAAGG CGGTGCCGCT CGGCGATTCC
GAGAAGATGC GGCCGGGCGA CTGGGTGATC GCGATCGGCA ACCCGTTCGG CCTCGGCGGC
TCGGTCTCGG CCGGCATCGT CTCGGCGCGC GGCCGCAACA TCGATTCGGG GCCCTACGAC
AACTACATCC AGACCGACGC GGCCATCAAC AAGGGCAATT CGGGCGGTCC GCTGTTCAAC
ATGAGCGGCG AGGTGATCGG CATCAACACG GCGATCCTGT CGCCGACCGG CGGCTCGGTC
GGCATCGGCT TCGCGGTCCC GACCGCGACG GCGGCCCCGG TGATCGAGCA GTTGCGCCAG
TACGGCGAGA CCCGTCGCGG CTGGCTCGGC GTGCGGATCC AGAACGTCGA CGACACCACC
GCCGAGGCGC TCGGCCTCAA GGGCGGCGCC CGCGGCGCGC TGATCGCCGG CATCGACGAG
AAGGGCCCGG CCAAGACCGC CGGCTTCGAG GTCGGCGACG TGATCGTGAA GTTCAACGGC
GTCGAGGTGA AGTCGTCGAG CGACCTGCCC CGCATCGTGG CGACGACGCC GGTCGGCAAG
ACCGTGGACG TGCTCACGAT CCGCAAGGGC GCGGAGCAGA CGCGGCCGGT CACCCTCGGG
CGGCTGGAGG ACAACGACAA GCCCCAGCCC GCCGCCCTCA ACCGGCCCCA GCCCGAGGCC
GACGTGACGC GCCAGGCCCT CGGCCTCAAC CTGACCGGCC TCTCCGAGGA GGCGCGGCGG
CGCTTCAACA TCAAGGACGG GCTGAAGGGG GTGGTAGTCA CCCGCGTCGA CCCGAACTCG
AACGCGGCCG ACAAGCGCAT CCAGGCCGGC GACCTCATCG TCGAGGTCGG CCAGGAGCCG
GTGAACTCAC CCTCGGACGT CACCCGCCGC CTGGATCAGA TCAAGAAGGA GGGCCGCAAA
TCCGCCCTGC TGCTGGTCTC GAACGCCCAG GGCGAGGTGC GGTTCGTGGC GCTGAGCCTC
GAATAG
 
Protein sequence
MPAAESAPAR RSEAFAKRRL PALAGALLAL SVGTAALPSA ALARGPESLA DLTEQVTDAV 
VNISASTTVE TRGRTLPQLP PGTPFEDLFE DFFNRRGGGD QPRQPRKSNS LGSGFIIDAS
GIVVTNNHVI GDANDIQVIL HDGRKLKAEI VGKDSKTDIA VLRVKPEADR PLKAVPLGDS
EKMRPGDWVI AIGNPFGLGG SVSAGIVSAR GRNIDSGPYD NYIQTDAAIN KGNSGGPLFN
MSGEVIGINT AILSPTGGSV GIGFAVPTAT AAPVIEQLRQ YGETRRGWLG VRIQNVDDTT
AEALGLKGGA RGALIAGIDE KGPAKTAGFE VGDVIVKFNG VEVKSSSDLP RIVATTPVGK
TVDVLTIRKG AEQTRPVTLG RLEDNDKPQP AALNRPQPEA DVTRQALGLN LTGLSEEARR
RFNIKDGLKG VVVTRVDPNS NAADKRIQAG DLIVEVGQEP VNSPSDVTRR LDQIKKEGRK
SALLLVSNAQ GEVRFVALSL E
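
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The following is a minimal sketch, not the database's own method: `clean`, `gc_content`, and `translate` are hypothetical helper names, and the codon table is the amino-acid assignment shared by NCBI translation tables 1 and 11 (table 11 differs only in its permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Amino-acid assignments in TCAG codon order (same for NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
first_blocks = "ATGCCCGCTG CCGAGTCCGC GCCCGCGCGC"
print(translate(first_blocks))  # MPAAESAPAR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared with the reported GC content field.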