Gene Mchl_2883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_2883 
Symbol 
ID7115690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp3031229 
End bp3032764 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content68% 
IMG OID643525633 
Productprotease Do 
Protein accessionYP_002421650 
Protein GI218530834 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCG CCGCGAACGC CGTCCGGGGT CGAACGCCGT CGTTCGCCCG GCGCGCCTCA 
TCGGCCCTGG CCGCTGCGGT GCTGGGCGTC ACCGTCACGG TCACCGCCCT GCCGCTCCCC
GCCTTCGCCC GTGGCCCGGA ATCGCTCGCC GACCTTGCCG ACAAGGTGAC GGATGCGGTG
GTGAACATCT CGGCCTCGAC GACGGTCGAA GCCAGCAACC GCGGCGGCCG GACCATGCCG
CAACTGCCGC AGGGCACACC GTTCGAGGAT CTCTTCGAGG AGTTCTTCAA GCGGCGCGGC
CAGGGCGCGC CGAAGGGTGA CGACGAAAGC CCGCGCGGAC CGACGCGCAA GTCGAACTCG
CTCGGCTCCG GCTTCATCAT CGACGCCTCG GGCATCGTGG TGACGAACAA CCACGTCATC
GGCGACGCCA ACGACATTCA GGTCATCCTG AGCGACGGCA CCAAGCTGAA GGCGGAGATC
ATCGGCAAGG ATTCGAAGAT CGACCTCGCC CTGCTTCGGG TGAAGCCGAC CGCCGAGCGC
CCGCTTAAGG CCGTGCCCTT CGGCGATTCC GACAAGATGC GCCCGGGCGA CTGGGTGATG
GCGATCGGCA ACCCGTTCGG CCTCGGCGGC TCGGTCTCCG CCGGCATCGT CTCGGCGCGG
GGCCGCAACA TCGAGTCCGG TCCCTACGAC AACTACATCC AGACGGATGC GGCCATCAAC
AAGGGCAATT CCGGCGGCCC GCTGTTCAAC ATGGACGGCG AGGTGATCGG CATCAACACC
GCCATCCTTT CGCCCTCCGG CGGTTCGGTG GGCATCGGCT TCGCGGTGCC GTCGGCAACC
GCCGGTCAGG TCGTCGATCA GCTCCGCCAG TTCGGCGAGG TCCGCCGCGG CTGGATCGGC
GTGCGCATCC AGAACGTCGA TGAGGCCACC GCCGAGGCGC TCGGTCTGAA GGGCGGCGCC
AAGGGTGCGC TAGTGGCCGG CGTCGATGAG AAGGGCCCGG CCAAGACCGC GGGGCTCGAG
GTCGGCGACG TCATCGTCAA GTTCAACGGT GTGCCGGTGA AATCCTCCAG CGAGCTGCCG
CGGATCGTCG CCGCGACCCC CGTGGGCAAG TCCGTGGACG TTCAGGTCGT GCGCAAGGGC
GAGGAGCAGA CGAAATCCGT CGTGCTCGGT CGCCTCGAGG ACGGCGAGAA GGCTCAGGTC
GCCAACCTCA AGCAGCCGGA GGCGGACTCG GTCAATCGCC AGGTCCTCGG CCTCAACCTC
TCCGGCCTCA ACGACGAGGT GCGGCGCCGG TATGGCATCA AGGAGAGCGT CAAGACCGGC
GTGGTCGTGA CCAAGGTCGA TCCCAACTCG ACCGCGGCCG ACAAGCGCAT CCAGCCGGGC
GAGGTCATCG TCGAGGTCGG CCAAGAGGCG ATCTCGAACC CGGCCGACGT GACGAAGCGC
GTCGAGGCGC TCAAGAAGGA GGGCCGCAAG TCGGTCCTGT TGCTGGTGGC CAGCACCAGC
GGCGACGTGC GCTTCGTGGC GATCGGGTTG GAGTAA
 
Protein sequence
MRLAANAVRG RTPSFARRAS SALAAAVLGV TVTVTALPLP AFARGPESLA DLADKVTDAV 
VNISASTTVE ASNRGGRTMP QLPQGTPFED LFEEFFKRRG QGAPKGDDES PRGPTRKSNS
LGSGFIIDAS GIVVTNNHVI GDANDIQVIL SDGTKLKAEI IGKDSKIDLA LLRVKPTAER
PLKAVPFGDS DKMRPGDWVM AIGNPFGLGG SVSAGIVSAR GRNIESGPYD NYIQTDAAIN
KGNSGGPLFN MDGEVIGINT AILSPSGGSV GIGFAVPSAT AGQVVDQLRQ FGEVRRGWIG
VRIQNVDEAT AEALGLKGGA KGALVAGVDE KGPAKTAGLE VGDVIVKFNG VPVKSSSELP
RIVAATPVGK SVDVQVVRKG EEQTKSVVLG RLEDGEKAQV ANLKQPEADS VNRQVLGLNL
SGLNDEVRRR YGIKESVKTG VVVTKVDPNS TAADKRIQPG EVIVEVGQEA ISNPADVTKR
VEALKKEGRK SVLLLVASTS GDVRFVAIGL E