Gene Mext_2656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2656 
Symbol 
ID5831060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2971027 
End bp2972562 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content68% 
IMG OID641368457 
Productprotease Do 
Protein accessionYP_001640119 
Protein GI163852076 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.419635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCG CCGCGAACGC CGTCCGGGGT CGAACGCCGT CGTTCGCCCG GCGCGCCTCA 
TCGGCCCTGG CCGCTGCGGT GCTGGGCGTC ACCGTCACGG TCACCGCCCT GCCGCTCCCC
GCCTTCGCCC GTGGCCCGGA ATCGCTCGCC GACCTTGCCG ACAAGGTGAC GGATGCGGTG
GTGAACATCT CGGCCTCGAC AACGGTCGAA GCCAGCAACC GCGGCGGCCG GACCATGCCG
CAACTGCCTC AGGGCACACC CTTCGAGGAT CTCTTCGAGG AGTTCTTCAA GCGGCGCGGC
CAGGGCGCCC CGAAGGGTGA CGACGAAAGC CCGCGCGGAC CGACGCGCAA GTCGAACTCG
CTCGGCTCCG GCTTCATCAT CGACGCCTCG GGCATCGTGG TGACGAACAA CCACGTCATC
GGCGACGCCA ACGACATTCA GGTCATCCTG AGCGACGGCA CCAAGCTCAA GGCAGAGATC
ATCGGCAAGG ATTCGAAGAT CGACCTCGCC CTGCTTCGGG TGAAGCCGAC GGCCGAGCGC
CCTCTCAAGG CCGTGCCCTT CGGCGATTCC GACAAGATGC GCCCGGGCGA CTGGGTGATG
GCGATCGGCA ACCCGTTCGG CCTCGGCGGC TCGGTCTCCG CCGGCATCGT CTCGGCGCGG
GGCCGCAACA TCGAGTCCGG ACCCTACGAC AACTACATCC AGACCGACGC GGCCATCAAC
AAGGGCAATT CCGGCGGTCC GCTGTTCAAC ATGGACGGAG AGGTGATCGG CATCAACACC
GCGATCCTTT CCCCCTCGGG CGGCTCGGTC GGCATCGGCT TCGCGGTGCC GTCGGCAACC
GCCGGTCAGG TCGTCGATCA GCTCCGCCAG TTCGGCGAGG TCCGCCGCGG CTGGATCGGC
GTGCGCATCC AGAACGTCGA TGAGGCCACC GCCGAGGCGC TCGGCCTGAA GGGCGGCGCT
AAGGGTGCGC TGGTGGCCGG CGTCGACGAG AAGGGCCCGG CCAAGACCGC GGGGCTCGAG
GTCGGCGACG TCATCGTCAA GTTCAACGGT GTGCCGGTGA AATCCTCCAG CGAGTTGCCG
CGCATCGTCG CCGCGACCCC GGTGGGCAAG TCCGTGGACG TCCAAGTCGT ACGCAAGGGC
GAGGAGCAGA CGAAATCTGT CGTGCTCGGT CGCCTCGAGG ACGGCGAGAA GGCTCAGGTC
GCCAACCTCA AGCAGCCGGA GGCGGAATCG GTCAATCGCC AGGTCCTCGG CCTCAACCTC
TCCGGCCTCA ACGACGAGGT GCGGCGCCGC TACGGCATCA AGGAGAGCGT CAAGACCGGC
GTGGTCGTCA CCAAGGTCGA TCCCAACTCG ACCGCCGCCG ACAAGCGCAT CCAGCCGGGC
GAGGTCATCG TCGAGGTTGG CCAGGAGGCG ATCTCGAACC CGGCCGACGT GACGAAGCGC
GTCGAGGCGC TCAAGAAGGA GGGCCGCAAG TCGGTGCTGC TGCTGGTGGC CAGCGCGAGC
GGCGACGTGC GCTTCGTCGC GATCGGGTTG GAGTAA
 
Protein sequence
MRLAANAVRG RTPSFARRAS SALAAAVLGV TVTVTALPLP AFARGPESLA DLADKVTDAV 
VNISASTTVE ASNRGGRTMP QLPQGTPFED LFEEFFKRRG QGAPKGDDES PRGPTRKSNS
LGSGFIIDAS GIVVTNNHVI GDANDIQVIL SDGTKLKAEI IGKDSKIDLA LLRVKPTAER
PLKAVPFGDS DKMRPGDWVM AIGNPFGLGG SVSAGIVSAR GRNIESGPYD NYIQTDAAIN
KGNSGGPLFN MDGEVIGINT AILSPSGGSV GIGFAVPSAT AGQVVDQLRQ FGEVRRGWIG
VRIQNVDEAT AEALGLKGGA KGALVAGVDE KGPAKTAGLE VGDVIVKFNG VPVKSSSELP
RIVAATPVGK SVDVQVVRKG EEQTKSVVLG RLEDGEKAQV ANLKQPEAES VNRQVLGLNL
SGLNDEVRRR YGIKESVKTG VVVTKVDPNS TAADKRIQPG EVIVEVGQEA ISNPADVTKR
VEALKKEGRK SVLLLVASAS GDVRFVAIGL E