Gene Mext_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2221 
Symbol 
ID5832696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2465343 
End bp2466809 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content69% 
IMG OID641368020 
Productprotease Do 
Protein accessionYP_001639687 
Protein GI163851644 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.370789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCCT CGCCGCTCCT GCCGGCCTGC CTCGTATCCG TGCTTCTCGC CGCCGCGCCT 
GCGAGCGCGC AGATGGCGAA CACGCCCGGA AAGCCCGCCG CGGAGAAGGC CTCCCCGGAC
AAGGCGGTGC CCCTGTCAAA GGGCGAGATC CAGCTCTCCT TCGCTCCCGT GGTGAAGCGG
GCGGCGCCCT CCGTGGTCAA CGTCTATGCC TCCCATGTCG AGAAGCGCTC CGCACGCTCC
AACGCCATGG AAGAGTTCAT GCGCCGCTTC TTCGGCGAGG ACCGTCCGGG CCGCGGCCCC
AGCGGCCTGC CCGGCGAGCG GGCGCAGCGC TCCCTCGGCT CGGGCGTGAT CGTCGATGGC
TCGGGCCTCG TCATCACCAA CAACCACGTC ATCGAGAACA TGAACGAGGT GAAGGTGGCG
CTCGCCGACA AGCGCGAGTT CGAGGCGCAG ATCGTGCTGC GCGACCCCCG CACCGACCTC
GCGGTGCTCA AGATCAAGGG CCCGGCCGAC ATCGCCTCGA TGCCGATCGG CGATTCCGAC
CACTTGGAGG TCGGCGATTT CGTCATGGCA ATCGGCAACC CGTTCGGCGT CGGGCAGACC
GTGACGCAGG GCATCGTCTC GGCGCTGGCC CGCACCCAGG TCGGATCGTC GGACTACCAG
TTCTTCATCC AGACCGATGC GGCGATCAAT CCGGGCAATT CCGGCGGCGC GCTGGTGGAC
CTGAAGGGGC ATCTCGTCGG CATCAACACC GCGATCTATT CGCAGTCCGG CGGCAGCCAC
GGCATCGGCT TCGCCATTCC CGCGAGCATG GTCCGCGCCG TGGTGGAGAC CGCCAAGAGC
GGCGGCAGCC TCGTGCGCCG GCCCTGGCTC GGGGCGCGGG TGCAGGGCGT AACCCCGGAT
ATCGCCGAGA GCGTCGGGCT TGACCGGCCG ACCGGTGTGC TGGTGGCGAG CATGCAGGCC
AAGAGCCCGG CCGAGGAAGC CGGTCTCAAG CGCGGCGACG TGATCCTCAC GGTCGATGGA
CAGACCGTCG AAGATCCGGA AGCCTTCGGC TACCGCTACG CCCTCAAGGG CATTTCCGGC
ACAGCCGATT TCGGCATCCT GCGCGGCACC AAGCGGCAGA CGGTCCAGAT CAAGCTCGGA
CCGGCGCCGG AGACGCGGCC CCGCGACAGC CTTAAGGTCC GCACCCGCAC GCCGTTCGCG
GGCGCGACCT TCGTCAACAC CTCGCCCGCG GTGGGCGAGG AGCTTCAGGC GGACCTGCCG
GACGAGGGCG TGGCGGTGAC CACCGTCGAA GACGGCTCGC TCGCCGGCCG GGCGGGCTTC
CGCAAGGGTG ACGTGATCGT GGCGATCAAC GGCATGCCGA TCGCCTCGAC GAAGGATCTG
GAGCGGGTGA CGCAGCGCAA TCTCGGCCTG TGGGAGGTCG CGATCAACCG CGGCGGTGAG
GTTCTGACCT CGGTGTTCGG CGGGTAG
 
Protein sequence
MPSSPLLPAC LVSVLLAAAP ASAQMANTPG KPAAEKASPD KAVPLSKGEI QLSFAPVVKR 
AAPSVVNVYA SHVEKRSARS NAMEEFMRRF FGEDRPGRGP SGLPGERAQR SLGSGVIVDG
SGLVITNNHV IENMNEVKVA LADKREFEAQ IVLRDPRTDL AVLKIKGPAD IASMPIGDSD
HLEVGDFVMA IGNPFGVGQT VTQGIVSALA RTQVGSSDYQ FFIQTDAAIN PGNSGGALVD
LKGHLVGINT AIYSQSGGSH GIGFAIPASM VRAVVETAKS GGSLVRRPWL GARVQGVTPD
IAESVGLDRP TGVLVASMQA KSPAEEAGLK RGDVILTVDG QTVEDPEAFG YRYALKGISG
TADFGILRGT KRQTVQIKLG PAPETRPRDS LKVRTRTPFA GATFVNTSPA VGEELQADLP
DEGVAVTTVE DGSLAGRAGF RKGDVIVAIN GMPIASTKDL ERVTQRNLGL WEVAINRGGE
VLTSVFGG