Gene Mext_4692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4692 
Symbol 
ID5832140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5246147 
End bp5247658 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content70% 
IMG OID641370487 
Productprotease Do 
Protein accessionYP_001642131 
Protein GI163854088 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.601387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGA CTGTCCGCCG CCGCGCCTTC GCCTCCGTCG CCGCAGCCGC CCTCGTCGCG 
GGCGGCGCGG CCGGGTTCGG CCTGACCGAG CCCATGACCC CGGCTTACGC CCAGGCCCTG
CCCAAGACAC CGATCGAAGC GCCCGAGCAC CCGCCGGGCT CGTTTGCCAA CGTCGTCGAC
AAGGTGAAGC CGGGCGTCGT CGCCGTGAAG GTGAAGCTCG ACAACAGCGC CGACGATGAC
GACGACAGCG CGGGCGGCCC CAACCTGCAG CAGGTGCCGC CGCAACTGCG CGAATTCTTC
AAGCGCTTCG GCCAGGGTGG GCCGGGCGGT CAGGGTGGGC GCGGCATGCC GCAGCGCGGC
GAGCGCGGCG CGGTCGGCTC GGGCTTCATC ATCTCGGCGG ACGGCTACGT CGTCACCAAC
AACCACGTCG TCGACAAGGC CAAGACCGTG CAGGTCACGC TCGACGACAA CCGCACCCTC
GATGCCAAGG TGATCGGCAA GGATCCGAAG ACCGACATCG CGCTGCTCAA GATCACCGAG
AGCGGCAGCT ATCCCTACGT CCAGTTCGGC AAGAGCGCCC CGCGCGTTGG CGACTGGGTC
GTCGCCATCG GCAACCCGTT CGGCCTCGGC GGTACGGTGA CAGCGGGCAT CGTCTCGGCC
CGCGGTCGCG ACATCGGCGC CGGCCCCTAC GACGACTTCC TGCAGATCGA CGCGCCGATC
AACAAGGGCA ATTCCGGCGG CCCGACCTTC AACGTCAACG GCGAAGTCGT GGGCGTGAAC
ACGGCGATCG CCTCGCCGTC CGGCGGCTCG GTCGGCCTCG CCTTCGCGAT CCCCGCCGAG
ACGGTGCAGA CGGTGGTCGA TCAGCTCCGC ACCGACGGCA AGGTGGTGCG CGGCTATCTC
GGCGTGCAGG TCCAGCCGGT GACGAAGGAC ATCGCCGACG GGCTCGGCCT CGACAAGGCC
AAGGGCGCGC TGGTCGATCA CGCCGAGAAC GGCACGCCCG CGGCCAAGGC CGGCCTGAAA
TCGGGCGACG TGATCGAGTC GGTCAACGGC GCCCCGGTCA ACGATGCCCG CGACCTCTCG
CGCCGCATCG CCGGCCTCAA GCCCGGTACC GAGGTGAAGC TCGCCTATCT GCGGGGCGGC
AAGAGCGACG TCGCGACGGT CGAACTCGGC ACGCAGCCGA CCGACGCCAA GGTCGCGAGC
CGCAGTGATA GCTCGTCTGG TGGCCAGGCG CGCCTCGGCC TCAGCCTGGC CCCTGCCAGC
GAGATCGGCC TCGGCGACGA AGGCGTGGCG GTGATGGATG TCGATCCCGA CGGTCCGGCC
GCGGCCAAGG GCATCGCCCA GGGCGACGTG ATCCTGGATG TCGCCGGCAC CAGTGTCTCG
AAGCCCTCCG AGGTGCAGGC GCAGATTCGC GCCGCAGAAT CGAACGGCCG CAAGGCGGTG
CTGATGCGGG TGAAGAGCGC CAAGGGCCAG ACCCGCTTCG TCGCCGTGGC CCTCGGCAAG
AAGGAGGGCT GA
 
Protein sequence
MTMTVRRRAF ASVAAAALVA GGAAGFGLTE PMTPAYAQAL PKTPIEAPEH PPGSFANVVD 
KVKPGVVAVK VKLDNSADDD DDSAGGPNLQ QVPPQLREFF KRFGQGGPGG QGGRGMPQRG
ERGAVGSGFI ISADGYVVTN NHVVDKAKTV QVTLDDNRTL DAKVIGKDPK TDIALLKITE
SGSYPYVQFG KSAPRVGDWV VAIGNPFGLG GTVTAGIVSA RGRDIGAGPY DDFLQIDAPI
NKGNSGGPTF NVNGEVVGVN TAIASPSGGS VGLAFAIPAE TVQTVVDQLR TDGKVVRGYL
GVQVQPVTKD IADGLGLDKA KGALVDHAEN GTPAAKAGLK SGDVIESVNG APVNDARDLS
RRIAGLKPGT EVKLAYLRGG KSDVATVELG TQPTDAKVAS RSDSSSGGQA RLGLSLAPAS
EIGLGDEGVA VMDVDPDGPA AAKGIAQGDV ILDVAGTSVS KPSEVQAQIR AAESNGRKAV
LMRVKSAKGQ TRFVAVALGK KEG