Gene Mchl_5157 details

Gene Information

Locus tag: Mchl_5157
Symbol:
ID: 7116195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 5524726
End bp: 5526237
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 70%
IMG OID: 643527850
Product: protease Do
Protein accession: YP_002423849
Protein GI: 218533033
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DegQ family
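
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 5524726
end_bp = 5526237

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1512 503
```

Both numbers match the Gene Length and Protein Length fields of the record.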


Plasmid Coverage information

Number of covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 20
Fosmid unclonability p-value: 0.0527663
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATGA CTGTCCGCCG CCGCGCCTTC GCCTCCGTCG CCGCAGCCGC CCTCGTCGCG 
GGCGGCGCCG CCGGGTTCGG CCTGACCGAG CCCATGACCC CGGCTTACGC CCAGGCCCTG
CCCAAGACCC CGATCGAAGC GCCCGAGCAC CCGCCGGGCT CGTTTGCCAA CGTCGTCGAC
AAGGTGAAGC CGGGCGTCGT CGCCGTGAAG GTGAAGCTCG ACAACAGCGC CGACGATGAC
GACGACAGCG CGGGCGGTCC CAACCTGCAG CAGGTGCCGC CGCAGCTGCG CGAGTTCTTC
AAGCGCTTCG GCCAGGGCGG TCCGGGTGGT CAGGGCGGGC GCGGCATGCC GCAGCGCGGC
GAGCGCGGCG CGGTCGGCTC GGGCTTCATC ATCTCGGCGG ACGGCTACGT CGTCACCAAC
AACCACGTCG TCGACAAGGC CAAGACCGTG CAGGTCACGC TCGACGACAA CCGCACCCTC
GATGCCAAGG TGATCGGCAA GGATCCGAAG ACCGACATCG CGCTGCTCAA GATCACCGAG
AGCGGCAGTT ACCCCTATGT CCAGTTCGGC AAGAGCGCCC CGCGCGTCGG CGATTGGGTC
GTCGCCATCG GCAACCCGTT CGGCCTCGGC GGTACGGTGA CGGCGGGCAT CGTCTCGGCC
CGCGGCCGTG ACATCGGCGC CGGCCCCTAC GACGACTTCC TGCAGATCGA CGCGCCGATC
AACAAGGGCA ATTCCGGCGG CCCGACCTTC AACGTCAACG GTGAGGTCGT GGGCGTGAAC
ACGGCGATCG CCTCACCGTC CGGCGGCTCG GTCGGCCTCG CCTTTGCGAT CCCCGCCGAG
ACGGTGCAGA CGGTGGTCGA TCAGCTCCGC ACCGACGGCA AGGTGGTGCG CGGTTATCTC
GGCGTGCAGG TCCAGCCGGT GACCAAGGAC ATCGCCGACG GGCTCGGCCT CGACAAGGCC
AAGGGCGCGC TGGTCGATCA CGCCGAGAAC GGTACGCCCG CGGCCAAGGC CGGCCTGAAG
TCGGGTGACG TGATCGAGTC GGTCAACGGC GCCCCGGTCA ACGATGCCCG CGACCTCTCG
CGCCGCATCG CCGGCCTCAA GCCTGGCACC GAGGTGAAGC TCGCCTATCT GCGGGGCGGC
AAGAGCGACG TCGCGACGGT CGAACTCGGC ACGCAGCCGA CCGACGCCAA GGTCGCGAGC
CGCAGTGACA GCACGTCCGG TGGCCAGGCG CGCCTCGGCC TCAGCCTGGC CCCTGCCAGC
GAGATCGGCC TCGGCGACGA GGGCGTGGCG GTGATGGATG TCGATCCCGA CGGTCCGGCC
GCGGCCAAGG GCATCGCCCA GGGCGACGTG ATCCTGGATG TCGCTGGCAC CAGCGTCTCG
AAGCCCTCCG AGGTGCAGGC GCAGATCCGC GCCGCAGAAT CGAACGGCCG CAAGGCGGTG
CTGATGCGGG TGAAGAGCGC CAAGGGCCAG ACCCGCTTCG TCGCCGTGGC CCTCGGCAAG
AAGGAGGGCT GA
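
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch; `gc_content` is a hypothetical helper name, and it assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, so whitespace is stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used for layout in the record.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example input: the first row of the gene sequence above.
first_row = "ATGACCATGA CTGTCCGCCG CCGCGCCTTC GCCTCCGTCG CCGCAGCCGC CCTCGTCGCG"
print(round(gc_content(first_row), 1))
```

Passing the full 1512 bp sequence to the same function should give a value close to the 70% listed in the record, assuming the record rounds to a whole percent.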
 
Protein sequence
MTMTVRRRAF ASVAAAALVA GGAAGFGLTE PMTPAYAQAL PKTPIEAPEH PPGSFANVVD 
KVKPGVVAVK VKLDNSADDD DDSAGGPNLQ QVPPQLREFF KRFGQGGPGG QGGRGMPQRG
ERGAVGSGFI ISADGYVVTN NHVVDKAKTV QVTLDDNRTL DAKVIGKDPK TDIALLKITE
SGSYPYVQFG KSAPRVGDWV VAIGNPFGLG GTVTAGIVSA RGRDIGAGPY DDFLQIDAPI
NKGNSGGPTF NVNGEVVGVN TAIASPSGGS VGLAFAIPAE TVQTVVDQLR TDGKVVRGYL
GVQVQPVTKD IADGLGLDKA KGALVDHAEN GTPAAKAGLK SGDVIESVNG APVNDARDLS
RRIAGLKPGT EVKLAYLRGG KSDVATVELG TQPTDAKVAS RSDSTSGGQA RLGLSLAPAS
EIGLGDEGVA VMDVDPDGPA AAKGIAQGDV ILDVAGTSVS KPSEVQAQIR AAESNGRKAV
LMRVKSAKGQ TRFVAVALGK KEG
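
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the method used to produce this record. The record specifies translation table 11, whose codon-to-amino-acid assignments are the same as those of the standard code; table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for layout in the record.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACCATGA CTGTCCGCCG CCGCGCCTTC"))  # MTMTVRRRAF
print(translate("AAGGAGGGCT GA"))                     # KEG
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.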