Gene Mpop_5235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpop_5235 
Symbol 
ID6309243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium populi BJ001 
KingdomBacteria 
Replicon accessionNC_010725 
Strand
Start bp5602904 
End bp5604409 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content70% 
IMG OID642653916 
Productprotease Do 
Protein accessionYP_001927864 
Protein GI188584419 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGA CTGTCCGCCG CCGCGCCTTC GCTTCCGTTG CCGCGGCCGC CCTCGTTGCG 
GGCGGTGCAG CCGGGTTCGG CCTCACCGAA TCCGCGATGC CGGCCTACGC TCAGGCCCTG
CCCAAGACCC CGATCGAGGC CCCCGAGCAT CCGCCGGGCT CGTTTGCCAA CGTCGTCGAC
AAGGTGAAGC CGGGCGTCGT CGCCGTGAAG GTCAAGCTCG ACAACAGCGC CGGCGACGAT
GACGACAGCT CCGGCAACCC GAACCTCCAG CAGGTGCCGC CGCAGCTGCG CGAGTTCTTC
AAGCGCTTCG GCCAAGGTGG TCCCGGCGGA CGCGGCATGC CGCAGCAGCG CGGCGAGCGC
GGCGCGGTCG GCTCCGGCTT CATCATCTCG GCGGACGGCT ACGTCGTGAC CAACAACCAC
GTCGTCGATC ACGCCAAGAC CGTGCAGGTC ACCCTCGACG ACGGCCGGAC CCTCGACGCC
AAGGTCATCG GCAAGGACCC GAAGACCGAC ATCGCGCTCC TGAAGATCAC CGAGAGCGGC
AGCTACCCCT ACGTCCAGTT CGGCAAGGGC GCGCCGCGGG TCGGCGACTG GGTCGTCGCC
ATCGGCAACC CGTTCGGCCT CGGCGGCACG GTGACGGCGG GCATCGTGTC GGCCCGCGGC
CGCGACATCG GCGCCGGCCC CTACGACGAC TTCCTGCAGA TCGATGCGCC GATCAACAAG
GGTAATTCCG GCGGCCCGAC CTTCAACGTC AACGGCGAGG TCGTGGGCGT GAACACGGCG
ATCGCCTCCC CGTCCGGCGG CTCGGTCGGC CTCGCCTTCG CGATCCCCGC CGAGACGGTG
CAGACGGTGG TCGATCAGCT CCGCACCGAC GGCAAGGTGG TGCGCGGCTA TCTCGGCGTG
CAGGTGCAGC CGGTGACGAA GGACATCGCC GAGGGGCTCG GTCTCGACAA GGCCAAGGGA
GCCCTGGTCG ATCACGCCGA GAACGGCACG CCGGCCGCCA AGGCCGGGTT GAAGTCGGGC
GACGTGATCG AGTCGGTCAA CGGCGCGCCG GTCAACGATG CCCGCGACCT GTCACGCCGC
ATCGCCGGCC TCAAGCCCGG CACCGAGGTG AAGCTCGCCT ATCTGCGCGG CGGCAAGAGC
GACATCGCGA CGGTCGAACT CGGCACCCTG CCGACGGACG GCAAGGTGGC CAGCCTCGGC
GACGGCGCCT CGGGCGGTCA ACCGCGCCTT GGCCTGAGCC TTGCACCGGC GAACGATGTC
GGCCTCGGCG ACGAGGGCGT GGCGGTGATG GATGTCGATC CCGACGGCCC GGCCGCGGCC
AAGGGCATCG CCCAGGGTGA CGTGATCCTG GACGTTGCAG GAACCAGCGT CGCGAAGCCC
TCCGATGTCC AGGCGCAGAT CCGCGCCGCG GAGTCGAATG GCCGCAAGGC TGTGCTGATG
CGCGTGAAGA GTTCCAAGGG GCAGACCCGC TTCGTCGCCG TCGCGCTCGG CAAGAAGGAG
GGCTGA
 
Protein sequence
MTMTVRRRAF ASVAAAALVA GGAAGFGLTE SAMPAYAQAL PKTPIEAPEH PPGSFANVVD 
KVKPGVVAVK VKLDNSAGDD DDSSGNPNLQ QVPPQLREFF KRFGQGGPGG RGMPQQRGER
GAVGSGFIIS ADGYVVTNNH VVDHAKTVQV TLDDGRTLDA KVIGKDPKTD IALLKITESG
SYPYVQFGKG APRVGDWVVA IGNPFGLGGT VTAGIVSARG RDIGAGPYDD FLQIDAPINK
GNSGGPTFNV NGEVVGVNTA IASPSGGSVG LAFAIPAETV QTVVDQLRTD GKVVRGYLGV
QVQPVTKDIA EGLGLDKAKG ALVDHAENGT PAAKAGLKSG DVIESVNGAP VNDARDLSRR
IAGLKPGTEV KLAYLRGGKS DIATVELGTL PTDGKVASLG DGASGGQPRL GLSLAPANDV
GLGDEGVAVM DVDPDGPAAA KGIAQGDVIL DVAGTSVAKP SDVQAQIRAA ESNGRKAVLM
RVKSSKGQTR FVAVALGKKE G