Gene Mpop_5124 details

Gene Information

Locus tag: Mpop_5124
Symbol:
ID: 6312451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium populi BJ001
Kingdom: Bacteria
Replicon accession: NC_010725
Strand:
Start bp: 5486981
End bp: 5488471
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 70%
IMG OID: 642653806
Product: protease Do
Protein accession: YP_001927755
Protein GI: 188584310
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
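
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields.
START_BP = 5486981
END_BP = 5488471
GENE_LENGTH_BP = 1491
PROTEIN_LENGTH_AA = 496


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length includes the stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5488471 - 5486981 + 1 gives 1491 bp, and 1491 / 3 - 1 gives 496 aa, matching the listed gene and protein lengths.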


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.38057
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCTGA CCGTTCGCCG CCGCGTCACC GCCTCCGTCG CTGCGGCCGC CCTCGTTGCG 
GGCGGCGCAG CCGGGTTCGG CCTCACCGAA TCCGCCATGC CGGCCTACGC CCAGGCCCTG
CCCAAGACCC CGATCGAGGC TCCCGAGCAT CCGCCGGGCT CGTTTGCCAA CGTCGTCGAC
AAGGTGAAGC CGGGCGTCGT TGCCGTGAAG GTCAAGCTCG ACAGCGGCCT CGACGACGAT
GACGACGGCC CTGGCGGCCC CAACATGCAG CAGGTGCCGC CGCAACTGCG CGAGTTCTTC
AGGCGCTTCG GCCAAGGTGG TCCCGGCGGG CGCGGCATGC CGCAGCGCGG CGGGGTCGGC
TCCGGCTTCA TCATCTCGGC GGACGGCTAC GTGGTGACCA ACAACCACGT CGTCGATCAT
GCCAAGACCG TGCAGGTCAC CCTCGACGAC GGCCGGACCC TCGACGCCAA GGTCATCGGC
AAGGACCCGA AGACCGACAT CGCGCTCCTG AAGATCACCG AGAGCGGCAG CTACCCCTAC
GTCCAGTTCG GCAAGGGCGC GCCGCGGGTC GGCGACTGGG TTTTGGCTAT CGGCAACCCG
TTCGGCCTCG GCGGCACGGT CACGGCGGGC ATCGTCTCGG CCCGCGGCCG CGACATCGGT
GCCGGCCCCT ACGACGACTT CCTGCAGATC GATGCGCCGA TCAACAAGGG CAATTCCGGC
GGCCCGACCT TCAACGTCAA CGGCGAGGTC GTGGGCGTGA ACACGGCGAT CGCCTCCCCG
TCCGGCGGCT CGGTCGGCCT CGGCTTCGCG ATCCCCGCCG AGACGGTGCA GACGGTGGTC
GATCAGCTCC GCACCGACGG CAAGGTGGTG CGCGGCTATC TCGGCGTGCA GGTCCAGCCG
GTGACGAAGG ACATCGCCGA GGGGCTCGGC CTCGACAAGG CCAAGGGTGC GCTCGTCAAT
GACGCCGAGA GCGGCACGCC GGCCGCCAAG GCCGGGTTGA AGCCCGGCGA CGTGATCGAG
TCGGTCAACG GCGTCCCGAT CGACAACGCG CGCGACCTCT CGCGGTTGAT CGCCGGCCTC
AAGCCCGGCA CCGAGGTGAA GCTCACCTAT CGGCGCGGCG GCAAGAGCGA CACCGCGACC
GTCGAACTCG GTACCTTGCC GGGCGATGGC AAAGTGGTGA GCCGCGGCGA CGACGCGCCG
AGCGGTCAGG TCCGGCTCGG CCTCAGCCTG GCCCCCGCCA GCGAGGTTGG CCTCGGCGAC
GAGGGCGTGG CGGTGATGGA TGTCGATCCG ACCGGCCCGG CGGCGGCCAG GGGCATCTCG
CAAGGCGATG TGATCCTAGA TGTCGGCGGC ACCAGCGTCG CGAAGCCCTC CGATGTCCAG
GCGCAGATCC GCGCCGCGGA ATCGAGCGGC CGCAAGGCGG TGCTGATGCG GGTGAAGGGC
GCGAGGGGGC AGACCCGCTT CGTCGCCGTC GCGCTCAACA AGGAAGGATG A
 
Protein sequence
MALTVRRRVT ASVAAAALVA GGAAGFGLTE SAMPAYAQAL PKTPIEAPEH PPGSFANVVD 
KVKPGVVAVK VKLDSGLDDD DDGPGGPNMQ QVPPQLREFF RRFGQGGPGG RGMPQRGGVG
SGFIISADGY VVTNNHVVDH AKTVQVTLDD GRTLDAKVIG KDPKTDIALL KITESGSYPY
VQFGKGAPRV GDWVLAIGNP FGLGGTVTAG IVSARGRDIG AGPYDDFLQI DAPINKGNSG
GPTFNVNGEV VGVNTAIASP SGGSVGLGFA IPAETVQTVV DQLRTDGKVV RGYLGVQVQP
VTKDIAEGLG LDKAKGALVN DAESGTPAAK AGLKPGDVIE SVNGVPIDNA RDLSRLIAGL
KPGTEVKLTY RRGGKSDTAT VELGTLPGDG KVVSRGDDAP SGQVRLGLSL APASEVGLGD
EGVAVMDVDP TGPAAARGIS QGDVILDVGG TSVAKPSDVQ AQIRAAESSG RKAVLMRVKG
ARGQTRFVAV ALNKEG
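
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own method: the helper names are hypothetical, the codon table is written out by hand, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ only in which codons may act as start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    fragment = "ATGGCCCTGA CCGTTCGCCG CCGCGTCACC"
    print(translate(fragment))  # MALTVRRRVT
    print(round(gc_percent(fragment)))
```

Pasting the full gene sequence into `translate` and `gc_percent` allows the result to be compared against the listed protein sequence and the GC content field.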