Gene Mext_3827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3827 
Symbol 
ID5835277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4250263 
End bp4251753 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content70% 
IMG OID641369618 
Productprotease Do 
Protein accessionYP_001641271 
Protein GI163853228 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTGA CTGTCTGCCG CCGCCCCATC GCCTCCGTCG CCGCAGCCGC GCTCGTCGCA 
GGCGGCGCGG CTGGGTTCGG CTTGGCCGAG CCCATGACCC CGGCTTACGC CCAGGCCCTG
CCCAAGACCC CGATCGAGGC GCCCGATCAG CCGCCAGGCT CGTTCGCCAA CGTCGTCGAC
AAGGTGAAGC CGGGCGTCGT CGCCGTGAAG GTGAAGCTCG ACGACAGCGC CGACGATGAC
GACGACAGCC CCGGCGGCCC GAACATGCAG CAGGTGCCGC CGCAGCTGCG CGAATTCTTC
AAGCGCTTCG GCCAGGGTGG GCCAGGCGGT CGCGGCATGC GGCCGCGGGG CGGGGTCGGC
TCCGGCTTCA TCATCTCGGC GGACGGCTAC GTCGTCACCA ACAACCACGT CGTCGACAAG
GCCAAGACCG TGCAGGTCAC GCTGGACGAC GGCCGCACCC TCGACGCCAA GGTGATCGGC
AAGGACTCCA AGACCGACAT CGCCCTCCTG AAGATCACCG AGAGCGGCAG CTATCCCTAT
GTCCAGTTCG GCAAGGGCGC GCCCCGCGTC GGCGACTGGG TCTTGGCCAT CGGCAACCCG
TTCGGCCTCG GCGGTACGGT GACGGTGGGC ATCGTCTCGG CCCGCGGTCG CGACATCGGC
GCCGGCCCCT ACGACGATTT CCTGCAGATC GACGCGCCGA TCAACAAAGG CAATTCCGGC
GGCCCGACCT TCAACGTCAA CGGTGAGGTC GTAGGCGTGA ACACGGCGAT CGCCTCGCCG
TCCGGTGGCT CGGTCGGCCT CGGCTTCGCG ATCCCCGCCG AGACGGTGCA GACGGTGGTC
GATCAGCTCC GCACCGACGG CAAGGTGGTG CGTGGTTATC TCGGCGTGCA GGTCCAGCCG
GTGACGAAGG ACATCGCCGA GGGGCTCGGC CTCGACAAGG CCAAGGGCGC GCTCGTCAAT
GACGCCGAGA GCGGCACGCC GGCGGCCAAG GCCGGCCTGA AATCGGGCGA CGTGATCGAG
TCGGTCAACG GCGTGCCCGT GAACAACGCT CGCGATCTGT CGCGGCTGAT CGCCGGCCTC
AAGCCCGGCA CCGAGGTGAA GCTCGCCTAT CTGCGGGGCG GCAAAAGCGA GGTGGCCACC
GTCGAACTCG GTACGTTACC GGGCGACAGC AAGGTGGCGC GGCGCGGCGA CGAAGCGCCG
AGCGGTCAGG CCCGGCTCGG CCTGAGCCTG GCCCCTGCCA GCGAGATCGG CCTCGGCGAC
GAGGGCGTGG CGGTGATGGA TGTCGATCCC GACGGTCCGG CCGCGGCCAG GGGCATCTCC
CAGGGCGACG TGATCCTGGA TGTCGCCGGC ACCAGCGTCT CGAAGCCCTC CGAGGTGCAG
GCACAGATCC GTGCAGCCGA ATCGAGCGGC CGCAAGGCGG TGCTGATGCG GGTGAAGAGC
GCCAGGGGGC AGACCCGCTT CATCGCCGTC CCCCTGACCA AGGAGGGCTG A
 
Protein sequence
MPLTVCRRPI ASVAAAALVA GGAAGFGLAE PMTPAYAQAL PKTPIEAPDQ PPGSFANVVD 
KVKPGVVAVK VKLDDSADDD DDSPGGPNMQ QVPPQLREFF KRFGQGGPGG RGMRPRGGVG
SGFIISADGY VVTNNHVVDK AKTVQVTLDD GRTLDAKVIG KDSKTDIALL KITESGSYPY
VQFGKGAPRV GDWVLAIGNP FGLGGTVTVG IVSARGRDIG AGPYDDFLQI DAPINKGNSG
GPTFNVNGEV VGVNTAIASP SGGSVGLGFA IPAETVQTVV DQLRTDGKVV RGYLGVQVQP
VTKDIAEGLG LDKAKGALVN DAESGTPAAK AGLKSGDVIE SVNGVPVNNA RDLSRLIAGL
KPGTEVKLAY LRGGKSEVAT VELGTLPGDS KVARRGDEAP SGQARLGLSL APASEIGLGD
EGVAVMDVDP DGPAAARGIS QGDVILDVAG TSVSKPSEVQ AQIRAAESSG RKAVLMRVKS
ARGQTRFIAV PLTKEG