Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3827 |
Symbol | |
ID | 5835277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4250263 |
End bp | 4251753 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641369618 |
Product | protease Do |
Protein accession | YP_001641271 |
Protein GI | 163853228 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTGA CTGTCTGCCG CCGCCCCATC GCCTCCGTCG CCGCAGCCGC GCTCGTCGCA GGCGGCGCGG CTGGGTTCGG CTTGGCCGAG CCCATGACCC CGGCTTACGC CCAGGCCCTG CCCAAGACCC CGATCGAGGC GCCCGATCAG CCGCCAGGCT CGTTCGCCAA CGTCGTCGAC AAGGTGAAGC CGGGCGTCGT CGCCGTGAAG GTGAAGCTCG ACGACAGCGC CGACGATGAC GACGACAGCC CCGGCGGCCC GAACATGCAG CAGGTGCCGC CGCAGCTGCG CGAATTCTTC AAGCGCTTCG GCCAGGGTGG GCCAGGCGGT CGCGGCATGC GGCCGCGGGG CGGGGTCGGC TCCGGCTTCA TCATCTCGGC GGACGGCTAC GTCGTCACCA ACAACCACGT CGTCGACAAG GCCAAGACCG TGCAGGTCAC GCTGGACGAC GGCCGCACCC TCGACGCCAA GGTGATCGGC AAGGACTCCA AGACCGACAT CGCCCTCCTG AAGATCACCG AGAGCGGCAG CTATCCCTAT GTCCAGTTCG GCAAGGGCGC GCCCCGCGTC GGCGACTGGG TCTTGGCCAT CGGCAACCCG TTCGGCCTCG GCGGTACGGT GACGGTGGGC ATCGTCTCGG CCCGCGGTCG CGACATCGGC GCCGGCCCCT ACGACGATTT CCTGCAGATC GACGCGCCGA TCAACAAAGG CAATTCCGGC GGCCCGACCT TCAACGTCAA CGGTGAGGTC GTAGGCGTGA ACACGGCGAT CGCCTCGCCG TCCGGTGGCT CGGTCGGCCT CGGCTTCGCG ATCCCCGCCG AGACGGTGCA GACGGTGGTC GATCAGCTCC GCACCGACGG CAAGGTGGTG CGTGGTTATC TCGGCGTGCA GGTCCAGCCG GTGACGAAGG ACATCGCCGA GGGGCTCGGC CTCGACAAGG CCAAGGGCGC GCTCGTCAAT GACGCCGAGA GCGGCACGCC GGCGGCCAAG GCCGGCCTGA AATCGGGCGA CGTGATCGAG TCGGTCAACG GCGTGCCCGT GAACAACGCT CGCGATCTGT CGCGGCTGAT CGCCGGCCTC AAGCCCGGCA CCGAGGTGAA GCTCGCCTAT CTGCGGGGCG GCAAAAGCGA GGTGGCCACC GTCGAACTCG GTACGTTACC GGGCGACAGC AAGGTGGCGC GGCGCGGCGA CGAAGCGCCG AGCGGTCAGG CCCGGCTCGG CCTGAGCCTG GCCCCTGCCA GCGAGATCGG CCTCGGCGAC GAGGGCGTGG CGGTGATGGA TGTCGATCCC GACGGTCCGG CCGCGGCCAG GGGCATCTCC CAGGGCGACG TGATCCTGGA TGTCGCCGGC ACCAGCGTCT CGAAGCCCTC CGAGGTGCAG GCACAGATCC GTGCAGCCGA ATCGAGCGGC CGCAAGGCGG TGCTGATGCG GGTGAAGAGC GCCAGGGGGC AGACCCGCTT CATCGCCGTC CCCCTGACCA AGGAGGGCTG A
|
Protein sequence | MPLTVCRRPI ASVAAAALVA GGAAGFGLAE PMTPAYAQAL PKTPIEAPDQ PPGSFANVVD KVKPGVVAVK VKLDDSADDD DDSPGGPNMQ QVPPQLREFF KRFGQGGPGG RGMRPRGGVG SGFIISADGY VVTNNHVVDK AKTVQVTLDD GRTLDAKVIG KDSKTDIALL KITESGSYPY VQFGKGAPRV GDWVLAIGNP FGLGGTVTVG IVSARGRDIG AGPYDDFLQI DAPINKGNSG GPTFNVNGEV VGVNTAIASP SGGSVGLGFA IPAETVQTVV DQLRTDGKVV RGYLGVQVQP VTKDIAEGLG LDKAKGALVN DAESGTPAAK AGLKSGDVIE SVNGVPVNNA RDLSRLIAGL KPGTEVKLAY LRGGKSEVAT VELGTLPGDS KVARRGDEAP SGQARLGLSL APASEIGLGD EGVAVMDVDP DGPAAARGIS QGDVILDVAG TSVSKPSEVQ AQIRAAESSG RKAVLMRVKS ARGQTRFIAV PLTKEG
|
| |