Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1467 |
Symbol | |
ID | 3102844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1557968 |
End bp | 1559389 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637170642 |
Product | serine protease, MucD |
Protein accession | YP_113924 |
Protein GI | 53804465 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.627151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGACC GATACCGTTT TGTCGGCGCC ATTTGTGCGG CGCTGGTGCT GGCCGTTTCG CCGCCAGCCC GGGCCCAGCT TCCCGATTTC ACCCAGCTGG TTGAACAGAA CAACGCCGCC GTCGTGAACA TCAGCACCAC CCAGAAAGTG GCCGCCAACG AACAGCAGAT GCCGGAGGGT CTGGAGATTC CGGAGGGCAC GCCGTTCGAC GATTTTTTCC GGCATTACTT TGGCGAAGGC GGGGGTAGCG ACGGCCAGCC GAGCGAGGCG AAGTCCCTGG GTTCGGGTTT CATCATGTCG GCGGATGGCT ATATCATCAC CAATCACCAC GTCGTGAAAG GGGCCGATGA GATCGTGGTG CGACTGCAGG ACCGGCGGGA GCTGGTGGCG AAGATCGTTG GGTCCGACAA GCGTAGTGAC GTCGCCCTGC TCAAGATCGA GGCCAGCCAG CTGCCCACGG TGAAGCTGGG GTCGTCCGAA AAGCTCAAGG TCGGCGAATG GGTGCTCGCC ATCGGCTCGC CTTTCGGATT CGACCATTCC GCCACCGCCG GCATCGTCAG CGCCAAGGGG CGCAGCCTTC CCAGCGACAA TTACGTGCCG TTCATACAGA CCGACGTGGC CATCAACCCT GGCAACTCGG GAGGACCGCT GTTCAATCTG AACGGCGAAG TGGTCGGCGT CAATTCCCAG ATCTACAGCC GTACCGGCGG CTTCATGGGG CTGTCCTTCG CCATTCCCAT CGAAGTCGCC ATGCAGGTGG TGGACCAGCT CAAAGCCAGC GGAAGGGTTT CCCGCGGCTG GCTGGGTGTC CAGATCCAAG ACGTGACGCG AGAGCTGGCC GAGTCCTTCG ACATGAAGAA ACCACAAGGC GCCCTGGTAT CCAAGGTTCT TTCGAAGAGC CCGGCCGAAG CGGCCGGCGT CCAGATCGGC GACATCGTGC TGGAATTCAA CGGCCAGGCG GTGGACACGT CGGCTGCGCT GCCGCCCATG GTAGGCATGA CCAAGGTCGG CGAAGTCGCC AAAATCAAGT TGTTACGTAA CGGTGCGATC AAGGAGCTGA GTATCAAGAT CGGAGCGCTC CCCGATGAGG AAGAACCGGC GATGGGTACC GCCGAGCCCG ATGCGGTACC GCTGAAGCGC ATGGGAGCCA GCGTGGCCGA TCTGACCCCG GAACTGCGCG AGCAGTTCGA GGTGCCACGG GGCGGGGTGC TCGTCTACGG CGTCAATCCC GGTCCCGCCT ACGAGGCAGG GCTGCGGCGC GGAGACGTGA TCCTCCGGAT TCAGGACAAG GAAATCAACG GCGTGAAACA ATTGGTAGAG TTGGAGAAGA CCCTGCCGGC AGGGAAATCG CTGGCGGTGC TGGTGCAGCG GCGCGATGGC TCCATCTTCC TGGCGATGAA ATTGAAAGAC GAAAAGCAGT GA
|
Protein sequence | MFDRYRFVGA ICAALVLAVS PPARAQLPDF TQLVEQNNAA VVNISTTQKV AANEQQMPEG LEIPEGTPFD DFFRHYFGEG GGSDGQPSEA KSLGSGFIMS ADGYIITNHH VVKGADEIVV RLQDRRELVA KIVGSDKRSD VALLKIEASQ LPTVKLGSSE KLKVGEWVLA IGSPFGFDHS ATAGIVSAKG RSLPSDNYVP FIQTDVAINP GNSGGPLFNL NGEVVGVNSQ IYSRTGGFMG LSFAIPIEVA MQVVDQLKAS GRVSRGWLGV QIQDVTRELA ESFDMKKPQG ALVSKVLSKS PAEAAGVQIG DIVLEFNGQA VDTSAALPPM VGMTKVGEVA KIKLLRNGAI KELSIKIGAL PDEEEPAMGT AEPDAVPLKR MGASVADLTP ELREQFEVPR GGVLVYGVNP GPAYEAGLRR GDVILRIQDK EINGVKQLVE LEKTLPAGKS LAVLVQRRDG SIFLAMKLKD EKQ
|
| |