Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0875 |
Symbol | |
ID | 3104591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 921704 |
End bp | 923695 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637170076 |
Product | serine protease |
Protein accession | YP_113369 |
Protein GI | 53804956 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.923473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGG AACGGGAACG GTTCGTCAAC CAGCGTGTCG CGATGTCTTT GATTTTGGCC GCAGGACTGT CGGCTGCCTC GTTCGGGGAG GCCGAGGCGG GTACGTTCCT GCGCGCCGCC CGGCCCATTC CCGGGCAATA CATCGTGATC TTCCGAAGCG AAGCGGAGAG CGCTGCGGGT TTTGACGGGC TGCTGGGCCG GGTGAGCCGG CTGTACAACA CCGTGCCGGC GCAAGTGTGG CGGCATGCCG TCAAAGGGTT CGTGGGCCGG ATGTCGCCGG TGGAAGCCGC GGCACTGGCC AACGATCCTT CCGTCGCGCT GGTGGAGGAG GATGGCATTA TCTCGATCCA CGCCACCCAG TCCGGTGCAA CCTGGGGACT GGACCGGATC GACCAGGCCA GTCTGCCGCT GAACGGCACC TACACCTACA ACACGACGGG GCAGGGCGTC ACGGTCTACG TGATCGACAC CGGCATCCGG ACCAGCCATT CGCAGTTCGG TGGCCGGGCC AGCGAGGGTT ACACGGCGAT CAATGACGGC CGGGGCGCCC GCGACTGCGA TGGCCACGGC ACGCATGTCG CCGGAACCAT CGGCGGCTCG ACTTACGGCG TCGCCAAGGG TGTCAAACTG GTTTCGGTAC GGGTCCTGGA TTGCAATGGT TCGGGAACGA CGTCCGGTGT GATCGCCGGG GTGAACTGGG TGACCGCCAA CGCGCAGGTC CCCGCCGTCG CCAACGTCAG CCTGGGCGGC AGCGCTTCCC AGGCGCTCGA TACCGCGGTG CAGAATTCGA TCAACGCGGG CGTGACCTAT GTGATCGCCG CCGGCAATTC CAACAGGAAT GCCTGCGACG AGTCGCCCGG CCGCACCGCC GCCGCTTTGA CCGTGGGAGC CACCACTTCT ACCGACGCCC GCGCGAGCTA TTCCAACTAT GGAACCTGCC TGGATATTTT CGCGCCGGGC AGCAGCATCA CTTCCGCGAG CAACGCGGGC GACAACGCGA CCGAGGTCAT GAGCGGTACC TCGATGGCGA CGCCGCACGT CGCCGGAGCC GCGGCCTTGT ATCTGTCTGC TTTCCCGGGC GCGGCACCGG CTCAGGTCGC ATCGGCACTG ACCTCCAACG CGACCCCCGG CAAGGTCGGC GGGGCCGGCA CGGGGTCGCC CAATCGCCTG TTGTACACGG GGTTCATCGC GGCGCCCGCC GTCGACACCA CGCCGCCTCA AGTGACGCTC ACCGCGCCGG CGGCTTCGGC CACCTTGACC GGCGCGGTGA ATCTCGCCGC CAATGCCGTG GACGAAAGCG GCGGAAGCGG CATCGCCAAG GTCGAATTCC GCGTCGATGG CAAGGTCGTG GGGACTGATA CCTCGGCGCC ATACGGCGTG GTGTGGGATT CTGCATCCGT GGCGGACGGA TCGCACGCTT TCGACGCGAT GGCCACCGAC AGTGCCGGGA ATTCGGCCCT GTCCAATTCG GTCAGTGCCG ATACCGCCAA TGGCGGCCAG TCCTCCCCCG CGTGTTCGAC GACCAGCCAA CTGTTGGCGA ATCCCGGTTT TGAAAGCGGG ACGGTGGCAT GGACGGCGTC CGCGGGGGTC ATAGACAATT CCAACTCGGC ACCGGCGCGC AGCGGAAACT GGAAGGCCTG GCTGAACGGC TATGGCACGA TCCATACCGA TGACCTGTAC CAGCAGGTGA CGGTGCCAGT CGATGCCTGC AGCGCCAATT TCAGCTTCTG GCTGCGGATC GCCACCAGCG AATTCACCGG CGCGCCGGCG CGCGACACGC TGACGGTGAC GGTGCGCAAT ACCGCAGGGA CTGTGCTGCA GACGCTGGCA ACCTATTCGA ACAGGGACCG CTCTTCGGGG TATGTCCAAC GGCGCTTCGA TCTGTCCGCG TACAAGGGGC AGACCATCCG GCTCCAGTTC CGGGGTGTCG AGAATTCGTC CCGGGCCACC AGCTTCCTGG TCGATGATGC GGAGCTGGCG GTGACGCGAT AG
|
Protein sequence | MSKERERFVN QRVAMSLILA AGLSAASFGE AEAGTFLRAA RPIPGQYIVI FRSEAESAAG FDGLLGRVSR LYNTVPAQVW RHAVKGFVGR MSPVEAAALA NDPSVALVEE DGIISIHATQ SGATWGLDRI DQASLPLNGT YTYNTTGQGV TVYVIDTGIR TSHSQFGGRA SEGYTAINDG RGARDCDGHG THVAGTIGGS TYGVAKGVKL VSVRVLDCNG SGTTSGVIAG VNWVTANAQV PAVANVSLGG SASQALDTAV QNSINAGVTY VIAAGNSNRN ACDESPGRTA AALTVGATTS TDARASYSNY GTCLDIFAPG SSITSASNAG DNATEVMSGT SMATPHVAGA AALYLSAFPG AAPAQVASAL TSNATPGKVG GAGTGSPNRL LYTGFIAAPA VDTTPPQVTL TAPAASATLT GAVNLAANAV DESGGSGIAK VEFRVDGKVV GTDTSAPYGV VWDSASVADG SHAFDAMATD SAGNSALSNS VSADTANGGQ SSPACSTTSQ LLANPGFESG TVAWTASAGV IDNSNSAPAR SGNWKAWLNG YGTIHTDDLY QQVTVPVDAC SANFSFWLRI ATSEFTGAPA RDTLTVTVRN TAGTVLQTLA TYSNRDRSSG YVQRRFDLSA YKGQTIRLQF RGVENSSRAT SFLVDDAELA VTR
|
| |