Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1812 |
Symbol | |
ID | 3103875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1945078 |
End bp | 1946508 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637170972 |
Product | U32 family peptidase |
Protein accession | YP_114250 |
Protein GI | 53804107 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGGC CCAGGCATGG GCTGCGTCAA GCGGCGTCAG GTTCTGGTAT AGTGCCGTCG AGCTATGCCC TCTTCATATC GCTGCTCCCG ATGCCTTCTG TCGAACTGCT CTCCCCTGCC GGCACCCTGA AGAACCTGCG TTATGCCTTC GCATACGGCG CCGATGCCGT CTATGCCGGC CAGCCGCGCT ACTCGCTGCG GGTCCGCAAC AACGACTTCC TGGAGAAAAA CCTGGCGCTG GGCATCGCCG AGGCCCATCG CCTCGGCAAG AAGTTTTATC TCGCGGCCAA CGTGCTGCCC CACAACGCCA AGATCAAGAC CTTCATCAAA GACATCGCGC CCGTGGTTGC GATGGGACCG GATGCGCTGA TCATGGCCGA TCCCGGCCTC ATCCTGCTGG CGCGCGAGGC CTGGCCGGAA ATGCCCATCC ATCTGTCGGT GCAGGCCAAC ACGTTGAACT TTGCGGCGGT GAAGTTCTGG CAGTCGGCCG GCATCTCGCG GATCATCCTG TCGCGCGAGC TCTCGCTCGA CGAGATCGCC GAAATCCGCC AGCAATGCCC GGACACGGAG CTCGAAGTCT TCGTCCACGG CGCGCTGTGC ATCGCCTACT CCGGACGCTG CCTGCTCTCC GGCTATTTCA ATCACCGTGA TCCCAACCAG GGCACCTGCA CCAATGCCTG CCGCTGGAAA TACGACGTCG CCCCGGCGCA CGAGAACGCC GAAGGAGACT ATGTTCCGGG AGCATTGCGG CTGAACCTCG CGGAGTTGAA TGGCAGCCTG GCGGACTGCG GCGCCGCGGA GCGCCACCCG CTGGCCGACG AAGTGTATTT TCTGGAAGAG GAAAACCGCC CGGGCGAACT GATGCCGGTG ATGGAAGACG AGCACGGCAC CTACATCATG AACTCCAAGG ACCTTCGAGC AGTCGAGCAC GTGCAGCGGC TGGTGGAGAT CGGGGTAGAC AGCCTGAAGA TCGAAGGCCG CACCAAGTCT CATTACTATG TGGCGCGCAC CGCCCAGGTC TACCGCCGAG CCATCGACGA CGCGCTCGCC GGGCGGCAAT TCGACTGGAA CCTGCTCGGC GTGCTGGAAA ATCTGGCGCA GCGCGGCTAC ACGGACGGAT TCTACCAGCG CCACCATACC CAGGATTACC AGAACTATGT GCGGGGCTGC TCCGAAAGCC ACCGCCAGCA GTTCGTCGCC GAAGTCACCA GGTTCGATGG CGGGATGGCC GAAGTCACCG TCAAGAACAA GTTCGGCGTC GGCGACCGGC TAGAGCTGAT CCGCCCCGAA GGCAACCTCG ATTTCTTGCT CACCGGCATG GAGGACTCCC AGGGCAATGC CATGCAGGAA GCTCCCGGCG GCGGCTGGGA GGTCAGGATT CCCTTGCCCG CGCCCTGCGA CGATACTGCC TTGCTGTCAC GCTATCTCTG A
|
Protein sequence | MTRPRHGLRQ AASGSGIVPS SYALFISLLP MPSVELLSPA GTLKNLRYAF AYGADAVYAG QPRYSLRVRN NDFLEKNLAL GIAEAHRLGK KFYLAANVLP HNAKIKTFIK DIAPVVAMGP DALIMADPGL ILLAREAWPE MPIHLSVQAN TLNFAAVKFW QSAGISRIIL SRELSLDEIA EIRQQCPDTE LEVFVHGALC IAYSGRCLLS GYFNHRDPNQ GTCTNACRWK YDVAPAHENA EGDYVPGALR LNLAELNGSL ADCGAAERHP LADEVYFLEE ENRPGELMPV MEDEHGTYIM NSKDLRAVEH VQRLVEIGVD SLKIEGRTKS HYYVARTAQV YRRAIDDALA GRQFDWNLLG VLENLAQRGY TDGFYQRHHT QDYQNYVRGC SESHRQQFVA EVTRFDGGMA EVTVKNKFGV GDRLELIRPE GNLDFLLTGM EDSQGNAMQE APGGGWEVRI PLPAPCDDTA LLSRYL
|
| |