Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA2704 |
Symbol | |
ID | 3104402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 2888751 |
End bp | 2890445 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637171836 |
Product | serine protease |
Protein accession | YP_115106 |
Protein GI | 53803184 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0794852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGAAA GCGATGTCGA CCGGGATGAG CTCGGGGCGG TGAGCGCCGC CGAGCTTGGT CGGCTGCAGG CGGACACCGG TTTGGATTTC GAATGGGTGG GGAATACCCG GACCAACGCG AGGATTTTCC GGACGCCCGC TGCGTTGACC CGAGATGAGG CGACGGCCCT GGCACGAAGG ATCGGTGTCA TGGAGGGCGT GCTGTGGGCT GCCGTCGAAG CCGGGCAGGG CACGGTATAT GCCCGGTCGG CCTTGCCCGC CGTTGCCGGT ATCGAGGACT TCATCATCAG ATTCAAACGC GGGGCTGGCT CGCTGCCGCT GCATGTCGCG AAACTCTCGG CCGTCGCCGG CGCAGGTCTG GTCCCGACAT CGGAAACCGT AGGCACGGCG CAGGTGTACC GCCTCCTTCG TCCGGTGAGC CTGACGGAGG CGGGCGAGAT CGAGCGGCGC CTCGAAGCCC TGCCGGAGGT GCGCTATGCC GATCCGGTGA CACGCCTGAA GGCCCAGGCG ACCGTCGAGC CGGTCACCCC CAGCGACGAA TATTTCCCCC AGCAGTGGCA TCTTCAGAGC GGTCCCGGGG CGGTTGCCGG CTCGGCCAAT GTGCAGGCCG CCTGGTCGCT CATCCGGGGG GCGCCCGAAA TCACGGTTGC CGTGCTGGAC AGCGGCGTCC TGTTCGCCCC GACCCATCCC GACCTCGAAA ACCGGCTGGC ATACCGGAAT GCGGAGAAAA CCGTGATCGT CGGCTGGGAC ATGATCCGGA ATCCGAGATT CGCGCGGGAC GGCAATGCCC GCGACAAAAA GCCGAAGGAC GAGGGCGACT GGGAGACCGT GGGCTGGTGC GGGAAAGAGG ATGGAAGACC CGTGCCTTCG GAGTTCAAGC CTGCCGAATG GCACGGCAGC CACGTCGCCG GCATCATCGG CGCCGTCACC GACAACGGTA TCGGTGTGAG CGGCGTCGAC TGGTTCGCCC GCATCGAACC CGTGCGGGTG CTCGGCCCCT GCGATGGCAA CGCCAAAGAC ATCATCCAGG GGATATGGTG GGCCGCCGGC AAGAAAGGGG TGGGAGGAAC CCAGCCGAAT CCCAGGCCGG CCCATGTCAT CAACATGAGC CTGGGGCCCG ATGCGCCCGG TTCGGGACCC TGTCCGGACG CCATGCAGGA AGCCATCGAC TACGCGCTCT CGCGCAACGC CGTGGTCGTC GTATCGGCGG GCAATGAAAA GGACGATACC CGCAACTATG CGCCGGCGAA TTGCCGCGGC GTGATCGCAG TCAGTGCCGT GGGGCCGTCC GGCGACCTGG CCTCGTACAG CAATTTCGGG GCGGAGGTCG CCGTCGCGGC GCCCGGCGGA GACACCGGCG CGAATGGCGA ACGTCCGCAG GACGGCATCC TCTCGACCCT GAATACCAGC CGGAAGCGGC CATCCGCCGA GGGCATGTCC TATGGGGCGA TGGACGGTAC CAGCATGGCG GCGCCCGTGG TTTCCGGCGT CGTGTCGCTA ATGCTGGCCG CGGATACCGA GAAAAAGCTC GATCCACAAC GGATCGTGGC CATCCTTCGG GACACGGCGC GGCCGTTTCC GGAAGGTTCA CGGTGCGCCA CCGAGCTCGA GGGACGGTGC GGAGCGGGGA TCGTGGACGC CTATGCCGCG GTGAAGGCTG CGATGTCTCA GACGATCGGC CCCCGTGCGC AATGA
|
Protein sequence | MFESDVDRDE LGAVSAAELG RLQADTGLDF EWVGNTRTNA RIFRTPAALT RDEATALARR IGVMEGVLWA AVEAGQGTVY ARSALPAVAG IEDFIIRFKR GAGSLPLHVA KLSAVAGAGL VPTSETVGTA QVYRLLRPVS LTEAGEIERR LEALPEVRYA DPVTRLKAQA TVEPVTPSDE YFPQQWHLQS GPGAVAGSAN VQAAWSLIRG APEITVAVLD SGVLFAPTHP DLENRLAYRN AEKTVIVGWD MIRNPRFARD GNARDKKPKD EGDWETVGWC GKEDGRPVPS EFKPAEWHGS HVAGIIGAVT DNGIGVSGVD WFARIEPVRV LGPCDGNAKD IIQGIWWAAG KKGVGGTQPN PRPAHVINMS LGPDAPGSGP CPDAMQEAID YALSRNAVVV VSAGNEKDDT RNYAPANCRG VIAVSAVGPS GDLASYSNFG AEVAVAAPGG DTGANGERPQ DGILSTLNTS RKRPSAEGMS YGAMDGTSMA APVVSGVVSL MLAADTEKKL DPQRIVAILR DTARPFPEGS RCATELEGRC GAGIVDAYAA VKAAMSQTIG PRAQ
|
| |