Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_0740 |
Symbol | |
ID | 4480534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | + |
Start bp | 915235 |
End bp | 916296 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639721484 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_864667 |
Protein GI | 117924050 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACGGG TTTTAGGCAT AGAGAGCAGT TGCGACGAGA CCGCCGCCGC TGTGGTGGAA GGGGCAGAAC ATGGTCACCC CCATGGGGTG GTGGTGCGCT CCAATGTGGT GTGGAGCCAG TTGGAGGTAC ACGCCCTCTA CGGCGGGGTG GTGCCCGAAC TGGCCAGCCG GGCCCACATA CGCCATATAC AGCCGGTGAT TGAGCAGGCT TTGGCTGAGG CTGGGGTGCG ACCCCAGCAG TTGGATGCCA TTGCGGTTAC GGTGGCTCCC GGGTTGGTGG GCGCACTGCT GGTGGGGGTA GCGGCGGCGC AGGGGTTGGC GGTGGCGCTG GATAAGCCGC TGGTGCCGGT ACACCACATG GAAGGGCACC TGATGAGCCC TTTTCTCATG GCGGGCGTGG TACCTGCCAT GGAGTTCCCC TTTGTGGCCT TACTGGTCTC CGGTGGGCAC ACCCTGTTGC TGCACGCCCG TGATTTTGGC GACTACCAGC TGCTGGGGCA GACCCGTGAC GATGCGGTGG GGGAGGCGTT TGACAAGGGG GCGCGCATGC TGGGCTTGGG GTATCCCGGT GGTCCAGAGG TCGCCGCCTT GGCCCAGTCG GGGGATCGGC AGGCGGTGGC TTTTCCCCGT GTGTTGCTGG ACCGCAGCCA ATTTGATTTC TCCTTTTCTG GCCTAAAAAC CGCCTTGCGT ACCCATCTTC TTAAATTCCC GCCGGAGTCC GGTGGTCCCT CTTTGGCCGA TGTGGCCGCC AGTTATCAAG AGGCCATTGT GGATACCCTG GTGATTAAAT CCTTGAGCGC CTGCCGCCAT GTGGGGGTGT CGCGTTTGGT GATTGCCGGT GGAGTAGGGG CCAATAGACG ATTGCGGGAA AAATTGGCCA AACAAGCTCT TAAACAGGGT GTGCAACTCT ACGCTCCCCC CATCCACCTG TGTACTGATA ATGGCGCGAT GATCGCCTCT GCCGGCGTGT GCCGCTTGGC CAGGGGGGAT CAAGCGCGGG GGGTGGTGAA TGCGGTGCCC CGGCTGCCGA TTCATGAACT GGAGAAAATT TATGGCCGTT GA
|
Protein sequence | MLRVLGIESS CDETAAAVVE GAEHGHPHGV VVRSNVVWSQ LEVHALYGGV VPELASRAHI RHIQPVIEQA LAEAGVRPQQ LDAIAVTVAP GLVGALLVGV AAAQGLAVAL DKPLVPVHHM EGHLMSPFLM AGVVPAMEFP FVALLVSGGH TLLLHARDFG DYQLLGQTRD DAVGEAFDKG ARMLGLGYPG GPEVAALAQS GDRQAVAFPR VLLDRSQFDF SFSGLKTALR THLLKFPPES GGPSLADVAA SYQEAIVDTL VIKSLSACRH VGVSRLVIAG GVGANRRLRE KLAKQALKQG VQLYAPPIHL CTDNGAMIAS AGVCRLARGD QARGVVNAVP RLPIHELEKI YGR
|
| |