Gene Mmc1_0740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_0740 
Symbol 
ID4480534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp915235 
End bp916296 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content62% 
IMG OID639721484 
ProductO-sialoglycoprotein endopeptidase 
Protein accessionYP_864667 
Protein GI117924050 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACGGG TTTTAGGCAT AGAGAGCAGT TGCGACGAGA CCGCCGCCGC TGTGGTGGAA 
GGGGCAGAAC ATGGTCACCC CCATGGGGTG GTGGTGCGCT CCAATGTGGT GTGGAGCCAG
TTGGAGGTAC ACGCCCTCTA CGGCGGGGTG GTGCCCGAAC TGGCCAGCCG GGCCCACATA
CGCCATATAC AGCCGGTGAT TGAGCAGGCT TTGGCTGAGG CTGGGGTGCG ACCCCAGCAG
TTGGATGCCA TTGCGGTTAC GGTGGCTCCC GGGTTGGTGG GCGCACTGCT GGTGGGGGTA
GCGGCGGCGC AGGGGTTGGC GGTGGCGCTG GATAAGCCGC TGGTGCCGGT ACACCACATG
GAAGGGCACC TGATGAGCCC TTTTCTCATG GCGGGCGTGG TACCTGCCAT GGAGTTCCCC
TTTGTGGCCT TACTGGTCTC CGGTGGGCAC ACCCTGTTGC TGCACGCCCG TGATTTTGGC
GACTACCAGC TGCTGGGGCA GACCCGTGAC GATGCGGTGG GGGAGGCGTT TGACAAGGGG
GCGCGCATGC TGGGCTTGGG GTATCCCGGT GGTCCAGAGG TCGCCGCCTT GGCCCAGTCG
GGGGATCGGC AGGCGGTGGC TTTTCCCCGT GTGTTGCTGG ACCGCAGCCA ATTTGATTTC
TCCTTTTCTG GCCTAAAAAC CGCCTTGCGT ACCCATCTTC TTAAATTCCC GCCGGAGTCC
GGTGGTCCCT CTTTGGCCGA TGTGGCCGCC AGTTATCAAG AGGCCATTGT GGATACCCTG
GTGATTAAAT CCTTGAGCGC CTGCCGCCAT GTGGGGGTGT CGCGTTTGGT GATTGCCGGT
GGAGTAGGGG CCAATAGACG ATTGCGGGAA AAATTGGCCA AACAAGCTCT TAAACAGGGT
GTGCAACTCT ACGCTCCCCC CATCCACCTG TGTACTGATA ATGGCGCGAT GATCGCCTCT
GCCGGCGTGT GCCGCTTGGC CAGGGGGGAT CAAGCGCGGG GGGTGGTGAA TGCGGTGCCC
CGGCTGCCGA TTCATGAACT GGAGAAAATT TATGGCCGTT GA
 
Protein sequence
MLRVLGIESS CDETAAAVVE GAEHGHPHGV VVRSNVVWSQ LEVHALYGGV VPELASRAHI 
RHIQPVIEQA LAEAGVRPQQ LDAIAVTVAP GLVGALLVGV AAAQGLAVAL DKPLVPVHHM
EGHLMSPFLM AGVVPAMEFP FVALLVSGGH TLLLHARDFG DYQLLGQTRD DAVGEAFDKG
ARMLGLGYPG GPEVAALAQS GDRQAVAFPR VLLDRSQFDF SFSGLKTALR THLLKFPPES
GGPSLADVAA SYQEAIVDTL VIKSLSACRH VGVSRLVIAG GVGANRRLRE KLAKQALKQG
VQLYAPPIHL CTDNGAMIAS AGVCRLARGD QARGVVNAVP RLPIHELEKI YGR