Gene Mmc1_3524 details


Gene Information

Locus tag: Mmc1_3524
Symbol:
ID: 4481122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Magnetococcus sp. MC-1
Kingdom: Bacteria
Replicon accession: NC_008576
Strand:
Start bp: 4405889
End bp: 4406914
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 54%
IMG OID: 639724273
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_867415
Protein GI: 117926798
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype
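
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record states neither, so both are assumptions. The function name check_cds_lengths is hypothetical.

```python
def check_cds_lengths(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the record's coordinates and lengths agree.

    Assumes 1-based inclusive coordinates and one trailing stop codon
    (both are assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a CDS should be a whole number of codons
        return False
    codons = span // 3
    return codons - 1 == protein_length_aa  # subtract the stop codon


# Values copied from the record above.
print(check_cds_lengths(4405889, 4406914, 1026, 341))  # True
```

With these values the span is 1026 bp, which is 342 codons: 341 amino acids plus one stop codon.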


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAACCC ATTTAAACAC CCTTTTTATC AACACCCAGG GCAGCTATCT GGCCAAAAAG 
GGGGAGTGTA TTGATGTTCG TCAGGAGCAG CGCTCCATGG CTATGGTTCC CATCCATACG
CTAGAGGGTG TGGCTTGCTT TGGGCAGGTC TCTTACTCTC CTTATTTGAT GGCCCACTGC
GCCGAGCATG GTGTTAGCCT CAGCCATTTT GATGAAAGGG GTCGATTTTT GGCCGCCATG
CGTGGACCCA CCAGCGGCAA TGTGCTGTTA CGCAGGCAGC AGTATCGTTG GGCCGATGAT
CCCCAGCAGG CGGCCCAAAT GACCCGTTTT ATCCTGCACG CCAAACTACG CAATGGCCGC
ACGGTCATGG CGCGCGCCCT GCGTGAACGC GGGCAGACAA ACACGGCCTT GGAAACGACG
GTCCACGAAT TGGGGGTTTT GGTGGCGTTA CTGGATAAAA CCGATTCGGT AGAGAGTCTG
CGTGGGTTAG AAGGGCGGGC GGGTGCGCTT TATTGGGGAT CGTTTCAGCA TCTTATTTAT
CAGGATTCGC CGGAGTTTAA GTTCTTTGGC CGCAGTCGTC GTCCGCCGTT GGATCGGGTT
AATGCCCTGC TCTCTTTCCT CTATGCCATG TTGATGCATG ATGTACGCTC GGCGTTGGAG
GGGGTTGGGT TGGATCCTTA TGTGGGATTT TTGCATCAAG ATCGCCCTGG TCGGCCAGGC
TTGGCGCTGG ATATGATGGA GGAGTTTCGC CCCTATCTGG CCGACCGGCT GGCTTTGACG
TTGATTAATC GTGGTCAGCT CGGTGCGAAG GATTTTGAGG TGCAGGCTTC GCAGGCGACC
TATCTTACCG AGGCGGGGCG AAAAAAGGTC ATCGTGGCGT ATCAAAAGCG TAAGGATGAG
CAAATAACGC ACCCCTTTTT ACAGGAGCAC TGCGCCATAG GCATGGTGTG GCATTTACAG
GCTCTGCTGT TGGCGCGATA TATCCGTGGG GATTTGGATG GTTATCCTGC TTTTTTGTGG
CGTTAA
 
Protein sequence
MRTHLNTLFI NTQGSYLAKK GECIDVRQEQ RSMAMVPIHT LEGVACFGQV SYSPYLMAHC 
AEHGVSLSHF DERGRFLAAM RGPTSGNVLL RRQQYRWADD PQQAAQMTRF ILHAKLRNGR
TVMARALRER GQTNTALETT VHELGVLVAL LDKTDSVESL RGLEGRAGAL YWGSFQHLIY
QDSPEFKFFG RSRRPPLDRV NALLSFLYAM LMHDVRSALE GVGLDPYVGF LHQDRPGRPG
LALDMMEEFR PYLADRLALT LINRGQLGAK DFEVQASQAT YLTEAGRKKV IVAYQKRKDE
QITHPFLQEH CAIGMVWHLQ ALLLARYIRG DLDGYPAFLW R
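
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table from the compact NCBI layout (base order TCAG), whose amino acid assignments are the same for translation table 11 and the standard code. Table 11 also permits alternative start codons, but this gene begins with ATG, so the sketch ignores that. The function name translate is hypothetical, and only the first 30 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, base order TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":             # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGAACCC ATTTAAACAC CCTTTTTATC"))  # MRTHLNTLFI
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (CGTTAA) translate to R followed by a stop, matching the last residue.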