Gene Emin_0244 details


Gene Information

Locus tag: Emin_0244
Symbol:
ID: 6262916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 264709
End bp: 265605
Gene Length: 897 bp
Protein Length: 298 aa
Translation table: 11
GC content: 38%
IMG OID: 642610708
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001875143
Protein GI: 187250661
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype
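
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that contains its stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


length = gene_length_bp(264709, 265605)
print(length)                     # 897
print(protein_length_aa(length))  # 298
```

Both values agree with the Gene Length and Protein Length fields of this record.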


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCGAG TTTTGGATAT CCCCGGGGAC GGGTACCATT TATGCGTAAA AAACAATAAC 
TTCTCCGCAG TAAAAGACAG AGAGGAAAAA CTGCATTGTT TATTTGACGA TATAAACAGC
ATTATACTTT ACGGTAATAA TATTACCATT TCCAATACTT GCATACAAAA ATGTTTAGAG
CATAAAGTAC CGGTCATCTT CTGCGATAAA ACCTATAACC CCGCCGGAAT GCTGCTTTCT
TCTTTTACCA CAAATATTTA CGGACGCAGA CTCCAGTTAC AAATAAATGC CTCAAAACCA
CAAATAAAAC AAGCCTGGCA ACAAATAATC ACAAGTAAGT TAAACAACCA AGCTGAGGTG
TTAAAAAGAT TTGACACGCT TAAGGCGGCG GAAACCATTT TTAATATGGC CCGCGAGGTG
CGCTCTGGCG ATGCTACTTT TAAAGAAGGT GTCGGCGCAA AGGTATATTT TGAAAATTTA
TTTAATGATT TTCATAGAAA TACCGACGAT AAGGATATTA TAAATTCAGC GTTAAATTAT
GGCTATGCGA TTGTTAGAAG TTCTATTGCG CGGGCGGTTG TTTCCGCCGG ATTAAATCCC
GCCATCGGTA TTTTCCACAG TAAGAACCAT AATCCGTTTT GTTTAATAGA TGATTTGATA
GAACCACTGC GTCCTCTTAT AGATTTTATG GTAAAAAATA AATTGGATGT TTTGACGCAA
GAGGAAAGTC TGTCGCCTTC GGCTAAAAAA TATATGGCAA GCGTGATAGA AAGTAACTTG
TATTTTGAGG ATGGTGCCTT TAATCTTACG GCCGGGATAC AAAAATATAT CCAGTCGTAT
ATCGCGTTTT TGGAAGAACG GGAAAACAGG ATAATTTTCC CGGCAATTTT AAAATGA
 
Protein sequence
MWRVLDIPGD GYHLCVKNNN FSAVKDREEK LHCLFDDINS IILYGNNITI SNTCIQKCLE 
HKVPVIFCDK TYNPAGMLLS SFTTNIYGRR LQLQINASKP QIKQAWQQII TSKLNNQAEV
LKRFDTLKAA ETIFNMAREV RSGDATFKEG VGAKVYFENL FNDFHRNTDD KDIINSALNY
GYAIVRSSIA RAVVSAGLNP AIGIFHSKNH NPFCLIDDLI EPLRPLIDFM VKNKLDVLTQ
EESLSPSAKK YMASVIESNL YFEDGAFNLT AGIQKYIQSY IAFLEERENR IIFPAILK
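
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the method used to generate the record: the helper names are hypothetical, and the codon table is the standard assignment set, which translation table 11 shares with table 1 for internal codons (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGTGGCGAG TTTTGGATAT CCCCGGGGAC"))  # MWRVLDIPGD
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field listed in this record.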