Gene Mlg_0952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0952 
Symbol 
ID4269686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1082248 
End bp1083405 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content59% 
IMG OID638125703 
ProductCRISPR-associated Cse4 family protein 
Protein accessionYP_741795 
Protein GI114320112 
COG category 
COG ID 
TIGRFAM ID[TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.195706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCTGC AAATCCACAC CCTCACATCT TACCATGCCG CCCTACTTAA CCGGGACGAT 
GCCGGCCTTG CCAAGCGCAT TCCCTTTGGC AGTGCAGAAC GTATGCGGGT CTCCTCCCAA
TGCCTAAAGC GCCACTGGCG GCAGGCCCTG AAAGACGTAA TTTCCCTCCC GAGCGGAATC
CGCACCCGCC ATTTCTTCGA ACGAGAGGTC TGCCGACGTG TTATCGCCGA GGGCGTCGAA
GACGAGAAGG CGCGTGAATT AACAGGCAAG CTCATCGACG CCGTTATGCA CAGCAAAGAG
GCCCGCGAGA AAGACTCCCT TTTCCTCAAA CAACCTGTCC TTTTCGGCCG ACCAGAGGCT
GACTACTTCG TCAGTTTGAT CACCGAATGT GCCCGTAGCG GTGAAGATCC CGGCTCAACC
CTCAAGGATC GGGTCAAGGC GGAGAAAAAG AACTTCCGCG CCCTGCTTCA GGCGGCTGGA
GGCAGTGACC TTGAGTCGGG CATAGAGGGC GCCCTGTTCG GGCGTTTCGT CACGTCAGAC
ATCCTTGCCC GCACTGATGC CAGTGTCCAC GTCGCCCATG CCTTTACCGT GCATTCCCTG
AACAATGAGG TGGACTATTT CACTGTGGTA GACGACCTGA AGGAGCCAGG CGAGGATGCC
GGCGCAGCAC ATGCCGGGGA TATGGAACTG GGCGCTGGGC TTTTCTACGG GTACGTCGTG
GTGGACGTAC CACTGCTCGT CTCAAACCTT TCCGGTTGTG AGCGGCAGGC ATGGCGCGAA
CAGACAGAGG CTTGTGCCGA CGCCCGTGAT GTCTTGGCGG CCCTGGTACA CAGCATCGCA
ACCGTCTCGC CCGGAGCCAA ATTGGGTGCT ACGGCTCCCT ACGCCCGTAC CGACTGTGCG
TTGTTGGAGA CCGGTACGAC CCAGCCCCGT GCCTTGGCAA ACGCTTACCT TGAACCCCTG
CCAGCGCGGG GCGACCTGAT GCAGCAGTCC GTTAATACCA TGGGCCATTA CCTGAAATCC
CTTGATGACA TGTTCGGTGA GGAAACCAGT CGCTTCGTCT CTGCTACCAG GGACACAACG
TCGCTCCCCT GCGCCCACCG CGGCCCCCTT TCAGAAACGA TCGACGGCGC CTTGGATAGC
ATCTTCGGAG GTCAATGA
 
Protein sequence
MFLQIHTLTS YHAALLNRDD AGLAKRIPFG SAERMRVSSQ CLKRHWRQAL KDVISLPSGI 
RTRHFFEREV CRRVIAEGVE DEKARELTGK LIDAVMHSKE AREKDSLFLK QPVLFGRPEA
DYFVSLITEC ARSGEDPGST LKDRVKAEKK NFRALLQAAG GSDLESGIEG ALFGRFVTSD
ILARTDASVH VAHAFTVHSL NNEVDYFTVV DDLKEPGEDA GAAHAGDMEL GAGLFYGYVV
VDVPLLVSNL SGCERQAWRE QTEACADARD VLAALVHSIA TVSPGAKLGA TAPYARTDCA
LLETGTTQPR ALANAYLEPL PARGDLMQQS VNTMGHYLKS LDDMFGEETS RFVSATRDTT
SLPCAHRGPL SETIDGALDS IFGGQ