Gene Information |
Locus tag | Mlg_0952 |
Symbol | |
ID | 4269686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1082248 |
End bp | 1083405 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638125703 |
Product | CRISPR-associated Cse4 family protein |
Protein accession | YP_741795 |
Protein GI | 114320112 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 |
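The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 1158 bp) and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the record above; the helper names are hypothetical.
start_bp = 1082248
end_bp = 1083405


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1158 385
```

Because the gene is on the minus strand, the start and end values describe the span on the replicon; the length calculation is the same for either strand.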
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.195706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCTGC AAATCCACAC CCTCACATCT TACCATGCCG CCCTACTTAA CCGGGACGAT GCCGGCCTTG CCAAGCGCAT TCCCTTTGGC AGTGCAGAAC GTATGCGGGT CTCCTCCCAA TGCCTAAAGC GCCACTGGCG GCAGGCCCTG AAAGACGTAA TTTCCCTCCC GAGCGGAATC CGCACCCGCC ATTTCTTCGA ACGAGAGGTC TGCCGACGTG TTATCGCCGA GGGCGTCGAA GACGAGAAGG CGCGTGAATT AACAGGCAAG CTCATCGACG CCGTTATGCA CAGCAAAGAG GCCCGCGAGA AAGACTCCCT TTTCCTCAAA CAACCTGTCC TTTTCGGCCG ACCAGAGGCT GACTACTTCG TCAGTTTGAT CACCGAATGT GCCCGTAGCG GTGAAGATCC CGGCTCAACC CTCAAGGATC GGGTCAAGGC GGAGAAAAAG AACTTCCGCG CCCTGCTTCA GGCGGCTGGA GGCAGTGACC TTGAGTCGGG CATAGAGGGC GCCCTGTTCG GGCGTTTCGT CACGTCAGAC ATCCTTGCCC GCACTGATGC CAGTGTCCAC GTCGCCCATG CCTTTACCGT GCATTCCCTG AACAATGAGG TGGACTATTT CACTGTGGTA GACGACCTGA AGGAGCCAGG CGAGGATGCC GGCGCAGCAC ATGCCGGGGA TATGGAACTG GGCGCTGGGC TTTTCTACGG GTACGTCGTG GTGGACGTAC CACTGCTCGT CTCAAACCTT TCCGGTTGTG AGCGGCAGGC ATGGCGCGAA CAGACAGAGG CTTGTGCCGA CGCCCGTGAT GTCTTGGCGG CCCTGGTACA CAGCATCGCA ACCGTCTCGC CCGGAGCCAA ATTGGGTGCT ACGGCTCCCT ACGCCCGTAC CGACTGTGCG TTGTTGGAGA CCGGTACGAC CCAGCCCCGT GCCTTGGCAA ACGCTTACCT TGAACCCCTG CCAGCGCGGG GCGACCTGAT GCAGCAGTCC GTTAATACCA TGGGCCATTA CCTGAAATCC CTTGATGACA TGTTCGGTGA GGAAACCAGT CGCTTCGTCT CTGCTACCAG GGACACAACG TCGCTCCCCT GCGCCCACCG CGGCCCCCTT TCAGAAACGA TCGACGGCGC CTTGGATAGC ATCTTCGGAG GTCAATGA
|
Protein sequence | MFLQIHTLTS YHAALLNRDD AGLAKRIPFG SAERMRVSSQ CLKRHWRQAL KDVISLPSGI RTRHFFEREV CRRVIAEGVE DEKARELTGK LIDAVMHSKE AREKDSLFLK QPVLFGRPEA DYFVSLITEC ARSGEDPGST LKDRVKAEKK NFRALLQAAG GSDLESGIEG ALFGRFVTSD ILARTDASVH VAHAFTVHSL NNEVDYFTVV DDLKEPGEDA GAAHAGDMEL GAGLFYGYVV VDVPLLVSNL SGCERQAWRE QTEACADARD VLAALVHSIA TVSPGAKLGA TAPYARTDCA LLETGTTQPR ALANAYLEPL PARGDLMQQS VNTMGHYLKS LDDMFGEETS RFVSATRDTT SLPCAHRGPL SETIDGALDS IFGGQ
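The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helpers `clean`, `gc_content`, and `translate` are hypothetical names introduced for this example, and only the first and last blocks of the gene sequence are used.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the
# standard code (the two differ only in which codons may act as starts).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last two blocks of the gene sequence in this record.
first_blocks = "ATGTTCCTGC AAATCCACAC CCTCACATCT"
last_blocks = "ATCTTCGGAG GTCAATGA"

print(translate(first_blocks))  # MFLQIHTLTS
print(translate(last_blocks))   # IFGGQ
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and GC content fields.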