Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0949 |
Symbol | |
ID | 4269683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1079918 |
End bp | 1080814 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638125701 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_741793 |
Protein GI | 114320110 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0628062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGG GAGGTTTCCT ACCAGGGCGC CTCGGCCTTT CCGGTGCCCG TATTCCTCAC GTGGATCGTC ATGGCTTGCT GTGGCTGACA CGCGGACGGC TGTACGTAGA GGATGGCACG CTGCATTTCA CGGCAGCAGA GTCCGAAGAC CTGGCCGCCG GTGATTATGC GATTCCGTAC CAAGGCCTCT CAATGATTCT GCTCGGCCCT GGCAGTACGG TCACGCACGA TGTATTACGC CTCCTTGCAC GCCACGGGAC GCTGTTGGCC GCTATTGGCG GGGGTGGCAC CAAATACTAC ACGGCTCCGC CAATGGGCCA AGGCCGCTCA GATGTGGCCC GGCGCCACGC CACGCTTTGG GCGAACAAAA CCCAACGGCT CGACGTCGCA CGACGCATGT ACGCCTTTCG CTTCGGGCGC GTGCTTCCTC ATAAAGACAT CGCTGTACTG CGCGGCATTG AAGGCGGACG CATCAAGGAG CTCTACCGGG TGGAAGCCAG CCGCTTCGGC ATTCCTTGGA AGGGGCGCCG TTACAATCGC AACAACCCAT CTGCAGCGGA CGTCCCCAAC CAGGCCATCA ACCATGCGGC AACGTTCGTG GAGGCCGCCG CAGACATTGC TGTGGCCGCC ACCGGCGCGC TGCCACCGCT CGGCTTTATC CATGAGGAAT CGAGTAACGC TTTTACACTG GACATTGCCG ACCTCTACCG GGGCGAAATC ACCGTCCCGT TGGCATTCCA GGCCGCCCGA AAGGTTCTTG ACGACCCGAC CCTCAGTATC GAACGCACCT TGCGCCGAGA CGCGGCGAGC GCATTTCAAC GCCATAAAGT CATCCCGAAG ATGATCGATC GAATAAAGGA CCTGATCAAT GCCGATGACA ATGGTCGTAA CACGTAA
|
Protein sequence | MSEGGFLPGR LGLSGARIPH VDRHGLLWLT RGRLYVEDGT LHFTAAESED LAAGDYAIPY QGLSMILLGP GSTVTHDVLR LLARHGTLLA AIGGGGTKYY TAPPMGQGRS DVARRHATLW ANKTQRLDVA RRMYAFRFGR VLPHKDIAVL RGIEGGRIKE LYRVEASRFG IPWKGRRYNR NNPSAADVPN QAINHAATFV EAAADIAVAA TGALPPLGFI HEESSNAFTL DIADLYRGEI TVPLAFQAAR KVLDDPTLSI ERTLRRDAAS AFQRHKVIPK MIDRIKDLIN ADDNGRNT
|
| |