Gene Mlg_0949 details


Gene Information

Locus tag: Mlg_0949
Symbol:
ID: 4269683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1079918
End bp: 1080814
Gene Length: 897 bp
Protein Length: 298 aa
Translation table: 11
GC content: 60%
IMG OID: 638125701
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_741793
Protein GI: 114320110
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype
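
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the record does not state either convention. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1079918
END_BP = 1080814
GENE_LENGTH_BP = 897
PROTEIN_LENGTH_AA = 298


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 897
print(expected_protein_length(GENE_LENGTH_BP))  # 298
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.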


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0628062
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGG GAGGTTTCCT ACCAGGGCGC CTCGGCCTTT CCGGTGCCCG TATTCCTCAC 
GTGGATCGTC ATGGCTTGCT GTGGCTGACA CGCGGACGGC TGTACGTAGA GGATGGCACG
CTGCATTTCA CGGCAGCAGA GTCCGAAGAC CTGGCCGCCG GTGATTATGC GATTCCGTAC
CAAGGCCTCT CAATGATTCT GCTCGGCCCT GGCAGTACGG TCACGCACGA TGTATTACGC
CTCCTTGCAC GCCACGGGAC GCTGTTGGCC GCTATTGGCG GGGGTGGCAC CAAATACTAC
ACGGCTCCGC CAATGGGCCA AGGCCGCTCA GATGTGGCCC GGCGCCACGC CACGCTTTGG
GCGAACAAAA CCCAACGGCT CGACGTCGCA CGACGCATGT ACGCCTTTCG CTTCGGGCGC
GTGCTTCCTC ATAAAGACAT CGCTGTACTG CGCGGCATTG AAGGCGGACG CATCAAGGAG
CTCTACCGGG TGGAAGCCAG CCGCTTCGGC ATTCCTTGGA AGGGGCGCCG TTACAATCGC
AACAACCCAT CTGCAGCGGA CGTCCCCAAC CAGGCCATCA ACCATGCGGC AACGTTCGTG
GAGGCCGCCG CAGACATTGC TGTGGCCGCC ACCGGCGCGC TGCCACCGCT CGGCTTTATC
CATGAGGAAT CGAGTAACGC TTTTACACTG GACATTGCCG ACCTCTACCG GGGCGAAATC
ACCGTCCCGT TGGCATTCCA GGCCGCCCGA AAGGTTCTTG ACGACCCGAC CCTCAGTATC
GAACGCACCT TGCGCCGAGA CGCGGCGAGC GCATTTCAAC GCCATAAAGT CATCCCGAAG
ATGATCGATC GAATAAAGGA CCTGATCAAT GCCGATGACA ATGGTCGTAA CACGTAA
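
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be compared with the table values. The following is a minimal sketch with hypothetical helper names; it uses only the first row of the sequence as sample input, so its output describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "ATGAGCGAGG GAGGTTTCCT ACCAGGGCGC CTCGGCCTTT CCGGTGCCCG TATTCCTCAC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same functions on the complete sequence should give a length and GC percentage that can be compared against the Gene Length and GC content fields.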
 
Protein sequence
MSEGGFLPGR LGLSGARIPH VDRHGLLWLT RGRLYVEDGT LHFTAAESED LAAGDYAIPY 
QGLSMILLGP GSTVTHDVLR LLARHGTLLA AIGGGGTKYY TAPPMGQGRS DVARRHATLW
ANKTQRLDVA RRMYAFRFGR VLPHKDIAVL RGIEGGRIKE LYRVEASRFG IPWKGRRYNR
NNPSAADVPN QAINHAATFV EAAADIAVAA TGALPPLGFI HEESSNAFTL DIADLYRGEI
TVPLAFQAAR KVLDDPTLSI ERTLRRDAAS AFQRHKVIPK MIDRIKDLIN ADDNGRNT
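
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the standard base ordering T, C, A, G; translation table 11 differs from the standard table only in which start codons it permits, so the same codon-to-residue mapping is used here. The helper name `translate` is hypothetical, and only the first 30 and last 9 bases of the gene are shown as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence codon by codon; '*' marks a stop."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


print(translate("ATGAGCGAGG GAGGTTTCCT ACCAGGGCGC"))  # MSEGGFLPGR
print(translate("AACACGTAA"))                         # NT*
```

The two outputs match the first ten and last two residues of the protein sequence, with the final codon TAA acting as the stop.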