Gene AnaeK_0784 details

Gene Information

Locus tag: AnaeK_0784
Symbol:
ID: 6786210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 891548
End bp: 892432
Gene Length: 885 bp
Protein Length: 294 aa
Translation table: 11
GC content: 71%
IMG OID: 642762236
Product: CRISPR-associated protein Cas1
Protein accession: YP_002133149
Protein GI: 197121198
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype
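
The length fields in this record agree with its coordinates: the inclusive span from the start to the end position gives the gene length, and dividing that by three and dropping the stop codon gives the protein length. A minimal Python sketch of that arithmetic follows. The function names are illustrative and not part of the source database, and the sketch assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
# Sketch only: function names are hypothetical, not taken from the source database.

def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1  # the stop codon codes for no residue


length_bp = gene_length(891548, 892432)
print(length_bp, protein_length(length_bp))  # prints "885 294"
```

Both results match the Gene Length and Protein Length values listed in the record.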


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00721188
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGAAGG GCAGGCTCGG GCTCGAGACC GCGCGCATTC CCCAGGGCGA CCGGCACGGA 
CTGCTCTGGC TGTCGCGCGG CAACCTGTAC GTGGAGGACG GGACGCTCCG CTTTCGCACC
GTGGGCTGGG CTGGCCTGCC CGCTGGCGAC TACGCCATCC CGTTCCAGAT GGTCACCGCC
GTCCTCCTCG AGCCGGGGAC CACCGTCAGC CATGACGCGC TCCGGCTGCT CGCGCGCCAC
GGGACGGGCC TCGTCGCCGT CGGAGAGGAA GGCACGCGAT TCTACGCGAG CATGCCGTTC
GGCCCGGACG CGTCCGCTCT CGCTCGGCGG CAGGTGACGG CGTGGGCGAG CGCCGCGGAC
GGCCGGCTCC GCGTCGCCCG CCGCATGTAC GCCTGGCGCT TCGGCGAGGT TCTGCCCGAC
GAAGACATCA CCGTCCTGCG CGGCATCGAG GGCGCCCGCA TGCGCGAGAT CTACCGGCGC
CTTGCCGAAC AGTACGGGGT TCCATGGTCG GGCCGGCGAT ATGACCGGCA ACGCCCAGAG
CAGAACGACC CGGTGAACCA GGCGATCAAC CACGCGGCGA GCGCGGTCGA AGCTGCCGCG
CTCGTCGCGG TGGCGGTGAC CGGGACGGTC CCGCAGCTCG GGTTCATCCA CGAGGACTCG
GGGAATGCGT TCGCGCTCGA CGTCGCCGAC CTGTTCCGTT CGGCGATAGC CCTGCCGGCC
GCCTTCTCGG CGGTCCGCGA GTGCGCGCGC GATCCCCGCC AACCGCTCGA GCGCACCGCG
AGGCGTGCTG CGGGTCGCCT GCTACGGCAG AAGGACGTCA TTCCTGAGAT GATCGACCGC
ATCAAGGAGA TGTTCGATGC CGATGACGGT CATCGTGACC CGTGA
 
Protein sequence
MLKGRLGLET ARIPQGDRHG LLWLSRGNLY VEDGTLRFRT VGWAGLPAGD YAIPFQMVTA 
VLLEPGTTVS HDALRLLARH GTGLVAVGEE GTRFYASMPF GPDASALARR QVTAWASAAD
GRLRVARRMY AWRFGEVLPD EDITVLRGIE GARMREIYRR LAEQYGVPWS GRRYDRQRPE
QNDPVNQAIN HAASAVEAAA LVAVAVTGTV PQLGFIHEDS GNAFALDVAD LFRSAIALPA
AFSAVRECAR DPRQPLERTA RRAAGRLLRQ KDVIPEMIDR IKEMFDADDG HRDP
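
The gene sequence begins with GTG while the protein begins with M, which is the expected behaviour under translation table 11, where GTG can act as a start codon. The Python sketch below illustrates how the two sequence blocks relate. It is a hedged illustration, not the database's own method: the helper names are hypothetical, the set of start codons is a deliberately small subset of those table 11 permits, and ambiguous bases such as N are not handled.

```python
# Sketch only: helper names are hypothetical, not taken from the source database.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a small subset of the alternative start codons table 11 allows.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS, reading a recognised start codon as M and stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]  # raises KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)
```

For example, `translate("GTGCTGAAGG GCAGGCTCGG GCTCGAGACC")`, which uses the first 30 bases of the gene sequence, returns `MLKGRLGLET`, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.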