Gene Cagg_0567 details


Gene Information

Locus tag: Cagg_0567
Symbol:
ID: 7267800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 701815
End bp: 702828
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 56%
IMG OID: 643565430
Product: CRISPR-associated protein Cas1
Protein accession: YP_002461942
Protein GI: 219847509
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
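
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 701815
end_bp = 702828

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1014 bp
print(protein_length, "aa")  # 337 aa
```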


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.424472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.165251
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTCA TAGTAGATGA ACGCGGTAGT TTCATTGGCA AGCATCAAGG TCGGTTGCGG 
GTAACCAAAG ATAACGAGCG GTTACGCGAA GTACCGATCA TGCATCTCCG GCAAGTGATC
ATCTGTGGTA GTGGGGTTGC CATCAGCAGC GACGCCGTGC GAGCGTGCAG TGAAGAAGGC
ATCCCAATCC ACTTCATCAG CACCAACGGC ACCCCACAGG CCAGCCTCTA CAGCGCCGGC
CTCACCGGCA CCGTTCTTAC ACGGCGCGCC CAATTACGCG CCTACGACGG GCCGGCCGGC
GTCACCCTTG CCCGTGCGTT TACGCTGGGT AAGCTTGGCA ACCAAGCCAA CCTACTGCGC
TACGCGGCAA AAAATCGTAA GGAGACGGCG CCTGACATAT ACGAACAACT GATGACCGCA
GCCGGTGAAG TGGTTGACTA CCAAATCGCA GTCGAACGGC TCAAAGGCGA AACGGTTGAC
GAGATTCGTG ACGAATTGAT GGGGATCGAA GGCCGCTACG CAGCGCGCTA CTGGAAAGCC
ATCGGCGCGT TGGTCCCGTC TGAACTCAAT TGGCCGGGGC GCGAGACACG TGGGGCAACC
GATCCGTTCA ATCAAGTACT CAACTATGGT TACGGTGTAT TGTATGGTCA AGTCGAACAC
GCCATCGTGC TTGCCGGTCT CGATCCGTAT GCCGGCCTAC TCCATGCCGA TCGACCGGGC
AAACCGAGTC TGGTTTTAGA TTTAATCGAA GAGTTTCGTC AAGCTGTGGT TGATCGACCG
CTGCTCGGCC AACTCACCCG TGGTTGGCAA ATTGGGCGGG AAGAAGATGG TCGGCTTGAT
CAACCTACAC GTGAGCGCAT TGTGACCAAA GTGCTCGAAC GGCTTGAATC AACCGAACCG
TATGAGGGGA AGCGGCAACC GTTACGTCAC ATTCTCCAGT GTCAAGCACG CCACATTGCT
ACCTTCGTGC GTGGTGAGCG TGAAAACTAC ACCCCATTCG TGATGGGCTG GTGA
 
Protein sequence
MELIVDERGS FIGKHQGRLR VTKDNERLRE VPIMHLRQVI ICGSGVAISS DAVRACSEEG 
IPIHFISTNG TPQASLYSAG LTGTVLTRRA QLRAYDGPAG VTLARAFTLG KLGNQANLLR
YAAKNRKETA PDIYEQLMTA AGEVVDYQIA VERLKGETVD EIRDELMGIE GRYAARYWKA
IGALVPSELN WPGRETRGAT DPFNQVLNYG YGVLYGQVEH AIVLAGLDPY AGLLHADRPG
KPSLVLDLIE EFRQAVVDRP LLGQLTRGWQ IGREEDGRLD QPTRERIVTK VLERLESTEP
YEGKRQPLRH ILQCQARHIA TFVRGERENY TPFVMGW
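
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own procedure: it builds the codon table by hand and, for brevity, translates only the first and last lines of the gene sequence shown above. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not affect the result. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (third base varies fastest).
# These assignments are shared by the standard table and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# The last line is in frame because the 960 bases before it are a multiple of 3.
first_line = "ATGGAACTCA TAGTAGATGA ACGCGGTAGT TTCATTGGCA AGCATCAAGG TCGGTTGCGG"
last_line = "ACCTTCGTGC GTGGTGAGCG TGAAAACTAC ACCCCATTCG TGATGGGCTG GTGA"

print(translate(first_line))  # MELIVDERGSFIGKHQGRLR
print(translate(last_line))   # TFVRGERENYTPFVMGW
```

Both outputs match the corresponding start and end of the protein sequence listed above. Passing the full gene sequence to the same function should yield the complete 337-residue protein.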