Gene Cagg_3451 details

Gene Information

Locus tag: Cagg_3451
Symbol:
ID: 7269676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4193268
End bp: 4194287
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 61%
IMG OID: 643568261
Product: CRISPR-associated protein Cas1
Protein accession: YP_002464729
Protein GI: 219850296
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
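
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4193268, 4194287)
print(length, protein_length_aa(length))  # prints "1020 339"
```

Under these assumptions the result matches the Gene Length (1020 bp) and Protein Length (339 aa) fields reported in the record.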


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.124605
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.143241
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCGACAC TCTACGTGAT TGAACAAGGC GCTGAAATCG GTTGCGATGG CGAACGGATC 
GAAGTGCGCC GGGGGGCCGA CATTATCGGC AGCGTACCGT TGGTCAAACT CGACGACATC
GTCATCTTTG GCAACGTCGG CATCAGTACG CCGGCGATGA AACGACTGCT CGACCGCGGC
ATTGAAGTTA CCTTCATGAC GGTTGATGGG CGTTATCAGG GTCGCCTCAT TGGGCAGGTC
ACGGCGCATG TCGCCCTTCG CCATGCGCAA TACGCCTGTG CCGCCGATCC GGCGCGGGCG
CTTGCCCTTG CCCAACGGTT CGTCGAGGGC AAACTACGCA ACCAACGCGC ACTACTACAA
CGGTTCAGCC GCAACCGGGC CGAACCACCA CCGGAAGCGC AGGCGGCGGC AGACGATCTT
GAGGCCTATA TCAAGCGGGT AAAACGCACA ACGCAACTCA GCTCACTACT GGGGGTAGAA
GGCAGTGCAA CCGCACGCTA CTTTGCCGGT CTACGCAGCC TGATCGGGCC GGAATGGTCA
TTCAGCGGAC GCCAACGACG CCCACCACCC GACCCGGTCA ATCTGCTCCT CTCGCTCGGC
TACACCCTCT TGGCGCACAA AGTGCTAGGG GCGGTACAGG CAGCCGGGTT CGACCCATAT
CTCGGCTTCT TACACAGCCT TGACTATGGG CGGCCTTCGC TTGCGCTCGA CATAATGGAA
GAGTTTCGTC CAATACTCAT CGACTCGCTG GTCGTGCGCA TCTGCAACGA CGGGCGCATC
CGACCCGAAC ACTTCCGGCC GGGTGAAGGT GAGCGACCGA TCATCATCAC CGACGAGGGC
AAACGGGCAT TTCTCACCGC GTTTGAAGAA CGCATGCGAA CCGAAGCCAC CCATCCCGAA
GGCGCGGACA GTGGGCCGGG CAAAGTACCG TACACGCGCT GCATCGCGTT ACAGGCCAGA
CGACTAGCGC GGGTGGTGCG TCAACGCACC GACGACTACG AGCCATTTGC CGTTCGATAA
 
Protein sequence
MATLYVIEQG AEIGCDGERI EVRRGADIIG SVPLVKLDDI VIFGNVGIST PAMKRLLDRG 
IEVTFMTVDG RYQGRLIGQV TAHVALRHAQ YACAADPARA LALAQRFVEG KLRNQRALLQ
RFSRNRAEPP PEAQAAADDL EAYIKRVKRT TQLSSLLGVE GSATARYFAG LRSLIGPEWS
FSGRQRRPPP DPVNLLLSLG YTLLAHKVLG AVQAAGFDPY LGFLHSLDYG RPSLALDIME
EFRPILIDSL VVRICNDGRI RPEHFRPGEG ERPIIITDEG KRAFLTAFEE RMRTEATHPE
GADSGPGKVP YTRCIALQAR RLARVVRQRT DDYEPFAVR
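
The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before any computation. The following is a minimal sketch, not the method used to produce this record, of how the GC content and the translation could be checked. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code; it does not handle alternative start codons, which does not matter here because the gene begins with ATG. All names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGGCGACAC TCTACGTGAT TGAACAAGGC"
print(translate(first_blocks))  # prints "MATLYVIEQG"
```

The output agrees with the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.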