Gene Cagg_1186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1186 
Symbol 
ID7267935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1461803 
End bp1464202 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content51% 
IMG OID643566029 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_002462531 
Protein GI219848098 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00457345 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGCTTT ACCCGTACCA ACAGCGTGTG AGAAACCACA TTCTACAGGG TAAGTCGGTC 
ATCTTGCAAG CACCGACCGG GGCCGGCAAG ACGCGAGCTG CGTTGGCGCC GTTTATTGAA
GGATTTTTTG ATCGGCCCGA CTCGACTCCA CGCAAGTGCT TGTACGTGAC GCCGATGCGG
GTGTTGGCTA ATCAATTTTA TGCTGAGTAT AGGGCATTAG CTGATAGCTA TTGCCGTCGT
CATGGCCGCC ATTTAGATGT GCGGATTCAA ACTGGCGAAC AACCAGATGA TCGACGCTTT
GAAGGCGATC TCATTTTCTG CACAATCGAT CAATTTCTTA GCAGTTATTT GGTGATGCCG
TACAGTTTAC CCTACCGATT GGCTAATCTG AATGCCGGAG CGATAGCGGG TTCGTATTTA
GTATTTGATG AATTTCATCT GTTTGATCCT GAAGCGGCAT TACCGACGAT TTTGCATGCG
TTGCCTACCT TAAGCCGGTT AGCTCCGGTG ATGTTGATGA CGGCTACCTT TAGCATTCAC
ATGCTGGAAT CGTTAACGTC GTTCCTGTCC AATGCCGAGA TTGTGACACT TACGCGGGAC
GAGATTACCG CCATTGATTG CCGTGGCGGA CAATCACCCC GACAGCGATA TTGGACTGCG
GTCGATCAGC CGTTGTGTGC AGACGCAGTC TTACAGCGCC ACCAGCGTTC GTCGTTGGTG
ATTTGTAACA CGGTGACTCG TGCGCGTGCC TTGTATCGTG AACTGAAGCA AAAGGTTGGT
CAGAATACCG AACTTCTGCT CTTGCATAGC CAGTTTTTGC CTAATGATCG CCGTCGGATC
GAAGGGGAGT TGCAGCGACG AATCGGTGTA AACGCAAATC GCACTTGCGC CAATGTGATC
GTAGTCGCCA CCCAAGCGAT CGAAGTCGGT GTTGATATTT CCGCCGAAGT CTTACACACC
GAGCTGGCAC CGCCTGCCAG TTTGATTCAG CGCGCTGGTC GTTGCGCGCG TTACCCCGGT
GAAAAAGGGG AGGTGATCGT ATATCGGGTG GAGAAATATG CGCCGTATGC GTTCAAGCCT
GATGAGTTGT TAAAGCGAGA GATGGACACT GCTTGGCAGT GGTTGCACGA GTGTAAGAGA
GAGATTTTTG ATTTCACGCG AGAGCAGGAA CTGGTTAATA CAGTGAGTGC GCCACGCGAT
GAGCAGGTGA TAGATGGTCT GAACGCTGAT CGGGTTAATC GGTCAAATTA CATTCATACC
TGTCAACAAG GCAACCGTCG TGGGGCAAGC CGATTGTTGG TGCGAGATGT AGACAGCCGG
CTTGTGTTAA TCCATCCTAA TCCAGATCAG TTGCTCGACT CACCATACGA TGCAATTGGA
TTGAACATTC CGGTCTACTC GCTGCGCGCA ATGGTACGGG AATGGTTGCA ACGGTCAGTG
ACAGCACCTT GGCGCGTTAA ACGGCTCGAT GAACGTCAGG CCGATGATCG GGCAGAAAGC
CGGCAGATGG TATATCACTG GGTTGACGTT CAAGATACTA AAGAATTGGA TGAAACAGCC
ATACTCGTTG TCCATCCTGT TTTAGCCGGT TATTCGGAAC GCGAAGGGTT TCTACCTGAC
ACGGGTGATT TACCTTTTGT TTCGTCGTTA CCATCGGCAA CGACGCGGAT CGAGCGGGCC
TTGCAACCGC TGCAGGTAGA GACCTACACT GAACATATTC AACGGGTTTT ACAAGCATTT
AGCGAGCTAG CGCTTCCTGA ACTGCGCTAT GCTGCACCGG CGCTCGAGAG GGCGGCGAAT
TGGCCGGAAG GGAGTCTGAT ACAAGCCGCG TGGTTGGCCT GTCTCTTGCA TGATGTTGGC
AAGTTAAGTA GCGCTTGGCA GCAGTGGGCA CATGCCTATC AGCAAGCGAT CAATCAGCCG
GTGGCAAAGC ACGTCGCTTT AGCCCATACC GCATTTGATC GCCAAAACCC GTCGCATGTG
CAGGCACAGC AGCAGGTTTC AGCCAAGATA CCGCGTCCCC GTCATGCATC AGAGGGTGCG
TTGGCATGTG CGGAGATAAT CACAGAGGCA TTAGGCAAGC AACCATCCTT AACCAAGGCA
ACCATCACAG CAATTGCTCG TCATCATGCA CCGTTTGTGA ATGATTGCCA ACCATTTCAA
TTGGTACCAA ACGCTGACCA AATCTTAGCC TGTACGGTAC AATACATACC ATCAGCGCTA
CGCCAGCATA TCAAGGTGAA TTTGCTCTGG AAACAAAGTA ATATAAACCA AATCCAGCAG
TTCGGTAATC TGATCACAAC CCCCGATGAT CAGTTTGGTT GGATGGCATA CACCTTGCTG
GTACGGGCAT TACGCCGGGC AGATCAGAGA GGTACGGGGT TAGGAAGTGG AGCCTTATGA
 
Protein sequence
MVLYPYQQRV RNHILQGKSV ILQAPTGAGK TRAALAPFIE GFFDRPDSTP RKCLYVTPMR 
VLANQFYAEY RALADSYCRR HGRHLDVRIQ TGEQPDDRRF EGDLIFCTID QFLSSYLVMP
YSLPYRLANL NAGAIAGSYL VFDEFHLFDP EAALPTILHA LPTLSRLAPV MLMTATFSIH
MLESLTSFLS NAEIVTLTRD EITAIDCRGG QSPRQRYWTA VDQPLCADAV LQRHQRSSLV
ICNTVTRARA LYRELKQKVG QNTELLLLHS QFLPNDRRRI EGELQRRIGV NANRTCANVI
VVATQAIEVG VDISAEVLHT ELAPPASLIQ RAGRCARYPG EKGEVIVYRV EKYAPYAFKP
DELLKREMDT AWQWLHECKR EIFDFTREQE LVNTVSAPRD EQVIDGLNAD RVNRSNYIHT
CQQGNRRGAS RLLVRDVDSR LVLIHPNPDQ LLDSPYDAIG LNIPVYSLRA MVREWLQRSV
TAPWRVKRLD ERQADDRAES RQMVYHWVDV QDTKELDETA ILVVHPVLAG YSEREGFLPD
TGDLPFVSSL PSATTRIERA LQPLQVETYT EHIQRVLQAF SELALPELRY AAPALERAAN
WPEGSLIQAA WLACLLHDVG KLSSAWQQWA HAYQQAINQP VAKHVALAHT AFDRQNPSHV
QAQQQVSAKI PRPRHASEGA LACAEIITEA LGKQPSLTKA TITAIARHHA PFVNDCQPFQ
LVPNADQILA CTVQYIPSAL RQHIKVNLLW KQSNINQIQQ FGNLITTPDD QFGWMAYTLL
VRALRRADQR GTGLGSGAL