Gene Cagg_3443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3443 
Symbol 
ID7269668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4183322 
End bp4185481 
Gene Length2160 bp 
Protein Length719 aa 
Translation table11 
GC content61% 
IMG OID643568253 
ProductCRISPR-associated protein, Csm1 family 
Protein accessionYP_002464721 
Protein GI219850288 
COG category[R] General function prediction only 
COG ID[COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) 
TIGRFAM ID[TIGR02578] CRISPR-associated protein, Csm1 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.791619 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTT CCCACCCAAC CGATCCGGTT GCCACGGCCA CGCACGTGTT GCAGGCGTGG 
GTGGCGGCTG CCGTGGGCGC AACCATAACC GCCTCACCCG ATCTGGTAGC AACTGCGTGG
CAGATCGCCG GCCGGTCGCC ACAAGCGCAG TGGCCGCCGT TGCCGGATCA CCTGGCGACC
GTCTTCACCC GATTGGTCGG TCAAGCACCG CCGTCGGCCT GGCAACCACC GACGGTCTTG
CAGTGTGACC CTGACGCGCT TTTTCCGAGG GCGCAACCGG CTACGCCTAC GGATGAGTTG
CGCAATGGCT TACGTACTGG GCTCGCTACA GCAGCACAGG CCACCGATCC GGCCAACCGC
CTTGAACTGC TTCTGAACGC GCTGCAACGC TACGGATCGG CCTGGCCATC ACCACTAGCG
GCAGTGTCAC TCTACGATCT TGCGCGGTTG CATGCAGCCG TCGCGGCGGC GTTGACCGGT
GATGCACAAC AGCAGATCTG TCTGCTCGGC GGTGATCTGT CTGGCTTGCA AGAGTTTCTC
TACAGTATTC CGGCGGAAGG CGCGGCACGT CAGTTGCGTG GACGTTCCCT GTACTTGCAG
TTGCTTACCG ATGCATGTGC GCAATGGGTC TTGCGGAAGA GCGGGATGCC GCTCTGTAAT
CTGCTGTATG CCGGTGGTGG TCGGTTCTAC GCTGTGCTGC CGGGACGTTT CGCCAATGAG
GTGTCGGCTT GGCGGCGTGA ATTGGGGCAG GTGCTGCTCG AATACCATCG TGGTGCGCTC
TATGTGGCAC TCGGTGCAAC CGCGCCGTTT GCCCCCGATC AGTACAACGA GCAAACATGG
CACGAGTTGA CGCGCGCGAT TGACACCGAC AAGCGACGAC GGTTCGCAGC GTTGGACGAT
GAATCGTTTG CCCGTATCTT CCAACCGCGC CCACCACGTC CGCCGCGCAG TGATGGTAAG
GAAGCTCCCG ATCCGCTTGG CGAGTCGTTA GCCGATTTGG GGCGGCAACT GACCCGTGCT
GCTGTGCTGT TGGTCGATTC CCAGGCGTCG GCGGTGTCCG GTACGACGTG GCGCAGTGTG
GTGAGCAAAT TGGGAGTGAA TTACGAACTG ACAGAACAAG TTCGCTTGCC GGTCGCCAAG
CGGCGTGCAT TGGCATTGAC CGACGAGGTG GTGCCGTCCG AACCGGCGCA AGCGATTTTG
ATCGGTCAGC GTTATCTGGT GGCCGAAGCC TACCGTTTGC GCGACACCGA TCTGTCGCGT
TACCGTGCGC TTGATCCGAG TCAAGCCGAA GAACTACGTC CCGGCGATGT TGCTCCGTTC
AACTTGTTGG CCGACCAGAG CGAAGGGATT AGCCGGATTG CCGTCTTGCG GATGGACGTA
GATAACCTCG GTGATTTGTT TGGGCGCGGG TTACAGCGCC CCGCCGGTTT GGCCGGTTTA
GCGGTGACTG CGGCGCTCAG TAGTGCGCTG AGTCGCTTTT TTGAAGGTTG GGTTGGCGAA
CTCTGTCGCC AGGTTAACGA ACGCAGCCAA GGTCAGGGTG GAATCTATGC AGTCTACAGC
GGTGGCGACG ATCTGTTTCT GGTCGGGTCG TGGCATCTGT TGCCCGAACT GGCCCGCACC
ATTCGCGCCG ACTTTATCCG CTACGCCGGC GGTGCGGTGA CGGTCTCGGC CGGAATTACC
TTGCATCAGG CCGGCTACCC GCTCTATCAG GCGGCAGAAG ATGCCGGCAA CGCATTGGAC
GCGGCCAAAT CCTACGAACA TCCTGATGGG CGGAGCAAGG ATGCGATTAC GTTTTTGGGA
CAGACGTTGA GCTGGGTGGA GTTTGAACAG GCTGCCACCC TGAAAGATGA GCTGATCGAT
TTGATTGCAA ATGGTGCGCC GCGCAGCTTG CTCATGACGA TCCAAGCGTT GGCCGTCCGT
GCCCGCCAGC GGTTTAATCG GAGCGGTCAG GTGCAGCTTC TCGTCGGGCC GTGGGTGTGG
CAGGGTGCGT ACCAACTGAC CCGCCTCGCC GAACGGTCGG GTGATCTGCG TCGGCGGATT
GAAGCCCTGC GCGAGCAGCT ACTCAGCAGC GAAGGGATTG CGTCCCGCAC CATTATTCCC
GCCGGGTTAG CGGCGCGGTG GGCACAGTTG CTACTCCGCG GGCGTGATAC GCGGCGGTGA
 
Protein sequence
MTTSHPTDPV ATATHVLQAW VAAAVGATIT ASPDLVATAW QIAGRSPQAQ WPPLPDHLAT 
VFTRLVGQAP PSAWQPPTVL QCDPDALFPR AQPATPTDEL RNGLRTGLAT AAQATDPANR
LELLLNALQR YGSAWPSPLA AVSLYDLARL HAAVAAALTG DAQQQICLLG GDLSGLQEFL
YSIPAEGAAR QLRGRSLYLQ LLTDACAQWV LRKSGMPLCN LLYAGGGRFY AVLPGRFANE
VSAWRRELGQ VLLEYHRGAL YVALGATAPF APDQYNEQTW HELTRAIDTD KRRRFAALDD
ESFARIFQPR PPRPPRSDGK EAPDPLGESL ADLGRQLTRA AVLLVDSQAS AVSGTTWRSV
VSKLGVNYEL TEQVRLPVAK RRALALTDEV VPSEPAQAIL IGQRYLVAEA YRLRDTDLSR
YRALDPSQAE ELRPGDVAPF NLLADQSEGI SRIAVLRMDV DNLGDLFGRG LQRPAGLAGL
AVTAALSSAL SRFFEGWVGE LCRQVNERSQ GQGGIYAVYS GGDDLFLVGS WHLLPELART
IRADFIRYAG GAVTVSAGIT LHQAGYPLYQ AAEDAGNALD AAKSYEHPDG RSKDAITFLG
QTLSWVEFEQ AATLKDELID LIANGAPRSL LMTIQALAVR ARQRFNRSGQ VQLLVGPWVW
QGAYQLTRLA ERSGDLRRRI EALREQLLSS EGIASRTIIP AGLAARWAQL LLRGRDTRR