Gene Cagg_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0572 
Symbol 
ID7266044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp708644 
End bp710146 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content55% 
IMG OID643565435 
ProductCRISPR-associated CXXC_CXXC protein Cst1 
Protein accessionYP_002461947 
Protein GI219847514 
COG category 
COG ID 
TIGRFAM ID[TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.316727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.798196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAATT CGATAGGTAT TACCTATACC GGGCATCCGT TTATTGATGT TGGTTTTGCA 
ACACTCGCCG CCTTTGCCAA TCGTCGTCAT TTAGCCGATC TTACGACAAA CGATCTCGCG
GAAATGGCTA ATTACATTGA AGCAAACTAC GTGCGGCAGC CGTTGCGCAG TTTTCTGACG
GTGGCGTTCA CCAGTAATGC ATGGTTCGCC CAATCGGCGT TCAACCCTGA TCGGTTTGAT
GACCCTGACA AGAAGATCGA AGCGCAGCAG AAACGCAAAT ATTGGGCGGA TCGACACCTG
CGCCAGTGGG CGCAGGCTGC TGCGGCGCTC GAAACTTGCC TCTTCACCGG ACTTCCGGCA
GCGGCGCTCG AGTTGTCGGG CAAACTGCAA CCAGGTCGGG TTGGGCGGGC GCAGATGCCC
CTGTTGCAGG GTGACGACTC GATCAACTTC TTCACCAACG GCGATCCGGG TTTGCCGATG
GCGCCGGAGG CGATCCTGGC GCTCCAAGCG ATGCCGCTCG GCTGCGCCAA GGTTGGCGGC
GGCTTGTTGG CGGTGCATAG CGATGACGAG CAGTTGACGA TCGATTTCGC CAAGCGCTTC
TTGGAACGCA ACCTCAGTGA TGTGGCCAAA GCGCAGGCCG CCGGCGAAGA GAAGCTGTCC
GGATCACCGC GCAGCTTGAA GACGTTGTTG ATTGAGACGC TGGCCGAGAT TATCCGTCGC
CAAATGCAGG AGGAAGTGCG GCGCGAGCGG CGGCCAACGG TGACGGCCTA CTATTTTAAC
AACGGTCAGT CACCTTCGCT CGATATCTAC CACTTGCCGT TGCAGATCAC CGGCTTTTTG
CTGGCGGTTC ACACGCCGGC CTACCGCGCA ATCTGGAACG AGCTGGTACA GCGGAGCTGG
CAACGTTTGG AGACACCGAC AAAGCGGCGA AAAGTCGCCG AACCGACCGA ACCGCGTTTC
AATTATCTGT ACGAAGACCT CTTTACGTTG CCGGCGCAGG CGGCACGGTT TGTTCGTACC
TATTTTCTCC GTATTCCCAA TCGGTCTACT GCGACCGATG ACCCACGGCG TGAATATTCG
ACGCGCCGCG AAGTCGATCT TGTCTCATGG CCGCTGGTGG AACTTTTTGT ACAGGAGGTA
ATGCTTATGA CCGATGACCG GGTAGCGAAG TTGAAAGAAC TGGGCGATAA GTTAGCCGAT
TATACCCGTT ATCAGGGTGG CAAACGCTTT TTCCGTCAGT TCTTTACCGT GCAGCGCAGT
GATCACTTTT TGACCCTGCT CAACAAGACG AATATTGACT ATACGCGCTA TAAGCGTGGC
ACCGAGACAT TGTTCGATCT CGATAGTTTT CTTACCCTGT TTATGGAAGG TGAGGAGGTG
TTGCGTAACG ATTGGCGGTT GATGCGTGAT TTGGTACTGA TTCGGATGGT TGAACAGTTG
CGTGATTGGA TTGCCAATAA CGCCGATGCT ATTCCGAGTG AGGAGGAAGT GGTGGAGGCG
TAG
 
Protein sequence
MVNSIGITYT GHPFIDVGFA TLAAFANRRH LADLTTNDLA EMANYIEANY VRQPLRSFLT 
VAFTSNAWFA QSAFNPDRFD DPDKKIEAQQ KRKYWADRHL RQWAQAAAAL ETCLFTGLPA
AALELSGKLQ PGRVGRAQMP LLQGDDSINF FTNGDPGLPM APEAILALQA MPLGCAKVGG
GLLAVHSDDE QLTIDFAKRF LERNLSDVAK AQAAGEEKLS GSPRSLKTLL IETLAEIIRR
QMQEEVRRER RPTVTAYYFN NGQSPSLDIY HLPLQITGFL LAVHTPAYRA IWNELVQRSW
QRLETPTKRR KVAEPTEPRF NYLYEDLFTL PAQAARFVRT YFLRIPNRST ATDDPRREYS
TRREVDLVSW PLVELFVQEV MLMTDDRVAK LKELGDKLAD YTRYQGGKRF FRQFFTVQRS
DHFLTLLNKT NIDYTRYKRG TETLFDLDSF LTLFMEGEEV LRNDWRLMRD LVLIRMVEQL
RDWIANNADA IPSEEEVVEA