Gene Athe_2664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2664 
Symbol 
ID7407028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2801428 
End bp2803488 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content34% 
IMG OID643717030 
ProductCRISPR-associated protein, Csh1 family 
Protein accessionYP_002574499 
Protein GI222530617 
COG category 
COG ID 
TIGRFAM ID[TIGR02556] CRISPR-associated protein, TM1802 family
[TIGR02591] CRISPR-associated protein, Csh1 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.481224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATAGAGG AAATAGTTGA GATTGGTGAT ATTTTAATTG GTGATGCATC TGGAAATGAT 
GCCTATTTAT CTGTTCTTAC AGAAGATATT GAGGTTCCAA CTGGGGATGA AAAGAGATAT
GTTGCAAAAA TTGACTTTTC TACAAATGAG AAAAAGATTA ATATAGACTG TGCTGAAGAA
ATTGATGATG AAACTGCTAA AAAATATGTT TATGTTGGTT CGGCAGAAGG TGCAAATTCG
TCACAATGGT TTGCTTCTAC GACATCTTTT GCATATTTTC TAACAGAAAC CATTCCAAAC
CTTGTTGAAT GTAAAATACC TGTAGTATCT GATATATGTA AAAAAATATT GGATATGTAT
TTTGTCAAAG TCAAAGAATA TCTGAGCAGA TCTTCTGAAC TTATCGATGA AGAGAAGAGG
TATTTGCAGC AGAAGATAGA GAAAAAATAC GTTTATTTTT TAGATACTGA CAATATAGTG
GTAAATGACA ACAAGAGATT GACTGAAAAG CAGCTATCAC AGATTTACAA GGAACTAATT
AACAATACAA AATCAACTGA TAAGATTTTT AAGCAATTGA GAGATGTTTT TACAAAGGAA
TGTACAAATG GATTAAAAAA ATTAACAGAG ATAAAACCTC AGCAAATAGG CTTATATGTA
CTTTGTGTTG ATGGAAAGCC TTTGACAAGC TATCCAGAGT ATATTGACGC TGTTATTGCA
TACAAACGCC AGGTCAAAAA AGGCTCAAAA AAGGCTAAAA ACAAACAGGA AGGTAATATA
TGTTATATAT GTTTGGACAC AGATAACTTA TCATTCGAAG GATTTAAAAA GACAAGGTTT
AAATATTTTA CAACAGACAA AAATATATTT GCATCTTATC TTGACCAAAA GAACTATGCC
AAGAATATAA CTGTATGCGA AAAATGTCTT TTAAAGCTTG TGGCAGGAGA CATATTTTTA
AGAAATAAAC TTAAAACACA GCTTGGAACA TTTGATGTAT ATGTTCTGCC AACTTTTGTT
TACACAAGTG CAAAGCTAAC AAAAAACTAT TTAGAGGAAC TTTCCCAAAA TATCACGAAT
TCGATGAACA CTGCATGGAA CTATAATAGC TTGGAAAAAC TGCGAGATGA TATATACAAC
TTTCTTTCAT ATTTCGACCA AAACCACTAT TTTTTACTGA ACCTGATTTT TTACAGAGAA
GCACAAGCAA GTACAAAGAT AATAAGGTTT ATAGCTGACA TCAATCCTTC AATTTTCGAC
AAGATTTACA ATGCAGCTTC AAAAGTCTTT TCTCAATATA CTGACCTCAT TGGGAATGAC
CCTTCTTTTA GGATTTCCCT TGAAAGCATT TATTACAGCG TTCCAATAAG GCTCAAAAAC
ATAAGTGAGA GCAAAGAAGC GCAAAGGCTT TTGAACATCT ACGATGCTAT TTTTTCTGGC
AAGAGAATAG CAAGAGATGT TCTCATCGAA AACTTCATAA AGGCAGTAGG TGTTGTTGTT
TACGGGAAAG AAGGGTATAA CCTTTCAAAG TTTATAGAAA ACGACATTGC TTCAATGGTT
ATCAGAATGG TTTTTGTCAT AAGATTTTTG GAAATTTTGG ATTGTTTGGA GGTGAAAAGA
GGGATGGATA TTGCGCAGCT GAATTTGTCC GACGATCTCA AAAGCTATAT TCAACAAATG
AATTATGATG AGCCAAGGAC CGCTCTATTT TTGCTTGGAG TTTTGATTGG CGAGATAGGA
GCAAAACAAT ATCTTACTAC CAAAGATAGG CAAGACGACT CTGCTGGGCA CAAACCAATT
TTGAACAAGA TAAATTACAA TGGAATTGAT AAGCCAAAGC TTATAAGACT GTGCAATGAT
GTTCACAACA AGCTGAGGCA AGAAAAACTT TTACCGTACA CCGAGATGAT TTTTGCCGAG
ATGAAGAGGC TTTTGGACAA GCACATAGAT TCATGGAAGC TTGACAAATA CGAAACTCTA
TTTTACATCC TCTCTGGCTA TGCATACAAA ACTCAAAAGG TTATATTAAA TGCTTTGAAC
TCTCAGGATA CATCTAACTA A
 
Protein sequence
MIEEIVEIGD ILIGDASGND AYLSVLTEDI EVPTGDEKRY VAKIDFSTNE KKINIDCAEE 
IDDETAKKYV YVGSAEGANS SQWFASTTSF AYFLTETIPN LVECKIPVVS DICKKILDMY
FVKVKEYLSR SSELIDEEKR YLQQKIEKKY VYFLDTDNIV VNDNKRLTEK QLSQIYKELI
NNTKSTDKIF KQLRDVFTKE CTNGLKKLTE IKPQQIGLYV LCVDGKPLTS YPEYIDAVIA
YKRQVKKGSK KAKNKQEGNI CYICLDTDNL SFEGFKKTRF KYFTTDKNIF ASYLDQKNYA
KNITVCEKCL LKLVAGDIFL RNKLKTQLGT FDVYVLPTFV YTSAKLTKNY LEELSQNITN
SMNTAWNYNS LEKLRDDIYN FLSYFDQNHY FLLNLIFYRE AQASTKIIRF IADINPSIFD
KIYNAASKVF SQYTDLIGND PSFRISLESI YYSVPIRLKN ISESKEAQRL LNIYDAIFSG
KRIARDVLIE NFIKAVGVVV YGKEGYNLSK FIENDIASMV IRMVFVIRFL EILDCLEVKR
GMDIAQLNLS DDLKSYIQQM NYDEPRTALF LLGVLIGEIG AKQYLTTKDR QDDSAGHKPI
LNKINYNGID KPKLIRLCND VHNKLRQEKL LPYTEMIFAE MKRLLDKHID SWKLDKYETL
FYILSGYAYK TQKVILNALN SQDTSN