Gene Athe_0125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0125 
Symbol 
ID7408487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp147106 
End bp149433 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content34% 
IMG OID643714533 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_002572056 
Protein GI222528174 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3
[TIGR01596] CRISPR-associated endonuclease Cas3-HD 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCCT ATGAGGATAT AGATCTTTAC GCAAAGAGAT TTAAAACTGA GAAAGGTTAT 
GAATACCAGA CAATATACGA ACATACAATG ACACTACTGC AGAATATGGA AAAATTATTC
AAAGAGTATA GAGAGGAGAT CGAAGAAGGT TTCCAAAAAC TTCAGATTGA CCTTGAGATG
TTTAAATATT TGCTAAAATT AGCAATAATA TATCATGATT TGGGCAAGGC GAATTCAAAC
TTTCAAAGAA AGATAAGAGA CAAGACAAAA TCAAATGAGG TACCTCAAGT AAAGTGCTTG
TCTGTGGAGG TACCTCATAA CTTTATCTCA ATAGCTTTTG TGGCTTTGGA AGATGAGATA
GACAAGGGGA ATATCAGCCC AGAAGATTTT GAAAATTTGC TTTTTGCTAT TGCATTTTCG
CATGACAGGG GTTTTGATTT CAACTACCAA TATTTTGAAA AATACATTTG TGAGGATGTT
TCAAAGTATT TAACTAATAA GACATTACTT CCTTTGTTTA AAGAACTTCC TTCATTTCCT
ACCAAAGATG AAATAGAGAA GAGTTCAAAC TATGTATATA GAACTATTGA CTGGTTGAAA
GAGGTCATGC TCAAAAGTTA TACCGACCTT TACAATAAAA ACACAATTAT GAGAATTCTT
TTGAAAGGTT TTCTTCATAG ACTTGATCAT TCAAGTTCTG CAGGAATTAG TACAGAAGAA
GGAAAGATTA CAGATTTTTC TCATAAGGTT GAGAGCTACT TGAAAGAAAA GGGGAATTTC
GTTGGTTTTA AAGAATTTCA GCTAAAAGCT TTAGATTTCT CAAACCAGAA TGTTATTTTA
TTTGCTCCCA CAGGCAGTGG TAAGACAGAG TTTGGCTTAA ACTGGGCAGG CAGAAGCAAA
TTAATTTACA CTCTTCCAAT TCGTGTTTCA ATCAACGCAA TGTACGAAAG ACTGGCGAAA
ATTTTTGGTA GTAATAAGGT TGGAATATTG CATTCTGACA GTATGATTTA TCTGCTTGAA
AAGTATTCAC AATCAGCTGA AGATGTCCAA GAGTTAGAAT CCTTATTTGA CAATGTAAAC
CTTGCGAGGA ATTTAAGTTT TCCGATAATT GTAACAACCG GCGACCAGAT TTTTACATCT
GCTTTAAAAT GGCCCGGTTT TGAGAAAATA TATTCGTTAT TCTTATACTC AAAGATTATA
ATTGATGAGC CACAAAGCTA TTCTCCAGAG TCTTTGGCAA TTATAATAAA GACCTTAGAA
GAAATAGTAA ATTTAAATGG ACGATTCTGC TTGATGAGTG CCACTGTAAA TCCTCTTGTA
CTAAAGTATC TTGGAGATAT TTCTGAATAT TTACAGGCAT ATTCAGATGA AGAATTAAAA
AAATGCTGGT CGCATGTTGT TTCGGTCAAA CCACTTTCTA TTTTAGATTG TGTAGATGAA
ATTGTAAATT GTGGGAAGCA ACAAAATGTA CTTGTAATAT GCAACACAGT AAGAAGGTCA
CAGGAAGTTT ATAAGGCTAT AAAAGAAAGC ATAGGAAATT CTGATGATGT CCCGGTTGAA
CTTTTGCATT CAAGATTTTT GGAAGGGCAA AGAAGGCAAA AAGAACATTT TATTCTTTCA
AACCAGAGAA AAAATAGTAT TGTAATTTCA ACCCAGCTTG TAGAAGCGTC ACTTGACATA
GATTATGACG TTTTGTTTAC GGAACTGGCT TCTGCAGATT CATTGCTGCA GCGAATGGGG
AGAGTATACA GGAAAAGACC ATATGAGGGG CAAAAACCAA ATGTTATTAT CCTAACAAAG
GAACCAAGCG GAATTGGAAG AGTTTATCAG AAAGAAATTG TTACTAGAAC AGAAGAGTTT
TTGAAAAGGT TTGACGGAAG AAAAATTACA GAATATGACA AAAAAGAGCT AAATGAATAT
GCCTACGATG TTGAGGTACT TTCAAACACT AATTTCATGA AGAGTTTCAA AAAAGCATAT
GAACTTTTGA AGTTAGGATT TAAAGCAGAT AGAAAGATTG AAGCTCAAAA GATTTTTAGG
GATGTAGTTA CAGTTGAAGG TATACCGAGG AAGGTATTCG AGGTAAATGA AGAAACAATC
AGAGAGTGCC TGAAGAGAAT AGGTTCAAAA TCTTTAAGCC CTATTGAGAA ACTGCAGCTA
ATTTCAGATG TTCGAAAGTA CACAGTTACA GTTCCGGTGT ACTTTTTCGC AAAAGGTGGA
GCGAGTGAGT ATAGCAAAAA GCTGGGTATA TTCATTTTAA ATTGTGATTA TGACGACGAG
TTGGGCTTGT TACCACCTAA GGAATCAGAA AAAGATGATA TATGGTAA
 
Protein sequence
MKAYEDIDLY AKRFKTEKGY EYQTIYEHTM TLLQNMEKLF KEYREEIEEG FQKLQIDLEM 
FKYLLKLAII YHDLGKANSN FQRKIRDKTK SNEVPQVKCL SVEVPHNFIS IAFVALEDEI
DKGNISPEDF ENLLFAIAFS HDRGFDFNYQ YFEKYICEDV SKYLTNKTLL PLFKELPSFP
TKDEIEKSSN YVYRTIDWLK EVMLKSYTDL YNKNTIMRIL LKGFLHRLDH SSSAGISTEE
GKITDFSHKV ESYLKEKGNF VGFKEFQLKA LDFSNQNVIL FAPTGSGKTE FGLNWAGRSK
LIYTLPIRVS INAMYERLAK IFGSNKVGIL HSDSMIYLLE KYSQSAEDVQ ELESLFDNVN
LARNLSFPII VTTGDQIFTS ALKWPGFEKI YSLFLYSKII IDEPQSYSPE SLAIIIKTLE
EIVNLNGRFC LMSATVNPLV LKYLGDISEY LQAYSDEELK KCWSHVVSVK PLSILDCVDE
IVNCGKQQNV LVICNTVRRS QEVYKAIKES IGNSDDVPVE LLHSRFLEGQ RRQKEHFILS
NQRKNSIVIS TQLVEASLDI DYDVLFTELA SADSLLQRMG RVYRKRPYEG QKPNVIILTK
EPSGIGRVYQ KEIVTRTEEF LKRFDGRKIT EYDKKELNEY AYDVEVLSNT NFMKSFKKAY
ELLKLGFKAD RKIEAQKIFR DVVTVEGIPR KVFEVNEETI RECLKRIGSK SLSPIEKLQL
ISDVRKYTVT VPVYFFAKGG ASEYSKKLGI FILNCDYDDE LGLLPPKESE KDDIW