Gene Athe_2659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2659 
Symbol 
ID7407023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2795649 
End bp2796641 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content36% 
IMG OID643717025 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002574494 
Protein GI222530612 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAG ACCTTTATGT TTTCAATTCT GGGTTTTTAA GAAGGAAAGA TAACACCATC 
ATGTTTGAAA CAGATGAAGG CAAAAAGTAT TTTCCGGTAG AAGAGATAGA ATCAGTTTTT
ATATTTGGAG AGGTTGACAT AAATAAAAGA TTTTTGGAGT TCATGACAGA AAAGAATATA
TGTGTCCATT TTTTTAACAG GTATGAGTAC TACGTTGGCA CATACTACCC GCGCGAACAT
TACAACTCAG GCATTGTGAT ACTCAAGCAG GTAGAATTTT ACAACGACTA CAACAAAAGA
ATGACCATTG CAAGGTCAAT TGTTGAAGGA GCTGTTCTAA ATATGCTTGT GGTTCTGAGG
TACTACAATT CCCGTGGAAA TATGCTAAAA GATGAGATAG AGACAATCGA AAGAATGCGC
CATAACATAA ACTCCTGTGA TGATGTAAAT ACGCTTATGG CTTTGGAGGG GAATATAAGA
GAAATCTACT ACAGGTGTTT TAACAAGATA CTGGATAATG AAAATTTTAC ATTTGTCCGC
AGGAGCAAAA ATCCGCCTCT TGACAGGATA AATGCTCTGA TTAGCTTTGG GAATTCTCTT
TTGTATGCTA CAACTCTTGG AGAGATTTAC CAAACACAGC TTGACCCGCG CATAGGATAT
CTGCATTCCA CAAACCAGCG CAAGTTTTCA TTGAACTTAG ATATTTCCGA AATCTTCAAA
CCCATAATTG TTGACAGGGT AATTTTTTCT CTTGTTAACA AGAAAGTGCT GAGCGAAAAA
CATTTTGAAA AAGAACTAAA CGGCATTATA TTGAACGACC AGGGCAAAAA ATTGTTTGTC
TCTGAGTATA ACCAGAAGCT CTACTCTACT ATAATGCACC CGAAACTGAA TACTCAAGTA
AGTTACAAAA GGCTCATCCG AATGGAAGCA TACAAGCTCC AAAAATTGTT TTTGGAAAAT
ATAGAATACA AACCTTTTGT TGCAAGGTGG TAG
 
Protein sequence
MKKDLYVFNS GFLRRKDNTI MFETDEGKKY FPVEEIESVF IFGEVDINKR FLEFMTEKNI 
CVHFFNRYEY YVGTYYPREH YNSGIVILKQ VEFYNDYNKR MTIARSIVEG AVLNMLVVLR
YYNSRGNMLK DEIETIERMR HNINSCDDVN TLMALEGNIR EIYYRCFNKI LDNENFTFVR
RSKNPPLDRI NALISFGNSL LYATTLGEIY QTQLDPRIGY LHSTNQRKFS LNLDISEIFK
PIIVDRVIFS LVNKKVLSEK HFEKELNGII LNDQGKKLFV SEYNQKLYST IMHPKLNTQV
SYKRLIRMEA YKLQKLFLEN IEYKPFVARW