Gene Ccel_1482 details

Gene Information

Locus tag: Ccel_1482
Symbol:
ID: 7310251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1799141
End bp: 1800172
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 41%
IMG OID: 643608407
Product: CRISPR-associated protein Cas1
Protein accession: YP_002505815
Protein GI: 220928906
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype
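
The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly but that are consistent with its values: the start and end coordinates are 1-based and inclusive, and the gene length includes the stop codon. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues in a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(1799141, 1800172)
print(length)                  # 1032
print(protein_length(length))  # 343
```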


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAC TCTTAAATAC TGTTTATATT ACATCTCCCG ACAGCTATCT TTCTCTGGAT 
GGAGAAAACC TTGTTGTATT TAAAGGAAAT ACAGAGGCTG TACGCTTACC GCTGCACAAT
CTGGAAAGTA TTATTGCTTT TGGATATACC GGAGCAAGTC CGGCATTGAT GGGTGCGTGT
GCAAAACGTA ATATTTCTCT AAGTTTTATG ACCCAAAACG GCAAATTTTT GGCAAGGGTC
GTTGGAGAAG TAAGGGGTAA TGTAACTTTA AGAAAAGTAC AATTCAGGTT GTCGGATAAT
ATGGAAGAAA GCACGAAGAT AGCTAGAAAC TTTATCTTTG GGAAGATTTA TAACGGCAGA
TGGGTTATTG AACGTGCTAC CAGGGATTAT TCAGAGAGGC TTGACGTCAA TAAGCTTAAA
AGGGTGTCGG AAGGGTTGGC AAAAGCCTTG AATCTTGTTC TGAACTGTGA AAATTTGGAT
GAACTTCGCG GTTTTGAGGG AGAAGCTGCA ACACAGTATT TTAGTGTGCT CGATGACTTA
ATACTTCAAC AGAAAGACAA GTTTTTCTTT CACGGAAGAA ACAAACGTCC TCCTCTTGAT
AATGTGAATG CAATGTTGTC ATTTGTTTAC ACACTGCTGG CACACGATAC AGCAGCCACA
CTGGAAACTG TCGGTCTTGA CCCTTATGTA GGTTTCATGC ACAGAGACAG GCCGGGAAGA
ATATCTCTGG CCCTGGATTT AATGGAGGAA ATGCGAAGTG TATATGCTGA TAGATTCGTA
ATATCGCTAA TAAATAAAAG AGTTATAAAT GACAGTGGCT TTACTCAAAA AGAAGATGGT
GCCGTAATTA TGGATGATGA TACCCGCAGA ACTATTTTAC AAGCGTGGCA GAGCAGAAAA
CAAGAGAAAA TTACTCACCC ATTCTTACAG GAAAAACTGG AATGGGGACT TGTACCTTAT
GCTCAGGCAA TGCTTCTGGC AAGGTTTATC CGAGGAGATT TGGACGAGTA TCCTCCATTT
TTGTGGAAGT AG
 
Protein sequence
MRKLLNTVYI TSPDSYLSLD GENLVVFKGN TEAVRLPLHN LESIIAFGYT GASPALMGAC 
AKRNISLSFM TQNGKFLARV VGEVRGNVTL RKVQFRLSDN MEESTKIARN FIFGKIYNGR
WVIERATRDY SERLDVNKLK RVSEGLAKAL NLVLNCENLD ELRGFEGEAA TQYFSVLDDL
ILQQKDKFFF HGRNKRPPLD NVNAMLSFVY TLLAHDTAAT LETVGLDPYV GFMHRDRPGR
ISLALDLMEE MRSVYADRFV ISLINKRVIN DSGFTQKEDG AVIMDDDTRR TILQAWQSRK
QEKITHPFLQ EKLEWGLVPY AQAMLLARFI RGDLDEYPPF LWK
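
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be translated and its GC content computed. It assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (the two tables differ in which start codons are permitted), and it stops at the first stop codon. The names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG or TTG) would be translated
    literally here rather than as methionine; this gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGAAAAC TCTTAAATAC TGTTTATATT"))  # MRKLLNTVYI
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared against the GC content field of the record.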