Gene Ccel_3119 details

Gene Information

Locus tag: Ccel_3119
Symbol:
ID: 7311713
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3653330
End bp: 3654226
Gene Length: 897 bp
Protein Length: 298 aa
Translation table: 11
GC content: 33%
IMG OID: 643610022
Product: CRISPR-associated protein Cas1
Protein accession: YP_002507390
Protein GI: 220930481
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype
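
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 897 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3653330, 3654226)
print(length)                     # 897
print(protein_length_aa(length))  # 298
```

Both results agree with the Gene Length and Protein Length fields in the record.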


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.679902
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGATGGA GAAATATAAT AGTTTCAAAT CCAACTAAAC TGAAATTAAA ACAGAATAAC 
TTATGGGTTG AACAATCAGA TGGCTTTAGT ATACCGATTG ATGATATAAA TACAATAGTA
CTTGATAGTG CGGATGTTAC GATTACATCC GCACTATTAT CAAAATTGGC AGAAGAAGAC
ATTGCTTTGT ATTCTTGTGA TGGGAAGCAC ACACCGAATG GAGTACTTCT TCCATTCAGT
TGTCATAGTA GACAATACAA AATTGTAAAA ACTCAAATAA ATCTTTCAGC ACCTTTTAAA
AAAAGGTGCT GGCAAAGAGT TGTTCAACAG AAAATAGAAA ATCAGGCCTT TTGCTTAAAT
ATTCTAGAAT TAAAAGGAAG AGATGAATTA ATAAATCTAT CTAAGAGTGT TCTATCTGGT
GATTCAACTA ATGTAGAGGC TCATGCTGCA AAATATTATT TCTCTGTTCT ATTCACAAAC
TTCAAAAGGG GTATGCAGGA TAACACAAAC TATGCATTAA ACTATGGCTA TTCAATATTA
AGGGGAGCTG TAGCCAGAAC CATAGCATCG TATGGATTTA TCCCTTCTAT TGGAATACAT
CATAGAAGCG AATTGAATAA TTTTAATCTT GCTGATGACT TTATCGAACC GTTCAGACCA
ATTGTTGATA TGTGGGTAAA ACAAAATATA AATGAGGATA CACTTTTAAC ACCTAAACAT
AAGTTAAATC TTATAAGTTT GTTGGGTTAC GAATGTGTCT TTGAGGGAAA AATAATATCT
ATAAGGTCTG CAATCGAAAA GGTGATTTCA AGTTTTTCAA GTTCTTGTGC AAAGAACGAT
TATAGTTTAT TGAAATTACC TGAAATAATA CCATTAGAGG TACATGCAAA TGAGTGA
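
The GC content field in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of ten bases separated by whitespace); the helper name `gc_fraction` is hypothetical, and how the database rounds the percentage is not stated in the record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as an example input; pass the whole
# sequence to compare against the GC content listed in the record.
first_line = "GTGGGATGGA GAAATATAAT AGTTTCAAAT CCAACTAAAC TGAAATTAAA ACAGAATAAC"
print(f"{gc_fraction(first_line):.0%}")
```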
 
Protein sequence
MGWRNIIVSN PTKLKLKQNN LWVEQSDGFS IPIDDINTIV LDSADVTITS ALLSKLAEED 
IALYSCDGKH TPNGVLLPFS CHSRQYKIVK TQINLSAPFK KRCWQRVVQQ KIENQAFCLN
ILELKGRDEL INLSKSVLSG DSTNVEAHAA KYYFSVLFTN FKRGMQDNTN YALNYGYSIL
RGAVARTIAS YGFIPSIGIH HRSELNNFNL ADDFIEPFRP IVDMWVKQNI NEDTLLTPKH
KLNLISLLGY ECVFEGKIIS IRSAIEKVIS SFSSSCAKND YSLLKLPEII PLEVHANE
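
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The gene begins with GTG, which normally encodes valine but is read as methionine when it is the initiation codon. The sketch below is an illustrative pure-Python translator and not the tool used by the database; the names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical. It assumes the input is a complete CDS that starts at its initiation codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons allowed by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator methionine, even for GTG
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":            # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGATGGA GAAATATAAT AGTTTCAAAT"))  # MGWRNIIVSN
```

The output matches the first ten residues of the protein sequence. Applying the function to the full gene sequence should stop at the terminal TGA codon.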