Gene Acel_1950 details


Gene Information

Locus tag: Acel_1950
Symbol:
ID: 4484920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2210483
End bp: 2211370
Gene Length: 888 bp
Protein Length: 295 aa
Translation table: 11
GC content: 62%
IMG OID: 639730742
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_873708
Protein GI: 117929157
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype
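
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that check follows, assuming 1-based inclusive coordinates and a gene length that includes one terminal stop codon (both are assumptions; the record does not state its conventions). The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 2210483, End bp 2211370.
length = gene_length(2210483, 2211370)
print(length, protein_length(length))  # prints: 888 295
```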


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGGAC CGTGGCGTGT CGTGGATCTG TCCGAACTGT CCGGTGAGGT GCATGCGGCT 
CAAGGGGCAC TTCTTGTCGG TGACGAGCGG GTGCCGCTGG TTGACGTCGC GATGATGCTT
ACGGGGCCGT ACGTCTCCCT GCACGGCAGC GTTATTGACC GCGCTGCGGC GTTCGGGGTA
GGCGTGGTGC ACTGCGACTG GCGGGGTGTT CCGGTAGCCG CTACATTGCC GTGGTCGACT
CACAACCGGG TGGCGGCTCG TCATCGCGCG CAGGCGGAGC TTTCGTTGCC TAGGCAAAAG
AACGCATGGA TGAATATCGT GAAGACAAAG ATCCGCAATC AGGCCGCTGT GCTACGGGCG
CTTCGCCGAG ACGGTGTGGC GCAACTGGAG CGACTCGCGG CGCAGGTTCG ATCAGGTGAT
GCAAGCAATG CTGAAGGGGC TGCCGCGCGC GTGTATTGGG CTCGCTTGTT TCAGGACAAG
CACTTTCGTC GCGTTCCGCG AGCACGTGAC GTTGTCAACG GCCTCCTAGA CTACGGCTAT
GCGATCTTAC GTGGTTGTTG CCTTCGCGCG GTGGTCGGTG CGGGACTCGC GCCGTCCCTC
GGCCTTTGGC ACCGGCGCCA CGATAATCCG TTTACGCTGG TTGACGATCT TATCGAACCA
TTCCGACCTG CGGTGGACAA GACGGTCATA GAGATCGTCA CTGCGGGCGC ATCGGGTCTT
GACCGTCCCA CTAAGCGCCT TCTTGTAGCG GTGCTTGATC ACCAATTTGA TGCGAGCGGA
GCGACCGTGG GAACAGCCGT GGAGCGGTTT GCCCAGCAGG TCGGTCGGTA CGTCGAGGGC
GAGATACGAA GTCTGAGACC ACCCGCCATG GAGCTGTCGC ATGCTTAA
 
Protein sequence
MTGPWRVVDL SELSGEVHAA QGALLVGDER VPLVDVAMML TGPYVSLHGS VIDRAAAFGV 
GVVHCDWRGV PVAATLPWST HNRVAARHRA QAELSLPRQK NAWMNIVKTK IRNQAAVLRA
LRRDGVAQLE RLAAQVRSGD ASNAEGAAAR VYWARLFQDK HFRRVPRARD VVNGLLDYGY
AILRGCCLRA VVGAGLAPSL GLWHRRHDNP FTLVDDLIEP FRPAVDKTVI EIVTAGASGL
DRPTKRLLVA VLDHQFDASG ATVGTAVERF AQQVGRYVEG EIRSLRPPAM ELSHA
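
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation such as a GC percentage. A minimal sketch follows, assuming the blocks are separated only by spaces and line breaks; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool referenced by this record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGACAGGAC CGTGGCGTGT CGTGGATCTG TCCGAACTGT CCGGTGAGGT GCATGCGGCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content reported in the Gene Information section.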