Gene Moth_0495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0495 
Symbol 
ID3832818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp509899 
End bp510792 
Gene Length894 bp 
Protein Length297 aa 
Translation table11 
GC content57% 
IMG OID637828429 
ProductCRISPR-associated Csh2 family protein 
Protein accessionYP_429368 
Protein GI83589359 
COG category[L] Replication, recombination and repair 
COG ID[COG3649] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR01595] CRISPR-associated protein, CT1132 family
[TIGR02589] CRISPR-associated protein, Csd2 family 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTT ATACCAATCC CGAAGTACGC CATGATTTTG TCCTCTTATT CGACGTCCGG 
GACGGCAACC CCAATGGCGA TCCCGATGCC GGCAATCTGC CGCGCCTTGA CCCCGAAACC
ATGCAGGGTC TGGTGACCGA CGTCTGCCTT AAGCGCAAAA TCCGCGACTG GGTGGATATG
ACCCGCGGCA GCGAGGCTAA CATGAAGATT TATGTCCAGC ATCACGGCAT TTTAAACGCC
CAGCACCAGC GAGCCTATGA CGCCATCGGG GAAAAATCCA CCGGCAGCAA ACAAAACCGG
GAGATCGTCG ACAAGGCCAG GCAGTGGATG TGCCAGAACT TCTATGATAT CCGCATGTTC
GGCGCCGTAA TGACTACCGG CGTCAACTGC GGCCAGGTGC GGGGGCCAAT GCAGCTAACC
TTTGCCCGGT CAATCGACCC CATCGTTCCC CTGGACATCT CCATCACCCG CGTCGCCATC
ACCAGGGTAG AAGATGCCGC TACAAGCGAA CAGGGTGAGG GAGGCAAGGT CACAGAAATG
GGCCGTAAAA CCCTGGTACC CTATGGCCTG TACCTGGGCT ATGGATTTTT CAACCCCCAT
TTTGCCGCCG ATACTGGCGT CAGCGCCGCC GACCTGGAGA TCTTCTGGGA GGCCCTGCAG
CGGATGTGGG ATGTGGATCG TTCCGCCAGC CGCGGCATGA TGGCCTGCCG GGGACTTTAT
ATCTTCAGCC ATGCATCCGC CCTGGGCAAT GCTCCGGCGG ATAATCTCTT TAAACTCATC
ACCGTTAAAC GCCGGGATGG AGTAAAAGCA GCGCGCTCTT TTGCCGACTA CCAGGTGACA
ATTAATGAAG AGGACTTGCC GCCTGGGGTA ACTCTGACAC GGTTAGTGGG ATAG
 
Protein sequence
MTVYTNPEVR HDFVLLFDVR DGNPNGDPDA GNLPRLDPET MQGLVTDVCL KRKIRDWVDM 
TRGSEANMKI YVQHHGILNA QHQRAYDAIG EKSTGSKQNR EIVDKARQWM CQNFYDIRMF
GAVMTTGVNC GQVRGPMQLT FARSIDPIVP LDISITRVAI TRVEDAATSE QGEGGKVTEM
GRKTLVPYGL YLGYGFFNPH FAADTGVSAA DLEIFWEALQ RMWDVDRSAS RGMMACRGLY
IFSHASALGN APADNLFKLI TVKRRDGVKA ARSFADYQVT INEEDLPPGV TLTRLVG