Gene Cthe_3201 details


Gene Information

Locus tag: Cthe_3201
Symbol:
ID: 4809503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3791468
End bp: 3793309
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 32%
IMG OID: 640108635
Product: CRISPR-associated Csh1 family protein
Protein accession: YP_001039589
Protein GI: 125975679
COG category:
COG ID:
TIGRFAM ID: [TIGR02556] CRISPR-associated protein, TM1802 family
            [TIGR02591] CRISPR-associated protein, Csh1 family
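
The start, end, and length values in this record agree with one another if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check follows; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3791468, 3793309)
print(length)                     # 1842
print(protein_length_aa(length))  # 613
```

Both results match the Gene Length and Protein Length fields listed above.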


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.573889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTTGCAG GTGTATTGCA GCTGGGACAG TACGCACTCA ATAAAAAGAG TACTGACACC 
GAGGAATATC TTCAAGTCAT TGAAAACCCT AATGATAAAG GCAATTACAA TCATGTCTTA
AAAATTGCAT TTGAATCTAC CGAAGGAAAC ATAGTCTATC GTGGGGTTGA ATACGAAGAG
TTCTCCTCGA AAAAAATAAA TAACTATGCA TACAAAAAGG GAAGTGCAAG GGGTGGCGAT
GTAACACCTA CTTCAAAGTA TACAGATTCA ATGAAGACTT TAAACAAGAT AATGATATCT
TTTAATGATA TTTTAAAGAG TGCAAATCAA AACAATGACG AACAAAAAAT ATTTAAAGGC
ATTTATGAAT ACATGGTTTC AAATCAAGAA AACATTGCAA ACGATATTAC GGAAAAAATC
AAATCCATCT CGTTAAAAAA AGGAGAATCT TTCATTATAA CTGTTACTTT GTTTGATAAC
AATACTGAAA AATATATGGG CAATTTTCAA TTAATAAGAA ATCATCTTGC CAGAATTTTG
AATGAACAGT ATTACAATAA ATACGGTAAA ACATCAAAAG GTAAAGGTAT CTGCTATTAT
TGTAAAAATG AAGGTGAAGT TTTCGGGTTT GTCAATACAT ATAATTCCTA CACGGTTGAT
AAAATAGGAT TTGTAACCGG TGGTTTCAAG CAGGAAAACG CGTGGAAAAA CTATCCTGTC
TGTTCTTGTT GCGCACAGAA GCTGGAGCAA GGTAAGAAAT ATATCAGAGA AAATCTTACA
TCTAAATTTT CAGGTTTCGA CTATTTCGTA ATTCCTAAAG CAGTAATAAG TGATGAACAT
GATGAAGCGG AATTTATAGA GACCCTCGAA GAATTTGAAA AAAATACGAA CTTTTCCACC
CAAGAATCCA CAAAGCAAAA CTTGCTTGGC AGTGAAAAAG ATTTTCTGGA GATAATGAAG
GACAGCAAAA ATTATCTTAA TTACAATATG CTTGTTTTTA AAGAAGAGCA GTCCGGAAGC
GTTTTTAGGA TACTTCTTTA TATAGAGGAT ATTGTGCCAA GCAGGGTAAA AAACATATTG
AGAGTAAAAG ACAGAGTAGA TGAAACAGTG CTGTTTAAAA ATCTTCCGGG TAAAGATAAT
GCAACTTATG ATTTAAAGTT TGGATTCGAT AAAATAAGGA CCTTTTTCCC GAATAACAAG
ACAGAAGGAA ATTTTGACAA AAGTTTTCTT GAGATTTTAA ACAATGTGTT TACATATAAA
AAGATAAGTT ACAAATTTCT TTTAGGGAGA ATGATTTCAA AAATAAGGAG TGATTTTGCA
AGGGAGGAAT ATGTAAAAAA CCTTGTACTT CAGGCGCTGA TGTGCATTAT GTTTATTGAT
AAGCTTAATT TGCTCAGTGG TAAAGGGAAA GAGGTGCAAA AAATAATGAT TGAGAAGACC
GAGAAAAACA AAAAATATCT TGATTTTTTT GAAAACGAAA GTTATAAGGA CGTTTTCAAT
TCAGATTATA AGAGAGCAGT TTTTTTGACA GGAGTGCTGA CGGAAAAGTT ATTAAACATC
CAATACAAAG AAAGAGGAAG CAAACCTTTC TACAGCAGGC TCAATGGCTT GAAACTGAAT
AAAAACATAG TAAAAAGAAT ATATACCGAG GCCATTAATA AATTAAATGA ATATAACAAG
AACTATTATA AAGAATTGGA ATATCTTATT GGCATGTACA TGTTGTCGGA AGAATCCCAA
AAGAATGTTT CCGATGATGA AATCAGTTTT TATTTTGTGC TTGGAATGTC ATTGGCAAGG
TTCTTTAATG AAGAAAAGAA AGACGGAGAG GATGAGGAAT AA
 
Protein sequence
MLAGVLQLGQ YALNKKSTDT EEYLQVIENP NDKGNYNHVL KIAFESTEGN IVYRGVEYEE 
FSSKKINNYA YKKGSARGGD VTPTSKYTDS MKTLNKIMIS FNDILKSANQ NNDEQKIFKG
IYEYMVSNQE NIANDITEKI KSISLKKGES FIITVTLFDN NTEKYMGNFQ LIRNHLARIL
NEQYYNKYGK TSKGKGICYY CKNEGEVFGF VNTYNSYTVD KIGFVTGGFK QENAWKNYPV
CSCCAQKLEQ GKKYIRENLT SKFSGFDYFV IPKAVISDEH DEAEFIETLE EFEKNTNFST
QESTKQNLLG SEKDFLEIMK DSKNYLNYNM LVFKEEQSGS VFRILLYIED IVPSRVKNIL
RVKDRVDETV LFKNLPGKDN ATYDLKFGFD KIRTFFPNNK TEGNFDKSFL EILNNVFTYK
KISYKFLLGR MISKIRSDFA REEYVKNLVL QALMCIMFID KLNLLSGKGK EVQKIMIEKT
EKNKKYLDFF ENESYKDVFN SDYKRAVFLT GVLTEKLLNI QYKERGSKPF YSRLNGLKLN
KNIVKRIYTE AINKLNEYNK NYYKELEYLI GMYMLSEESQ KNVSDDEISF YFVLGMSLAR
FFNEEKKDGE DEE
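
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any calculation such as GC content. The following is a minimal Python sketch of that clean-up, assuming the block spacing and line breaks are display formatting only. The helper names are hypothetical, and the GC figure in this record may have been computed by the database in a different way.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First displayed line of the gene sequence, copied from the record.
first_line = "TTGCTTGCAG GTGTATTGCA GCTGGGACAG TACGCACTCA ATAAAAAGAG TACTGACACC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string of 1842 bases if the record is internally consistent; `gc_percent` on that string can then be compared with the GC content field.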