Gene Cthe_2057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2057 
Symbol 
ID4810653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2446982 
End bp2448814 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content33% 
IMG OID640107462 
ProductCRISPR-associated TM1812 family protein 
Protein accessionYP_001038457 
Protein GI125974547 
COG category 
COG ID 
TIGRFAM ID[TIGR02221] CRISPR-associated protein, TM1812 family
[TIGR02549] CRISPR-associated DxTHG motif protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAT TCATTTCTCT TCTTGGTACG TCGAAATACG TTCCTTGTAA TTATTTTATT 
AAAAACAGGG AAGAACTCAA AATTAATGAT TGCTGTTATG TTCAGAAAGC AATTTTGGAT
ATTTTGAGAC AGGAAAATGT TATACCTGAT AAAATAATTA TTTTTACTAC TGATGAAGCA
CATGTAAGTA ATTGGGAAAA TAACAAATGG AATAACAAAG GAGACAAAAA TGATGTTCAA
GAGATTGAGG ATGAACTGCA AAAACGGCCG GGATTAAAAG GTGAGCTTGA AAAATACAAA
GAGTTAACAG GTGCTGATTT TAAAAGTGTA AGAATACCTA ATGGAATTAG AGAAGAAGAT
TTATGGGAAA TATTTAGAAC GATATTTGAT GAAATAGATG AAAATGATGA AATAATATTT
GATGTTACCC ATTCATTCAG ATATTTGCCA ATGTTGGTTT TTATAGTGAT AAATTACGCC
CGGGTAGTAA AAAAATGCAA GCTTAAGAGT ATTTATTATG GAGCGTTTGA AGTATTGGGC
ACTCCCGAAA CAGTGGCAAA AATTCCATTG GAGGAAAGAA ATGCTCCCAT TTTTGATCTC
ACATCTTTTG TGGATTTGTT TGACTGGACG ATGGGAATTG ACAGATATTT GAATACCGGC
GATGTATCAG TAGTGCATGA ACTTACAGAT ATACAAATAA AAAGAGTAAA TAAGGAAAAA
CTTAAATTTA TTTCAAAATC AGAACAAGGA GTTGATCCAA AAACTTTATT TATGGATTCG
AGGCAGTTAA GAGCACTATC TGAATCAATG AAAAATTTTA GCGACGTAGT GTTTACATGT
AGAGGTTTGG AATTGACACG GGTTGCATGT GAATTAAAAG AAAAACTGAG AGAAGTTACT
GAAAGTGCTT CAAGACAGCA TATAATACCA TTGATGCCGG TTCTTGAAAT GATGAAAGAA
AGATTTGATA AATTTAGCAA AGATAACGAT TACATAAATA TTATTGAAAC TGCGAGATGG
TGTGCTGACA ATAAAATGTA CCAGCAAGGA CTGACAATTT TGGAAGAAGG ACTGATTAGT
TTTGGCTGTG AAAAGTTGGG TTATGAAAAT TTAAGCGATA AAATAGACCT TGATAAGAGA
AGAAAGATAG GTTCATATGC TTTTGTTGTT AGTAATGATT TTGGAAGAAA AAGCGACAAT
AAAAAAGAGA AGATACCTTT ATTAAATATA AAATCAGATG TTATAACTGA TCTGTTGATA
CTAATAGATG AAATAAGCAG TATCAGAAAT GACATTAATC ATGCCGGTTG GAGGAAAGAT
CCTTCTGAAG CCGGTGCTTT TGGGGAACAA CTTAAGAATT TTATTTCACG TGCGGAAAAA
ATTATTTCTC CAGAAAACTT TTCCACAGAA TCAGATAAAT GTATTAATAA TGACAGTGTA
GACAAGGAAA AGAAAATGCT TTTGATATTG TCCCATAAAC TTGTGCCCAA ACAGGAAGAA
GAAGCAAGAC AACGCTTTGG CATATCAAAG TTTTTGCCAA TGCCCAATGA GTTACATAAT
AAATGGTCAA ATATACCTCC TGAGCTGGAG GATTTACGAG ACTATCTCAA CGACATTTTG
GAATGGATTG ATTTAAATGC GCAAGAAGGA GATTATGCTC TGATACAGGG AGATTACGGA
GCTACATTTA TTGCTGTCAA CCATTGCCTG GCCAAAGGAA TTATTCCTGT TTATTCAACA
ACTCACAGAA TTGTGCGAGA GGAAAAAAAT GAAGACAAGG TTATAAGTGT AAGAGAATTT
GAACATGTAA TTTTAAGAAA ATATGAAAAT TAA
 
Protein sequence
MLKFISLLGT SKYVPCNYFI KNREELKIND CCYVQKAILD ILRQENVIPD KIIIFTTDEA 
HVSNWENNKW NNKGDKNDVQ EIEDELQKRP GLKGELEKYK ELTGADFKSV RIPNGIREED
LWEIFRTIFD EIDENDEIIF DVTHSFRYLP MLVFIVINYA RVVKKCKLKS IYYGAFEVLG
TPETVAKIPL EERNAPIFDL TSFVDLFDWT MGIDRYLNTG DVSVVHELTD IQIKRVNKEK
LKFISKSEQG VDPKTLFMDS RQLRALSESM KNFSDVVFTC RGLELTRVAC ELKEKLREVT
ESASRQHIIP LMPVLEMMKE RFDKFSKDND YINIIETARW CADNKMYQQG LTILEEGLIS
FGCEKLGYEN LSDKIDLDKR RKIGSYAFVV SNDFGRKSDN KKEKIPLLNI KSDVITDLLI
LIDEISSIRN DINHAGWRKD PSEAGAFGEQ LKNFISRAEK IISPENFSTE SDKCINNDSV
DKEKKMLLIL SHKLVPKQEE EARQRFGISK FLPMPNELHN KWSNIPPELE DLRDYLNDIL
EWIDLNAQEG DYALIQGDYG ATFIAVNHCL AKGIIPVYST THRIVREEKN EDKVISVREF
EHVILRKYEN