Gene Cthe_3215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3215 
Symbol 
ID4809517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3808630 
End bp3809880 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content29% 
IMG OID640108649 
ProductCRISPR-associated TM1812 family protein 
Protein accessionYP_001039603 
Protein GI125975693 
COG category 
COG ID 
TIGRFAM ID[TIGR02221] CRISPR-associated protein, TM1812 family
[TIGR02549] CRISPR-associated DxTHG motif protein 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAA AACTTTTTGC GTTTTTGGGA ATTGGAGATT ATAAAAGCGT AGAGTATTAT 
TTTCAAAATA AAAAAGAAGG ATATAAAACA GAGTATATAC AAGAGGCTAT TACTAAATTG
CTAAATGAAC AAGATTTAAA TGTTACAGTA TTTGTGACTG CAGAAGCACG GAAAAAGCAC
TGGGAACCGG AAAATAATAA AGGGCTTGAA AGCAGATTGA AAAAATTGAA CATTAATTGC
AAAGCAGTTA ATATACCTGA TGGAAAAGCA AACGACGATG TATGGAAAAT ATTCACAAGT
GTATATAGTG AAATTGAATT TAATGATGAA ATATATGTTG ATGTGACTCA CTCTCTTAGA
AATATACCAA TAATTTTTAT GTCTGTTTTG AATTATGCAA AGGTTACAAA AAATTGCACT
ATCAAAGGAA TATTTTATGG AGCGTATGAA GCTAAAGAAA ATGAGAGAGC ACCTATATAT
GATTTGACGT TGTTTGATCA GATTGGCGAA TGGAGTTCGG GAGTTGAACA ATTGCTTACA
ACAGGTGAAT GTGAGATGTT TTGTTCTACT GTGGAAAAAA CTTTGGATCC TTTATTAAGG
GAAGCAAAAG GAAAAGATGA ATTAATTAAA CTTGTAAAAA AGTGTTCAAA ACTTATCAAA
GAATTTTATA CAGATTTGAA ACTTGTAAGA GGAAAATCAG TTTTAGAAGA TGGTAGAAAG
TTGTATCAAG TTTTGTGTGA AATTAAGGCG TTAAATACGG AAAAACATAT TACTATGCAA
CCATTTTTTC ATATACTTGA ACGGGTTGAG AATCAGGTCG CTTTCTTTCA AAATGAAAAT
TTAATAGAAA ATATATTGGA GTGTGTTAAA TTATGTAAAA AGTTTGGACA GTATCAACAG
GCCTATACAT TTTTAAGAGA GAATATAATA AATTATGTTT GTATAAATAC AGGGCTTGAT
TGGAAAAAGG AAGATCCGGA TAGATTAAAA GCAGAAGAAT TAATTGGTAA ATTGTATATG
AGAAAGATAA AGAAAGTTCA AATAGAAGTT AGTGAAGACA TAAAGTCTAT ACTCGAAAAT
GGAGAGGATT TTATTTGTGA TGATGCAATT GAGTTATTTG GAGAGTTAAT TGAATTTAGG
AATGATCTTG ATCATGCACA ATTTAGAATG ATTAACCCTT CGAAAGATAA AATTACATGC
AAACTTGATT CGTTTATTGA GAGGTTTGAA AAATATTATA TTTCTAAATA A
 
Protein sequence
MAKKLFAFLG IGDYKSVEYY FQNKKEGYKT EYIQEAITKL LNEQDLNVTV FVTAEARKKH 
WEPENNKGLE SRLKKLNINC KAVNIPDGKA NDDVWKIFTS VYSEIEFNDE IYVDVTHSLR
NIPIIFMSVL NYAKVTKNCT IKGIFYGAYE AKENERAPIY DLTLFDQIGE WSSGVEQLLT
TGECEMFCST VEKTLDPLLR EAKGKDELIK LVKKCSKLIK EFYTDLKLVR GKSVLEDGRK
LYQVLCEIKA LNTEKHITMQ PFFHILERVE NQVAFFQNEN LIENILECVK LCKKFGQYQQ
AYTFLRENII NYVCINTGLD WKKEDPDRLK AEELIGKLYM RKIKKVQIEV SEDIKSILEN
GEDFICDDAI ELFGELIEFR NDLDHAQFRM INPSKDKITC KLDSFIERFE KYYISK