Gene Cthe_3217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3217 
Symbol 
ID4809519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3811624 
End bp3812916 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content34% 
IMG OID640108651 
Producthypothetical protein 
Protein accessionYP_001039605 
Protein GI125975695 
COG category 
COG ID 
TIGRFAM ID[TIGR02710] CRISPR-associated protein, TIGR02710 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCTCA CACTTGGTAC GTCTTATGAA CCGTTGGTAT TATCCATTTC TGTATTAAAA 
CCTGAGAAAG TATTGATTTT ATATACTGAT AAGTCTCATC ATCTTCTCGA TGATGTAATA
GAATTTACAA AACTAAAACC AAGTCAATAT GTAGCAACTG ATGTGGATGC GGAAAATCCG
TTGCAATTAT ACAGAAAGAT AAAAGATGTG TATGAAAAAT GGGGAAGACC CAGGAATATA
TATGTTGATT TTACAGGTGG AACAAAGTCA ATGGCTGCAG GTTGTGCAAT GGCAGGTTCG
GCTATAGGGG CGAAACTGAT TTATATTGCC GGCAATTTTC TTACAGATCT TAGGAAGCCT
GAACCGGGAA GCGAAAAACT CTGTTATATT GATGATCCTT ATACTGTTTT TGGAGACTTG
GAAAGAGAAC AGGCCATTTC GTTATTTAAC AAAATGGACT ATGTATCTGC GTACAGGATA
TTTGAAGAAT TGGAACAAAG AGTCCCGGGT ACGAAAGAAT ATAGTGCTTT AAAGTATATA
TCAAAAGCTT ATAATGCTTG GGATAGCCTT GATATAAGCG GAGCAGCGGA TAATTTGTCA
AAATGCTATG AGATAGTTGT AACAGAAGGA AAGATAGATA AAAGTTTTGT TTTGAATTCC
CATATTGAGA AACTGGAAAA ACAATTGGGA GTTATTAGGG TATTGGAAAA AATTCATTGC
AGTGAAGAAG CGGCTAAAAA TAAAAGTGTA ATTTTTGACA ATATAGGCTA TTTGATTGCC
AATTTATATC AGAATGCAAT GAGAAGAGAA AAGCAAGAAA AATATGAGAT GGCATCACTT
CTTTTATATC GTATACTTGA AATTGTTGAG CAAAAAAGGT TATGGAATTA CGGTGTGGAT
ACCTCGGATG CAGATTTTAC CAAATTATGT GAAGATGAAA AAGTTTTGCT TGAGAAGGCC
AATAAAATCA TAAGAAGTGT AAAAGGGTTT AATGAATGGA AGCAACTAGA CAAGAAAATA
AGCCTTTTAG CCGGATATAT ACTCCTTGCG GCAGTTGGGG ATGATATAAT AAAAACTAAA
AAACCCGGTA AAGAGATTGA CAGTATAAAT AGATTGAGAA ACAAAGTTGA GGCAAGAAAT
AATAGTATTT TTGCTCATGG TTATGAATTT ATTAGTAAAG AAAAGTATAA TGAATTTAAA
AAAGTGGTTG AAGATTATAT GAACTTGTTG TGTTCTATAG AAGGAATAGA TAAAGAAGAA
CTGTTTGATT CATGCGAGTT TATAAAGCTA TAG
 
Protein sequence
MVLTLGTSYE PLVLSISVLK PEKVLILYTD KSHHLLDDVI EFTKLKPSQY VATDVDAENP 
LQLYRKIKDV YEKWGRPRNI YVDFTGGTKS MAAGCAMAGS AIGAKLIYIA GNFLTDLRKP
EPGSEKLCYI DDPYTVFGDL EREQAISLFN KMDYVSAYRI FEELEQRVPG TKEYSALKYI
SKAYNAWDSL DISGAADNLS KCYEIVVTEG KIDKSFVLNS HIEKLEKQLG VIRVLEKIHC
SEEAAKNKSV IFDNIGYLIA NLYQNAMRRE KQEKYEMASL LLYRILEIVE QKRLWNYGVD
TSDADFTKLC EDEKVLLEKA NKIIRSVKGF NEWKQLDKKI SLLAGYILLA AVGDDIIKTK
KPGKEIDSIN RLRNKVEARN NSIFAHGYEF ISKEKYNEFK KVVEDYMNLL CSIEGIDKEE
LFDSCEFIKL