Gene Cthe_0739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0739 
Symbol 
ID4810357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp900407 
End bp902131 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content40% 
IMG OID640106156 
Producthypothetical protein 
Protein accessionYP_001037167 
Protein GI125973257 
COG category[S] Function unknown 
COG ID[COG1520] FOG: WD40-like repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0166259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA AAACATCCAA AATCATACTT ACTATAAATA TAATAATACT GCTTTTGGCT 
GTTCCGGGGT TTATTATTCT GGATAAATAT CTTGAACGCT CAAATTCCAA TCCGGACAAT
ATCGTTGCGC CACCTTCTCC GGATGTGGCT TCCAATCCCG GTGAAACTAC CATACCGGAC
GTTACCGCAA GTCCGGAAAA CACTGACACC AAAAATTGGA CAGCAATAAC TCCTGATGAA
ATCGATATTG TAAAAGAGTG TCTTCCCGAA AGCCACAATT TTAAATGGGA CGTTATTAAA
GACGGCAAGA AACTTGATAC ATATTCAAGA GACAATACCG TTGTTTTTAA ACCGGCTGAT
GAATACAATG AAATTGACGG TGTAACCACC TTCAGAGGAA ACAACTACAG AAACTCCGCA
AGCTTTGGTT CTGCAAACGT CAGGGAAGAA AAACTCGAGA AAGTTTGGAG TATCAAAATA
GGATATATAG ATACATGGAC AGGTGTTGGC TGGAACGGTC AGCCGGCAAT TGTAAAATGG
AGCAACGAAC TTCGAAAAAA AATGAATTTG TTTCAGGATA AAAAGGATAA AAATGATCTC
AAGGAAGTCA TATATGCCAC TTTGGACGGA AAAATATATT TTCTCGACCT TGACGACGGT
TCCTACACCA GAAATCCAAT TAATGTAGGT GCTCCGCTCA AAGGAAGTGT AACCGTGGAC
CCCAGGGGTT ATCCCCTTCT CTATTCAGGT CAAGGCATTG ACGAAGTAAA AGGCCAAAAG
GTTTCGATAG GTTTTCGCAT ATACAGCCTT CTGGATCAAA AACTTCTCTA CTTTATAAAC
GGCCTTGACA ATACTGCTTT CAGATACTGG GGAGCTTTTG ACTCTTCCCC TCTTTTGCAC
AAAGAAACCG ATACGCTGTT TTTATGCGGG GAAAACGGCC TTTTGTATTC CATAAAGCTA
AATACGGATT ATGACCCTGC ACAACCTGCT ATTTCAATAA AGCCTGATAT TGTAAAATAC
AGATATGTTT CTCCCGTCAA CGGCAGACTT GGAACTGAAA ACTCCATAGC CGCTTTCAAA
AATTTCGGCT ACTTTGCTGA CAACAGCGGA ACTCTCCAGT GCGTTGACTT AAACACTCTG
TCTCCGGTAT GGATAAGAAA CATAACCGAT GATACGGACA GTACAATGGG TATTGAGGAT
TTAGGAGGAA ACAACGTTTA TATCTATATT GCAAACGAAG TTGACCTCCA GGGAGAAAAC
GGATACAGTT ATGTCAGAAA AATAAATGCT TTGACAGGAA GTCTTGTATG GGAAAAGAAA
TACAAATGCT CATATAACGC AGATACAAAC GGCGGAACAT TGGCCTCTCC CGTAATTGGA
AAAAACGAAA TCAGCAATCT GGTTATATTC AGCATAGCCA AATCCTATAA GAAAAACGGC
GGAAAGCTAA TTGCCTTTGA CAAAAATACC GGCGACGAAG TATGGGTTAT AGATTCGGAT
TTCTACAGCT GGAGTTCACC GGTTGACGTA TATACTGAAG ACGGCAAAGC TTATATCATT
CATTGCGATT CCGCTGGGTA TATGAACCTC ATTGAAGGCA AAAGCGGCAA AATCCTTGAC
AAAATACCTC TTGGCGGAAA TATTGAAGGT TCACCCGCAG TTTATGACAA TATGATTGTA
GTAGGCACAA GAGGTCAGCA AATATATGGA ATAAGAATAA AATAG
 
Protein sequence
MNTKTSKIIL TINIIILLLA VPGFIILDKY LERSNSNPDN IVAPPSPDVA SNPGETTIPD 
VTASPENTDT KNWTAITPDE IDIVKECLPE SHNFKWDVIK DGKKLDTYSR DNTVVFKPAD
EYNEIDGVTT FRGNNYRNSA SFGSANVREE KLEKVWSIKI GYIDTWTGVG WNGQPAIVKW
SNELRKKMNL FQDKKDKNDL KEVIYATLDG KIYFLDLDDG SYTRNPINVG APLKGSVTVD
PRGYPLLYSG QGIDEVKGQK VSIGFRIYSL LDQKLLYFIN GLDNTAFRYW GAFDSSPLLH
KETDTLFLCG ENGLLYSIKL NTDYDPAQPA ISIKPDIVKY RYVSPVNGRL GTENSIAAFK
NFGYFADNSG TLQCVDLNTL SPVWIRNITD DTDSTMGIED LGGNNVYIYI ANEVDLQGEN
GYSYVRKINA LTGSLVWEKK YKCSYNADTN GGTLASPVIG KNEISNLVIF SIAKSYKKNG
GKLIAFDKNT GDEVWVIDSD FYSWSSPVDV YTEDGKAYII HCDSAGYMNL IEGKSGKILD
KIPLGGNIEG SPAVYDNMIV VGTRGQQIYG IRIK