Gene Cthe_1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1204 
Symbol 
ID4809896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1435506 
End bp1436774 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content42% 
IMG OID640106627 
Producthypothetical protein 
Protein accessionYP_001037629 
Protein GI125973719 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02877] sporulation protein YhbH 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAGGAT TAAACCCCTT AATCCCGCAA AAATCAAACC GGGGGGTGAT TGATGTGGCT 
ATTTTCAGAG AATATGTGAG CGGCGGAAAA GACAGAGCTG CTGAAGACAG GCGGCGCCAC
AGGGAATTGG TTGAAGAATC CATAAAAAAG AATATAGGTA ATATCATTGC AGAGGAAAGT
ATTATCGGTC AGAGCAAGGA CAAAAAAATA AAAATACCGA TAAGAAGCAT TAAAGAATAT
CAGTTTGTAT ACGGGAAAAA CGGTCCTTCG GTGGGTTCCG GAGACGGAAC GGAAAAGCGC
GGGGAAAAAA TAGGCGAGGA GAAAACGGCT AACGGCGGAC AGGGAGTTGG GCAGGCCGGA
AACCAGGAAG GGGAAGAAAT TTATGAGACA GAAATTACAA TTGAGGAACT CATAAATTAT
CTCTTTGATG ATTTGAATCT TCCGGATATA GATAAAAAGA GGATTGCGGA GCAGGAATCC
ATAAGAAGCT ACAAGAACCT GGGTTATCAG CGAAAAGGAA TACCCCCAAG ACTTGCCAAG
AAGCGTTCCG TTATTGAAAA GATAAAAAGA AAGCAGGCAT ATCTGAGAAA CAGCAGAGAA
TTGGGCGATT TGGACGAAAG TGCCGAAGAG GATATCGCCG CGCAGGAGAC CTTGGACGGT
GTCAGAAAGA GATTTCCTTT CAGCGAGGAT GATTTGCGAT ACAGAAGGGT CAGAGAGGAC
CGCAAAAAAG ATTTCAATGC AGTGGTTATC TGTATTATGG ATGTTTCCGG TTCCATGGAC
CAGACAAAGA AGTATCTTGC CCGAAGCTTT TATTTTTTGC TGTACCAGTT TATAAGATTG
AAATATGCCA ATGTTGATGT TGTTTTTATA GCCCATACCA CCACTGCGAA AGAAGTCAGT
GAAGATGAAT TTTTTCACAG AGGCGAATCG GGAGGAACAT ATATCAGCAG CGGCTATGAA
AAAGCTCTTG AAATAATCGA GCAGAGATAC AACCCCAACA GTTGGAATAT ATATGCTTTT
CATTGCAGTG ACGGCGACAA CTGGTCCGAG GACAATAAAA AAGCCGTCGA ACTTGGATTG
AAACTCTGTG ATGTGTGCAA CCTGTTTGGA TACGGTGAAA TAGTGCCGGG TTATTATTCC
ACCGGAAGTA CTATAAAAGA CGAGTTTCAA AAAAGCATTA AAAGAGACAA TTTTTCTGTC
ATAACAATAA CCAATAAAGA TGATGTGCTT CCAGGATTGA AGAAACTCCT GGAAAAGGAA
GGAGAATAA
 
Protein sequence
MEGLNPLIPQ KSNRGVIDVA IFREYVSGGK DRAAEDRRRH RELVEESIKK NIGNIIAEES 
IIGQSKDKKI KIPIRSIKEY QFVYGKNGPS VGSGDGTEKR GEKIGEEKTA NGGQGVGQAG
NQEGEEIYET EITIEELINY LFDDLNLPDI DKKRIAEQES IRSYKNLGYQ RKGIPPRLAK
KRSVIEKIKR KQAYLRNSRE LGDLDESAEE DIAAQETLDG VRKRFPFSED DLRYRRVRED
RKKDFNAVVI CIMDVSGSMD QTKKYLARSF YFLLYQFIRL KYANVDVVFI AHTTTAKEVS
EDEFFHRGES GGTYISSGYE KALEIIEQRY NPNSWNIYAF HCSDGDNWSE DNKKAVELGL
KLCDVCNLFG YGEIVPGYYS TGSTIKDEFQ KSIKRDNFSV ITITNKDDVL PGLKKLLEKE
GE