Gene Cthe_1151 details

Gene Information

Locus tag: Cthe_1151
Symbol:
ID: 4810819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1368803
End bp: 1369870
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 38%
IMG OID: 640106573
Product: hypothetical protein
Protein accession: YP_001037576
Protein GI: 125973666
COG category:
COG ID:
TIGRFAM ID:
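
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1368803
end_bp = 1369870

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1068
print(protein_length)  # 355
```

Both results agree with the Gene Length (1068 bp) and Protein Length (355 aa) fields.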


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000920746
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGAAT ATGTGTTCAG GAATGTGAGT CAAATAGGCG AAAAGGAGTA CCAGACAGTA 
ATTGATGGAC AGGAATTAGC AGAGATGTGG AGGGATGGGA TAATAACATA TAACCCAGAA
ATCCAGAGAG GAACAAAGGT AAAAAGGGGA AAGGACAATA GTGAAGTTGA GGTTGCTGTG
TACAATAAAG CCAATGTGAA AAAAATCTAT ACTTCTATGC TTTCAGGCCA ATATTTTACG
GATATGATTA CATTAAATGT TCTTGAAGAT GGCAATGAAA AAGTCGAACT TGATGATGAG
GGTAATCTTG CTGTAGATGG CGAAATAAAC ATTGCAGATG GGCAGCACAG AATTAGGGCT
TTAAGCATGA TTCTTGAAGG AAACGAAAAA GGGGATACAT TCTTCGATTT AACCGAACTA
AAATTCCCTG TAAAAGTTAC CCACTATAAT GTCCAGACCG CACAGCAGCA ATTCCACCAG
TTCTCTCTAG GGCTGAAAAT CAGTTCAAGC CGTGCGGAAT ATTTTAATCA AACGGGCCTT
GCAAACATTA TTGTTAGAGA ACTTATGAAA AACAGCGACC TGGCTGGCAG GGTAGAAGTG
GTGAGAAATT CCATATCAAA GAACGACGAA CGACACATTG TAACCTTTGC TACTCTTGTA
AATGCCATAG AGATAGTTTA CAAGGATTTA GAAACAAGGG TTCAGGCGAT GGAATTATCT
AAATACCTTG CAGAATTTTT CAATGAACTG ATAAACCTTA TTCCCGAGTT GCACAACTAT
GAAAAGAGGG CGCAAAGCAA GGAAACATCG CTAATAGGGG AAAATTTCAT GTTCTACGGA
TATGTGGCCA TAAGCAAAGT TCTAAGGGAT AAGGAGAATT GGAAGGAGTA TTTGCCATTA
ATTAATGAAC TAGATTTATC AAAAGGCTCT AAGCAGTGGT ACGGAGATGT TATTAAAAGA
GGAAAGGAAA AAGGATATAC TATCGTAAAT AACAATGAAA GCAGAAAAAC ATTTGTTAAT
AAGATTGAAA GAATGTTTAA AAAGTTATTA AACGAAAAAA CAGCGTGA
 
Protein sequence
MAEYVFRNVS QIGEKEYQTV IDGQELAEMW RDGIITYNPE IQRGTKVKRG KDNSEVEVAV 
YNKANVKKIY TSMLSGQYFT DMITLNVLED GNEKVELDDE GNLAVDGEIN IADGQHRIRA
LSMILEGNEK GDTFFDLTEL KFPVKVTHYN VQTAQQQFHQ FSLGLKISSS RAEYFNQTGL
ANIIVRELMK NSDLAGRVEV VRNSISKNDE RHIVTFATLV NAIEIVYKDL ETRVQAMELS
KYLAEFFNEL INLIPELHNY EKRAQSKETS LIGENFMFYG YVAISKVLRD KENWKEYLPL
INELDLSKGS KQWYGDVIKR GKEKGYTIVN NNESRKTFVN KIERMFKKLL NEKTA
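
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names `clean`, `translate` and `gc_content` are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acids to elongation codons as the standard table, and it assumes the sequence is already in the coding orientation.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
first_block = "ATGGCGGAAT ATGTGTTCAG GAATGTGAGT"
print(translate(first_block))  # MAEYVFRNVS
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the Protein sequence and GC content fields to be checked against the record.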