Gene Cthe_0156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0156 
Symbol 
ID4808644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp195093 
End bp196934 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content43% 
IMG OID640105567 
Productradical SAM family protein 
Protein accessionYP_001036590 
Protein GI125972680 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGTATTA GAGTAAGTGA CAGAATATTA CAGAGTGTGG AAAAACCATC AAGATATACG 
GGCAATGAAT GGAACAGCGT AAAAAAAGAT TTAAAGGGAA TAGATATAAG ATTTGCTTTC
TGTTTCCCTG ATGTTTATGA AGTTGGGATG TCTCATCTTG GCATGAAGAT TTTGTATCAC
CTTCTCAACG AGAGGGAGGA TACTTACTGT GAAAGAGTTT TTGCTCCATG GGTTGACATG
GAAGCAAAGA TGAGAGAGCA CAACATACCT CTTTTTGCCC TTGAGACCCA TGACCCCATA
AGGGAATTTG ATTTTATAGG TTTTACTCTT CAGTATGAGA TGAGTTATAC AAACATAATA
AATATGCTTG ACCTTGCGGG GGTGCCTGTT TTAAGCGGTG AGAGGACGAA AGAGCATCCC
TTTGTCTGTG CCGGCGGTCC TTGTGCATAC AATCCGGAGC CTTTGGCAGA CTTTATAGAC
TTTTTTATGA TGGGTGAAGG CGAGGAAATA ATCAACGAAG TGATGGATGT GTATGTACAA
TGGAAGAAGA AAAATTTGCC AAGGGAAGAG TTTTTGCGCT GCATATCGTC AATTGAGGGA
GTGTATGTCC CTCAATTCTA TGATGTAAAA TACAACGACG ATGGCACCAT AAGCTCTTTT
TTGCCGATAA GGGATGAGTA TCCCAAAAAA ATAAGAAAAA GGATTATAAA GGACCTGGAC
AAGGTCTTTT TCCCTGAAAA AATAGTAGTT CCCTTTACGG GAATCGTTCA TGACAGGATA
ATGGTTGAGT TGTTCCGGGG CTGTATCAGG GGATGCAGAT TCTGTCAGGC AGGTTTTATT
TACAGGCCTG TAAGGGAAAG ATCGGCGGAC AGGCTTTTGG AGATATCCCG AAAGCTTGAG
GAAAGCACGG GTTATGAGGA GATTTCACTT ACTTCCTTAA GTACCAGTGA CTATACGGCG
CTGAAAGAAC TAACCGACGG ACTGATTTGT GAGATGGAGC CGAAAAAAGT GAATCTTTCG
CTTCCGTCTC TGAGGGTGGA TTCCTTTTCT CTTGAACTTA TGGAAAAGGC CCAGAAAGTT
CGAAAAAGCG GTCTTACTTT TGCACCGGAA GCGGGTACCC AGAGGCTTCG CAATGTTATA
AACAAGGGTG TAACCGAAGA AGACCTCATA AAATCTGTTT CTCTGGCTTT TGAAGGCGGC
TGGAGCGGAG TAAAGCTTTA CTTTATGCTG GGGCTTCCGA CGGAAAGCTA TGAAGATATT
GAGGGTATAG CGGAACTTGG ACATAAAGTT GTTGAAGCAT ATAAAAATAC GCCAAAAGAC
AAAAGGGGCA AAGGACTTAG TGTCACTATC AGCACATCGT CCTTTGTTCC AAAGCCTTTT
ACGCCTTTTC AGTGGGAGCC GCAGGACAGT ATCGAGACTT TGAGGGAAAA ACAGATTTTC
CTGAAAAGCA AAATAAAAAG CAAGAGCATC AAGTACAACT GGCATGACCC TGAATTGAGC
TTTTTGGAGG CAATTTTTGC CCGCGGAGAC AGAAAACTGG GTAAAGTGCT GCTTAAGGCT
TTTGAGAAAG GCTGCAAGTT TGACAGTTGG GGAGAGCACT TCAAATTTGA CAAATGGATG
GAGGCTTTCC GTGAATGCGG AATTGACCCT TCATTCTATG CCAACAGGAA AAGGTCATAT
GGTGAGATTT TGCCTTGGGA TCATATTGAT GTGGGAGTGT CGAAGAAATT TTTGGAAAGA
GAACATGAAA AGGCATTAAA AGAAGAAGTT ACTCCAAATT GCAGAGCAAA CTGTTCCGGA
TGCGGAGCCA CCGTGTTTGA GGGGGGAATT TGTGTTGAGT AG
 
Protein sequence
MGIRVSDRIL QSVEKPSRYT GNEWNSVKKD LKGIDIRFAF CFPDVYEVGM SHLGMKILYH 
LLNEREDTYC ERVFAPWVDM EAKMREHNIP LFALETHDPI REFDFIGFTL QYEMSYTNII
NMLDLAGVPV LSGERTKEHP FVCAGGPCAY NPEPLADFID FFMMGEGEEI INEVMDVYVQ
WKKKNLPREE FLRCISSIEG VYVPQFYDVK YNDDGTISSF LPIRDEYPKK IRKRIIKDLD
KVFFPEKIVV PFTGIVHDRI MVELFRGCIR GCRFCQAGFI YRPVRERSAD RLLEISRKLE
ESTGYEEISL TSLSTSDYTA LKELTDGLIC EMEPKKVNLS LPSLRVDSFS LELMEKAQKV
RKSGLTFAPE AGTQRLRNVI NKGVTEEDLI KSVSLAFEGG WSGVKLYFML GLPTESYEDI
EGIAELGHKV VEAYKNTPKD KRGKGLSVTI STSSFVPKPF TPFQWEPQDS IETLREKQIF
LKSKIKSKSI KYNWHDPELS FLEAIFARGD RKLGKVLLKA FEKGCKFDSW GEHFKFDKWM
EAFRECGIDP SFYANRKRSY GEILPWDHID VGVSKKFLER EHEKALKEEV TPNCRANCSG
CGATVFEGGI CVE