Gene Cthe_3186 details

Gene Information

Locus tag: Cthe_3186
Symbol:
ID: 4809637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3766432
End bp: 3768183
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 41%
IMG OID: 640108620
Product: radical SAM family protein
Protein accession: YP_001039574
Protein GI: 125975664
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
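
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The function name `check_cds_lengths` is illustrative and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Compare replicon coordinates with the reported gene and protein lengths."""
    # Assumption: coordinates are 1-based and inclusive.
    span = end_bp - start_bp + 1
    codons, remainder = divmod(span, 3)
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # Assumption: the final codon is a stop codon and adds no residue.
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_cds_lengths(3766432, 3768183, 1752, 583)
print(result)
```

Under these assumptions the span is 1752 bp, which is 584 codons, or 583 residues plus one stop codon.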


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACAA TAATAATTGG AATTAATTCA AAATATATAC ATTCATCTTT GGCGGCATGG 
TACCTTAAGG CGTCCTGTGA CGAGCAGTGC GGGGAAGTTA AGGTCATGGA GTTTACAATA
AATGACAATT CGGAATATGT CCTTTCCCGT ATTTACGCTG AAGGGTGCGA TGTGGCTGCT
TTTTCCTGCT ATATCTGGAA CATTGGTTTT GTCCTTAAGC TGGCTGAAAA TTTGAAAATG
GTTCTGCCCG ATGTGAAGAT AATACTTGGC GGACCGGAAG TCTCCTTTGA CCCTCATGAT
ATTTTGCGCT CAAACCGGTT TGTGGACTAT GTGATGGCAG GAGAGGGGGA AGGGGCTTTC
GGGCTTTTGC TGCGCTCTTT TACCGATGAA GGAATAAATC CTGATGCCAT TGAAGGACTA
AGCTACAGAA AAAACGGAGA AATCCACGCT TCAACCTCAT TTCGCCTGGT AAAGGAGCTT
GACACCGTTC CTTCTCCTTA TACCCCGGAA ATGCTTGAAG CCATTGGCAA CAGAATAATA
TATTTCGAAT CCTCAAGGGG ATGTCCATTT TCGTGTTCCT ACTGTATTTC ATCAACCTTT
GAAGGTGTGA GATATTTTTC CATGGACAGG GTCAAATCGG ACCTTTTAAC GCTGATTGAT
GCCAGGGTAA AACTTGTGAA GTTTGTGGAC AGAACTTTCA ATTGCAACAG ACAAAGGGCA
AAGGAAATAT TTTCATTTAT TATTGAAAAC GCAAAAGAGA CGAGTTTTCA CTTTGAAGCG
GCGGCCGACC TTTTTGACGA CGAAATGTTT GACATACTGT CCCGGGCTCC AAAGGGCTTG
ATTCAGTTTG AAATTGGAAT ACAATCAACC AATGAGGCGG CCCTTGAGGC TATCAGGAGA
AAAACTGACC TGAAAAAAGT GTTTGAGAAC ATAAAAAAGC TTAAAGAGCT TGGAAACGTG
CATATACATG TGGATTTGAT AGCCGGACTT CCTTTTGAGG ATTATAATTC CTTTTTGAAT
TCTTTCAATG AAACCTATAA ACTTTATCCT CACCAGCTGC AGCTGGGATT TTTAAAGCTT
CTCAAAGGTT CCGGCATAAG GCGGGAATAT CAAAAGTACG GTTATAAATT CAGGCAATAT
CCTCCGTATG AGGTATTGTC CAATGCATAC TTGAGTTTTG GTGATATTAT TAAATTAAAA
AAGATTGAAG AACTACTCGA AAGGTATTAC AACTCGGGAA GGTTCCAAAG AACGCTGAAA
TATCTTATTG AAGGGTTTTT TCCTTCACCG GCCGCTTTTT TTGAAGAATT CTCAAAGTAT
TACGAGGCGG CGGGATGTTA CGACAGGTCC ATTTCCTCCA GGGAGCTTTA CACGATACTT
TTGGATTTTG CATCAAGCCT TGATATGGAA GTGAACCTGG TATTGCTTAA CGAAATTCTG
AAATTTGACT TTTTGGTTTC GGATAATACA AACAATCTTC CAAAAGGCTT GGAACGGCTG
TATATCGATA ATTTCGGCGA GAGATGCTTT GAATTTCTGA AGAACAGGGA AAATGTTGAG
AAGTTTTTGC CCGAGTTTTC GGACACGCCT GCCAAGAAAA TATACAACAA GGTTCATTTT
GAAGCATTCA GATTTAATGT TGCCGATGAG ACTTTGGCTC AGGATGAAGA TACTACTGTA
ATTTTGTTTG ACTATAGTCA AAAGGACAGC ATAACGGGCC ATTACAGGTT TTATAAGATA
CAGCTTCCCT GA
 
Protein sequence
MKTIIIGINS KYIHSSLAAW YLKASCDEQC GEVKVMEFTI NDNSEYVLSR IYAEGCDVAA 
FSCYIWNIGF VLKLAENLKM VLPDVKIILG GPEVSFDPHD ILRSNRFVDY VMAGEGEGAF
GLLLRSFTDE GINPDAIEGL SYRKNGEIHA STSFRLVKEL DTVPSPYTPE MLEAIGNRII
YFESSRGCPF SCSYCISSTF EGVRYFSMDR VKSDLLTLID ARVKLVKFVD RTFNCNRQRA
KEIFSFIIEN AKETSFHFEA AADLFDDEMF DILSRAPKGL IQFEIGIQST NEAALEAIRR
KTDLKKVFEN IKKLKELGNV HIHVDLIAGL PFEDYNSFLN SFNETYKLYP HQLQLGFLKL
LKGSGIRREY QKYGYKFRQY PPYEVLSNAY LSFGDIIKLK KIEELLERYY NSGRFQRTLK
YLIEGFFPSP AAFFEEFSKY YEAAGCYDRS ISSRELYTIL LDFASSLDME VNLVLLNEIL
KFDFLVSDNT NNLPKGLERL YIDNFGERCF EFLKNRENVE KFLPEFSDTP AKKIYNKVHF
EAFRFNVADE TLAQDEDTTV ILFDYSQKDS ITGHYRFYKI QLP
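
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate whole codons from position 0; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First and last lines of the gene sequence as printed above.
first_line = clean("ATGAAGACAA TAATAATTGG AATTAATTCA AAATATATAC ATTCATCTTT GGCGGCATGG")
last_line = clean("CAGCTTCCCT GA")

print(translate(first_line))  # MKTIIIGINSKYIHSSLAAW
print(translate(last_line))   # QLP*
print(round(gc_fraction(first_line), 2))
```

The two translated fragments match the first twenty and last three residues of the protein sequence. Running `gc_fraction` on the full cleaned gene sequence would give the value to compare with the reported GC content.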