Gene Cthe_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1094 
Symbol 
ID4811392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1300520 
End bp1302439 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content46% 
IMG OID640106516 
Producthypothetical protein 
Protein accessionYP_001037519 
Protein GI125973609 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0145194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTT TACCTATAAC AAGGGAAGAT ATGAAAAACA GAGGATGGGA TGAACTGGAT 
TTTCTTTATA TCAGCGGAGA TGCGTATGTG GATCATCCCA GCTTTGGGCA TGCCATAATA
ACGAGACTTT TGGAAAGTGA AGGCTACAGG GTGGGGATTG TCGCCCAGCC GGACTGGAGA
AAGGACGATG ACTTTCTGGC ATTGGGAAAG CCACGTTTGG CCGTGCTTAT ATCATCGGGA
GTAATAGACT CCATGGTAAA CCATTATACT GCGAGTAAAA AGCCAAGAAG CGATGATTTG
TACAGTCCGG GAGGAAAAAG CCACAGACGG CCGGACAGGG CGGTGATTGT ATATACCAAC
AAAGCGCGGC AGCTTTTCAG GGATGTGCCG GTGATTATCG GCGGGATTGA GGCAAGCCTG
AGGAGATTTG CCCATTATGA TTATTGGGAT GACAGGGTCA GACGTTCCAT TCTAGTTGAC
TCGAAAGCTG ACCTTTTAAT TTACGGAATG GGAGAAAAAC CGATACTTGA GATTGCCCGG
TATCTTTCCA TGGGAGTGCC GATAAAGAAG ATCCAGAATG TAAGGGGAAC CGCTTTTCTG
GCAAGAAAAG AGGACTTGCA TGGAGAGTTG AGAAAATTTA TTGATAATTC GGAAGACAAG
CCGGAAAAAG GTTATATTCT GCTTCCGTCA TTTGAAGAGG TGTCCACGAG CAAAAGAAAA
TATGCCGAGG CTTTTATGAT TCAGTACAAT GAGCAGGACC CTTACACCGG AAGCGTTCTT
GTGCAGCCTC ACGGTGACAG GTTTGTGGCT CAGAATCCGC CGGCTTATCC CCTTTCCGAA
AAGGAGATGG ACAGGATATA TTCTCTTCCG TATGAAAGGA CTTATCATCC TGTCTATGAC
AAAGACGGCG GAGTTCCTGC CATAGAGGAG GTACAGTTCA GCATAACAAG CCACAGAGGC
TGTTATGGCG GTTGTTCCTT TTGCGCGTTG AATTTCCACC AGGGCAGGAT AATTCAAAAA
CGCAGCCAGG CTTCAATAAT AAATGAGGCA AGAAAGCTTA CATGGCTTCC GGGCTTTAAA
GGCTATATTC ACGATGTGGG AGGACCCACG GCCAACTTTA GGAACAAGGC CTGCAAAAAG
CAGGAAATTT CCGGTGCGTG CAAGGAAAGG CAATGCCTTT ACCCCAAGCC TTGCAAAAAC
CTTATAGTTG ACCACAGCGA ATACCTGGAG CTTTTAAGAA AGCTTCGGGA AATACCGGAA
ATAAAAAAGG TTTTTATTCG TTCGGGTATA AGATATGATT ATCTGATGCT GGATAAAAAC
GACGATTTCT TTGTCGAACT TTGCCGGCAT CATGTCAGCG GGCAGCTTAA AGTTGCGCCG
GAGCATGTGG TGGACCGGGT GCTTGAGAAG ATGGGAAAGC CCCAAAGGGA GGTGTATGAC
AGATTCGTCA AAAAGTTTTA TGAGATAAAC AGAAAAATAG GCAAGGAACA GTACCTTGTT
CCCTATTTGA TTTCAAGTCA TCCGGGAAGC GACCTTAATG CGGCGATAGA GCTTGCCGAG
TACCTGAGGG ATATAAATTA CACGCCTCAG CAGGTACAGG ATTTTTATCC CACGCCCGGG
ACATTGTCCA CCTGCATGTT TTATACCGGG CTGGACCCAA GGACGATGAA AAAGGTGTAT
GTTCCAAGGT CGCCGAAGGA AAAGGCAATG CAAAGGGCTC TCTTGCAATT TAGAAGGAAG
GAAAACTACA AGCTGGTGTA TGAGGCTTTA AAACTTGCCC ACAGAGAGGA TTTAATCGGT
TACGGCAGGA AATGCCTCAT AAAGCCTCCG GCTAATCTTT CAAAAAACAA TTTGAAAAAA
GACAGCTCAA AAAGAAAATT AAAAAAAGCC GGAAAAAGCA GAAGAAAGAG CTCAAGATAA
 
Protein sequence
MAFLPITRED MKNRGWDELD FLYISGDAYV DHPSFGHAII TRLLESEGYR VGIVAQPDWR 
KDDDFLALGK PRLAVLISSG VIDSMVNHYT ASKKPRSDDL YSPGGKSHRR PDRAVIVYTN
KARQLFRDVP VIIGGIEASL RRFAHYDYWD DRVRRSILVD SKADLLIYGM GEKPILEIAR
YLSMGVPIKK IQNVRGTAFL ARKEDLHGEL RKFIDNSEDK PEKGYILLPS FEEVSTSKRK
YAEAFMIQYN EQDPYTGSVL VQPHGDRFVA QNPPAYPLSE KEMDRIYSLP YERTYHPVYD
KDGGVPAIEE VQFSITSHRG CYGGCSFCAL NFHQGRIIQK RSQASIINEA RKLTWLPGFK
GYIHDVGGPT ANFRNKACKK QEISGACKER QCLYPKPCKN LIVDHSEYLE LLRKLREIPE
IKKVFIRSGI RYDYLMLDKN DDFFVELCRH HVSGQLKVAP EHVVDRVLEK MGKPQREVYD
RFVKKFYEIN RKIGKEQYLV PYLISSHPGS DLNAAIELAE YLRDINYTPQ QVQDFYPTPG
TLSTCMFYTG LDPRTMKKVY VPRSPKEKAM QRALLQFRRK ENYKLVYEAL KLAHREDLIG
YGRKCLIKPP ANLSKNNLKK DSSKRKLKKA GKSRRKSSR