Gene Cthe_2991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2991 
Symbol 
ID4811139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3510336 
End bp3512273 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content45% 
IMG OID640108412 
ProductNADH:flavin oxidoreductase/NADH oxidase 
Protein accessionYP_001039380 
Protein GI125975470 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase
[COG1902] NADH:flavin oxidoreductases, Old Yellow Enzyme family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTATG ACAAACTTTT TGAGCCGGGA TATATCGGAA AAGTAAAGAT TAAAAACAGA 
CTGGTAATGT CTCCGATGAA TACCCATTTT TCCATAGGAG ACCCTGCAGT ACTTTCCGAA
AGGTATTTTG AATATTACAA AGCACGGGCA AGAGGAGGAG TGGGACTTAT AATTACAACC
CATGTAAAGG CGGAAAAAAA CATTGACCCG TATCCTCTTA CCTATGGCTA TGCCACTTTT
GATTCTGTAA GCCAGATAAA GTATTTCAAT GAAATAACCG AAATGGCTCA CAGATATGAT
GCAAAGATTG CAATTGAACT GTCTCCGGGT ACCGGAAGAC TGGCCGATGC AACGTTAAAG
GACAAATGGC CTGTCGGGCC TTCGGAAATT GAGATACTGG GTATGCCGGG AGTTAAAACA
CGGGCGCTTA CCAAGGATGA GATACACGGA CTTGTGGAGG CTTATGGGAA AGCGGCGGGA
TTGGCAAAGC AGGCGGGTTT TGACATAATT TATGTTCACT TTACCGCTTA TCTCGGAGAC
CAATTCCTTT CTTCGGCCTG GAATCACAGA ACGGATGAAT ATGGTGGAAG TCTTGAAAAC
CGAATGCGGT TTTTGCTCGA ATGCATTGAG AGTGCACGAA ACAATGTGGG AAGCGATTTT
CCCATGATTG TGGGATTGGC GTTGGATCAT GGATTCCCCG GAGGAAGGGA GCTTGACGAA
ACAATAGAAA TTGCAAAAAG GCTCAAACAG ATAGGCATAG ATACATTGCA TCTCAGACGT
GGAAGCTATG ACAACATGAA TCTTCTTATA CCTACCGAAT ATATGGAGGA CGCCGTTTCT
GTCGACTATG CGGCCAAAGT CAGAGAACAG GCAGGTATAC AGGTGATTTC CGATGGAAAC
ATTTCAGATC CGGTCCTTGC GAATAAACTG ATGGAAGAAA ACAAGCTTGA CTTTGTAGGC
CTTGGAAGAG CTCTTTTGGC CGACCCGGAG TGGGTGAACA AGGTACGGGC TGACAAGAAA
GAGGATATAG TACCTTGTGT GCGGTGCATG CAGTGCATTA ACAGAATATT TTTTGGGCAA
TATGCCGCAT GCAGCGTAAA TCCTGTTCTT GGAAAAGAGT ACTTAAGCCC GATACTGCCT
GCAAAGAAGC CTAAGAAAGT GCTTGTAATA GGCGGAGGGA TGGCAGGTAT GGCATTTGCA
AAGATGGCAG AAGAAAAAGG GCATGACGTT ACATTGCTTG AGAGCACTTC AGAGCTTGGA
GGACACTTGC TTGAAGGTGC GGTGATGGAT CACAAGAAGG AAGTTGACGC ATACTGCAGG
CATTTGGTAA GGGAGATTAA AAACTCCGGT GTGAAAGTAA AGTACAACAC CAGGGCTACA
AAAGAATTGG TGAAGGAGCT CAATCCGGAT GCGGTTGTGG TGGCAACAGG TTCCGTGCCT
GTAATTCCTG ATGTTCCGGG CATTGACAGA CCCAATGTGA GGATAGCCAC CAAGCTTCTT
AAAGAAGGGC AGGACACCGG GCAGAATGTG ATTATCGTCG GCGGAGGTTT GGTGGGCTGC
GAAACGGGAT TGCACCTTGC AGAAAAGGGA AAGAAAGTAA CCATAATAGA TATGCTTCCG
GAAGTGGCTC AGGATGTTAT TTTCATGGCG AGATTTTCCC TGCTTGAGGC ACTTAAGAAT
AAAGGGATAG AAACCTATGG AGGGCTTAAA CTGACAGAAA TAACAGAGTC GGGTATCGTT
GTTGAGGATT CCAATGGAGA TAAAAAGGAG ATGGCTTGCG ACACTGTGGT AATTGCTGTG
GGATTAAAGG CGGATGACAC TTTGTACAAT GAGCTTGTAA ATGAGTTTGA TGAAGTGTAT
CGAATTGGCG ACTGCATCAA GGCAAGAAAG TTTATTGATG CAATCCAGGA AGCCTTCCAG
GTGGCGGTGG ATATATAA
 
Protein sequence
MAYDKLFEPG YIGKVKIKNR LVMSPMNTHF SIGDPAVLSE RYFEYYKARA RGGVGLIITT 
HVKAEKNIDP YPLTYGYATF DSVSQIKYFN EITEMAHRYD AKIAIELSPG TGRLADATLK
DKWPVGPSEI EILGMPGVKT RALTKDEIHG LVEAYGKAAG LAKQAGFDII YVHFTAYLGD
QFLSSAWNHR TDEYGGSLEN RMRFLLECIE SARNNVGSDF PMIVGLALDH GFPGGRELDE
TIEIAKRLKQ IGIDTLHLRR GSYDNMNLLI PTEYMEDAVS VDYAAKVREQ AGIQVISDGN
ISDPVLANKL MEENKLDFVG LGRALLADPE WVNKVRADKK EDIVPCVRCM QCINRIFFGQ
YAACSVNPVL GKEYLSPILP AKKPKKVLVI GGGMAGMAFA KMAEEKGHDV TLLESTSELG
GHLLEGAVMD HKKEVDAYCR HLVREIKNSG VKVKYNTRAT KELVKELNPD AVVVATGSVP
VIPDVPGIDR PNVRIATKLL KEGQDTGQNV IIVGGGLVGC ETGLHLAEKG KKVTIIDMLP
EVAQDVIFMA RFSLLEALKN KGIETYGGLK LTEITESGIV VEDSNGDKKE MACDTVVIAV
GLKADDTLYN ELVNEFDEVY RIGDCIKARK FIDAIQEAFQ VAVDI