Gene Cthe_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1946 
Symbol 
ID4810729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2321366 
End bp2322649 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content45% 
IMG OID640107362 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_001038357 
Protein GI125974447 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase
[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000621838 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGACG CAATAGTTAA ATCAACGATG GAGAACTTTG ACAAAGATGT GTTAAAAAGT 
GACATTCCTG TGGCTGTTTT GTTTTATACT GAAAGCTGTC CTGTTTGTGA CGCTTTTATG
CCCATATTTG AGAGGACGGC TCAAAAATAC GGCAAATACA TGAAATTTGT TAAGATTTAC
CGGCAGCAGA ACCGTCAACT GGTGGAAGAC CTTAAGATAA AGAGCAGTCC GACGGTACTC
TTTTATAAAG AAGGCAATGA GGTTTGCACC CGGTTGAACG GCTACATAAG CAACGCAGAG
TTTGTGGAAG CCATAGAGAG GGTTATTGGA GATGTTTGTA AAGGAGAGAA GAGGGAAAAG
GTACATTGTG ATTTTCTGAT ATTGGGCGGT GGCCCGGCCG GCTTGACTGC TGCGATTTAC
GCGGCCAGGG CAAAGCTTCA TACAGTGGTT GTGGATGAAG GACTCATTGG AGGGCAGGTG
GCTACAACCT TCCAGGTCGC AAACTACCCC GGTACAAATG GTGTTGTAAG GGGTATTGAC
CTGATGGAAA ACATGAAAAA GCAGGCGCTG GACTTCGGTG CATATATTGA CGACCTCAAA
GAGATTTCCG ATGTAAATCT GGAGGGAAAG GAAAAACTTG TAACCGCAAA GGATACCGAC
TATTATGCAA AAGCCGTGCT GATAGCAACC GGAGCAACTC CAAGAAGGCT TCCGGCCGAA
GGTGAAAAAG AGTTTAGAGG AAGAGGTGTG CATTATTGTG CCACATGCGA CGGTGCCATG
TACTTTGATG CCAACATCCT TGTGGTGGGA GGAGGAGAGT CCGCGGCGGA AGAAGCTGTT
TTTTTGACTA GATATGCAAA GCATGTTACA ATAATAAACA GGCATGATTA TTTGAAAGCT
TCAAAAACTG CCCAGGATGA GGTGTTCAGG AACCCGAACA TCAGTGTTGT ATGGAATTCT
GAAGTACGAA AGATTAACGG TGACAGTTTC GTAAAAAGTG TTACAATAGA AAACCTTAAA
ACAGGGAAAA TTGAGGAAAT AGAGACTGAC GGGCTGTTTG TCTATATTGG CACGCAGCCA
AAAACGGAGC TTTTTGCCGG CAAGGTCGGT ATGAATGAAG AGGGATATAT TCTGACGAAC
GAGGATATGG CGACGAACAT TCCGGGAGTT TTTGCCGCCG GAGACGTCCG GGCCAAAAAA
GTCCGGCAGA TTGCCACTGC TGTCGGAGAC GGCGCAGTAG CAGGAATAAT GGCAGAAAGA
TATATTAACG GAAAATTCTA TTAA
 
Protein sequence
MNDAIVKSTM ENFDKDVLKS DIPVAVLFYT ESCPVCDAFM PIFERTAQKY GKYMKFVKIY 
RQQNRQLVED LKIKSSPTVL FYKEGNEVCT RLNGYISNAE FVEAIERVIG DVCKGEKREK
VHCDFLILGG GPAGLTAAIY AARAKLHTVV VDEGLIGGQV ATTFQVANYP GTNGVVRGID
LMENMKKQAL DFGAYIDDLK EISDVNLEGK EKLVTAKDTD YYAKAVLIAT GATPRRLPAE
GEKEFRGRGV HYCATCDGAM YFDANILVVG GGESAAEEAV FLTRYAKHVT IINRHDYLKA
SKTAQDEVFR NPNISVVWNS EVRKINGDSF VKSVTIENLK TGKIEEIETD GLFVYIGTQP
KTELFAGKVG MNEEGYILTN EDMATNIPGV FAAGDVRAKK VRQIATAVGD GAVAGIMAER
YINGKFY