Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1946 |
Symbol | |
ID | 4810729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2321366 |
End bp | 2322649 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107362 |
Product | FAD-dependent pyridine nucleotide-disulphide oxidoreductase |
Protein accession | YP_001038357 |
Protein GI | 125974447 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0492] Thioredoxin reductase [COG3118] Thioredoxin domain-containing protein |
TIGRFAM ID | [TIGR01292] thioredoxin-disulfide reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000621838 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGACG CAATAGTTAA ATCAACGATG GAGAACTTTG ACAAAGATGT GTTAAAAAGT GACATTCCTG TGGCTGTTTT GTTTTATACT GAAAGCTGTC CTGTTTGTGA CGCTTTTATG CCCATATTTG AGAGGACGGC TCAAAAATAC GGCAAATACA TGAAATTTGT TAAGATTTAC CGGCAGCAGA ACCGTCAACT GGTGGAAGAC CTTAAGATAA AGAGCAGTCC GACGGTACTC TTTTATAAAG AAGGCAATGA GGTTTGCACC CGGTTGAACG GCTACATAAG CAACGCAGAG TTTGTGGAAG CCATAGAGAG GGTTATTGGA GATGTTTGTA AAGGAGAGAA GAGGGAAAAG GTACATTGTG ATTTTCTGAT ATTGGGCGGT GGCCCGGCCG GCTTGACTGC TGCGATTTAC GCGGCCAGGG CAAAGCTTCA TACAGTGGTT GTGGATGAAG GACTCATTGG AGGGCAGGTG GCTACAACCT TCCAGGTCGC AAACTACCCC GGTACAAATG GTGTTGTAAG GGGTATTGAC CTGATGGAAA ACATGAAAAA GCAGGCGCTG GACTTCGGTG CATATATTGA CGACCTCAAA GAGATTTCCG ATGTAAATCT GGAGGGAAAG GAAAAACTTG TAACCGCAAA GGATACCGAC TATTATGCAA AAGCCGTGCT GATAGCAACC GGAGCAACTC CAAGAAGGCT TCCGGCCGAA GGTGAAAAAG AGTTTAGAGG AAGAGGTGTG CATTATTGTG CCACATGCGA CGGTGCCATG TACTTTGATG CCAACATCCT TGTGGTGGGA GGAGGAGAGT CCGCGGCGGA AGAAGCTGTT TTTTTGACTA GATATGCAAA GCATGTTACA ATAATAAACA GGCATGATTA TTTGAAAGCT TCAAAAACTG CCCAGGATGA GGTGTTCAGG AACCCGAACA TCAGTGTTGT ATGGAATTCT GAAGTACGAA AGATTAACGG TGACAGTTTC GTAAAAAGTG TTACAATAGA AAACCTTAAA ACAGGGAAAA TTGAGGAAAT AGAGACTGAC GGGCTGTTTG TCTATATTGG CACGCAGCCA AAAACGGAGC TTTTTGCCGG CAAGGTCGGT ATGAATGAAG AGGGATATAT TCTGACGAAC GAGGATATGG CGACGAACAT TCCGGGAGTT TTTGCCGCCG GAGACGTCCG GGCCAAAAAA GTCCGGCAGA TTGCCACTGC TGTCGGAGAC GGCGCAGTAG CAGGAATAAT GGCAGAAAGA TATATTAACG GAAAATTCTA TTAA
|
Protein sequence | MNDAIVKSTM ENFDKDVLKS DIPVAVLFYT ESCPVCDAFM PIFERTAQKY GKYMKFVKIY RQQNRQLVED LKIKSSPTVL FYKEGNEVCT RLNGYISNAE FVEAIERVIG DVCKGEKREK VHCDFLILGG GPAGLTAAIY AARAKLHTVV VDEGLIGGQV ATTFQVANYP GTNGVVRGID LMENMKKQAL DFGAYIDDLK EISDVNLEGK EKLVTAKDTD YYAKAVLIAT GATPRRLPAE GEKEFRGRGV HYCATCDGAM YFDANILVVG GGESAAEEAV FLTRYAKHVT IINRHDYLKA SKTAQDEVFR NPNISVVWNS EVRKINGDSF VKSVTIENLK TGKIEEIETD GLFVYIGTQP KTELFAGKVG MNEEGYILTN EDMATNIPGV FAAGDVRAKK VRQIATAVGD GAVAGIMAER YINGKFY
|
| |