Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2541 |
Symbol | |
ID | 4809297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3009705 |
End bp | 3010577 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107957 |
Product | nitrite and sulphite reductase 4Fe-4S region |
Protein accession | YP_001038936 |
Protein GI | 125975026 |
COG category | [C] Energy production and conversion |
COG ID | [COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000365571 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAAATG TTGATTATAA AGAACTGAAA AAAGGCGGAT TCATGAAGCA GGTTGACAAG GACCGCTTTT CACTGCGCCT CAGAATCGTC GGAGGTCAGA TAAAAGCAGA GCAGCTTGCA AAGGTGAATG AGATAGCGCA AAAATACGGT GCAGGCTATA TCCATTTGAC TTCAAGACAG AGTATCGAAA TTCCGTATAT TAAGCTTCAG GATATTGATG CTGTCAAGGA GGAGCTTGCC AAGGCAGGGC TTCAGCCCGG TGCATGCGGA CCCAGGGTTA GAACGATTAC CGCATGTCAG GGGGCTGTTA TTTGTCCCAG CGGGCTTATC AACACAACGG AGCTTGCCAG GGAATTTGAT GAAAGGTATT ATGCCCGTGA ATTGCCCCAT AAGTTTAAGT TGGGTATTAC CGGCTGCAGA AATAATTGCT TGAAGGCGGA GGAAAACGAC CTTGGGGTTA AGGGCGGAAT GATGCCAAGT TGGGTTAAGG ATAAATGTAT TTATTGCGGA TTGTGTCAGG CAGTTTGCCC GGCAAAGGTC ATTGAGGTGA AAAAGCAGGA AAAAGAGCTG ACATTCAATG AAAAGGATTG CATCTATTGC GGCAAATGCG TCAAGGTGTG CCCTACAAGT GCATGGGAAG GCAGAGGCGG GTTTATCGTG TATTTTGGCG GATTGTTCGG CAACAGAATA GCAGTCGGAA AGCAGCTTTT GCCTATTATT TTCTCAAAAG AGGATTTGCA TAAGGTTATT GAAGCAACTT TGGCATTTTT TGAGGAGCAT GGAAAGCCCG GTGAAAGATT TGGCAATACC TTGGACAGAG TAGGCTGGGA TTTGCTTAAA AACAGGCTTG AAGAAGTATT GAAAGCGGGA TGA
|
Protein sequence | MANVDYKELK KGGFMKQVDK DRFSLRLRIV GGQIKAEQLA KVNEIAQKYG AGYIHLTSRQ SIEIPYIKLQ DIDAVKEELA KAGLQPGACG PRVRTITACQ GAVICPSGLI NTTELAREFD ERYYARELPH KFKLGITGCR NNCLKAEEND LGVKGGMMPS WVKDKCIYCG LCQAVCPAKV IEVKKQEKEL TFNEKDCIYC GKCVKVCPTS AWEGRGGFIV YFGGLFGNRI AVGKQLLPII FSKEDLHKVI EATLAFFEEH GKPGERFGNT LDRVGWDLLK NRLEEVLKAG
|
| |