Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0341 |
Symbol | |
ID | 4808490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 428410 |
End bp | 430203 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105755 |
Product | NADH dehydrogenase (quinone) |
Protein accession | YP_001036772 |
Protein GI | 125972862 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [COG3411] Ferredoxin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.024737 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTAT ATAGAGCACA TGTTCTGGTT TGTGGAGGTA CAGGCTGTAC TTCATCAAAC TCCAATAAGA TAATCACTGA ATTGGAAGAA CAAATTGCCA GAAACGGTAT ACAGAATGAA GTAAAAGTTG TAAGAACCGG ATGCTTTGGA CTTTGCGCCG AAGGCCCGAT TATGGTGGTT TATCCTGAAG GTGCAATGTA TACAATGGTC AAAGTTGAAG ATGTGAAAGA GATAGTTGAG GAGCATCTGG TAAAAGGAAG AATTGTAAAA AGACTTCTTC CAGGCGGCGG TGGCAAGGAT GAAAAAGCTT CCCTTGAAGG CAATGACTTC TTTGCAAAGC AGCAAAGAAT TGCGTTAAGA AACTGCGGTG TTATAAATCC TGAAAATATT GACGAATATA TTGCTTTTGA CGGATATAAG GCATTGGCAA AAGTATTGAC TGAAATGACT CCCGAACAAG TTGTTGATGT TATTAAAAGA TCCGGTTTGA GAGGAAGAGG GGGCGGAGGT TTCCCTACCG GACTTAAATG GGAGTTTGCA ATGAAACAGG ATGCAGACCA GAAATATGTT TGCTGTAATG CCGACGAGGG AGACCCGGGA GCATTCATGG ACAGAAGTGT CCTTGAAGGA GACCCTCATT CTGTAATTGA AGCTATGGCG ATTGCAGGTT ATGCCATAGG CGCAAATCAG GGATATGTTT ATGTAAGAGC GGAATACCCG ATTGCCGTAA AGAGGCTTGG CATTGCCATT CAGCAGGCAA GAGAATATGG ACTTTTGGGT AAAAACATAT TTGGAACGGA CTTTAGCTTT GATGTTGATA TAAGGCTTGG AGCAGGTGCT TTTGTGTGCG GTGAGGAAAC AGCTCTCATG ACGTCCATAG AGGGACACAG GGGAGAGCCA AGACCAAGGC CTCCGTTCCC TGCGGTAAAA GGTTTGTGGC AAAAGCCGAC TTTGCTGAAT AACGTTGAGA CTTATGCAAA CATTCCCCAG ATTATTCTGA AAGGTCCCGA ATGGTTTGCA AGCATTGGTA CTGAAAAGAG TAAAGGTACC AAAGTATTTG CGGTGGGCGG TAAAATTAAC AATACAGGTT TGGTAGAAAT TCCTATGGGT ACAACTTTGA GAGAGGTTAT TTATGACATA GGAGGAGGAA TACCGAACGG CAAGAAGTTC AAAGCAGCCC AGACCGGAGG TCCTTCCGGA GGATGTATTC CTGCAAGCCA TATTGATACG CCAATTGACT ATGATTCATT AACCCAGTTG GGCTCAATGA TGGGTTCCGG CGGACTTATA ATCATGGATG AAGACACATG TATGGTTGAC ATTGCAAAGT TCTTCCTTGA ATTTACTGTT GACGAATCCT GCGGTAAGTG TCCGCCGTGC CGTATAGGAA CAAAGAGAAT GTATGAGATT CTTGAAAGGA TTACTGAAGG CAAGGGAGAA GAAGGGGACA TTGAAAAGCT GGAGATGCTT GCAAAGAATA TAAAGGCATC GGCTTTGTGC GGACTGGGTC AGACAGCTCC GAATCCTATT CTCAGTACTT TAAGATACTT CAGGCATGAA TATATTGAGC ACGTAAGGGA TAAGAAGTGT GCCGCAGGGG TTTGCAAGGC GTTGATGCAT TATGAAATAG ATGCTGAGAA ATGTAAGAGC TGCGGAATAT GCGCAAGACA ATGTCCTGTA AAAGCAATAA GCGGAGAAAA GAAGGTTCCG TACGTTATAG ATCAGAACAA ATGTATCAAA TGTGGCGTTT GTATGGAGAA ATGTCCGTTC AAGGCCATTT CCAAAAAAGC CTAA
|
Protein sequence | MQLYRAHVLV CGGTGCTSSN SNKIITELEE QIARNGIQNE VKVVRTGCFG LCAEGPIMVV YPEGAMYTMV KVEDVKEIVE EHLVKGRIVK RLLPGGGGKD EKASLEGNDF FAKQQRIALR NCGVINPENI DEYIAFDGYK ALAKVLTEMT PEQVVDVIKR SGLRGRGGGG FPTGLKWEFA MKQDADQKYV CCNADEGDPG AFMDRSVLEG DPHSVIEAMA IAGYAIGANQ GYVYVRAEYP IAVKRLGIAI QQAREYGLLG KNIFGTDFSF DVDIRLGAGA FVCGEETALM TSIEGHRGEP RPRPPFPAVK GLWQKPTLLN NVETYANIPQ IILKGPEWFA SIGTEKSKGT KVFAVGGKIN NTGLVEIPMG TTLREVIYDI GGGIPNGKKF KAAQTGGPSG GCIPASHIDT PIDYDSLTQL GSMMGSGGLI IMDEDTCMVD IAKFFLEFTV DESCGKCPPC RIGTKRMYEI LERITEGKGE EGDIEKLEML AKNIKASALC GLGQTAPNPI LSTLRYFRHE YIEHVRDKKC AAGVCKALMH YEIDAEKCKS CGICARQCPV KAISGEKKVP YVIDQNKCIK CGVCMEKCPF KAISKKA
|
| |