Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0429 |
Symbol | |
ID | 4808357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 538363 |
End bp | 540237 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640105843 |
Product | NADH dehydrogenase (quinone) |
Protein accession | YP_001036860 |
Protein GI | 125972950 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits [COG3411] Ferredoxin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAAAA ACAGAGAAGA GTTGCGAAAG GCCCGGGAGA TGTACTCAAG ATACCTTAAG GCGGAAAAGA GAAGAGTTCT TGTTTGTGCA GGCACCGGCT GCGTATCCGG CGGTTCCATG GAGATTTTCG AGCGGCTTTC GGAGTTGGTT TCAAAGAGAG GGATGGATTG CCAGGTTGAA TTGAAAGAAG AACCACACGA CAATACCATA GGCATGAAAA AAAGCGGGTG CCATGGTTTT TGTGAAATGG GACCCCTTGT AAGAATTGAG CCTGAAGGTT ATCTGTACAC CAAAGTAAAG CTTGAAGACT GTGAAGAGAT TGTGGACAGA ACGATAGTGG CGGGGGAACA TATAGAAAGG CTTGCCTATA AACAAAACGG AGTTGTATAC AAAAAACAGG ATGAAATTCC CTTTTACAAG AAGCAAACCC GTCTTGTACT GGAACACTGT GGTCAAATTG ACTCCACATC CATAACCGAA TACCTTGCAA CCGGGGGATA TTATGCGCTG GAGAAAGCGT TGTTTGACAT GACCGGTGAT GAAATTATAA ATGAAATAAC TGAAGCAAAT CTTCGTGGAC GCGGAGGAGG AGGTTTTCCG GCAGGACGTA AATGGGCACA GGTAAAAAGG CAGAATGCAA AACAAAAGTA TGTTGTGTGC AACGGAGATG AAGGCGATCC GGGAGCGTTT ATGGACAGGA GCATTATGGA GGGTGACCCC CACAGAATGA TTGAGGGGAT GATAATTGCA GGCATTGCCT GCGGGGCGTC CGAGGGCTAT ATTTATGTGC GTGCAGAATA TCCACTTGCT GTGTCCAGAC TTAAAAGGGC CATCGAGCAG GCAAAAGAGT TTGGCTTGCT TGGTGAAAAC ATACTGGGAA GCAATTTTAG CTTCAATATT CATATAAATC GGGGCGCAGG TGCGTTTGTT TGCGGTGAGG GAAGCGCGCT TACGGCATCG ATTGAAGGAA AACGTGGCAT GCCCAGGGTA AAGCCGCCAA GAACCGTGGA GCAGGGCTTG TTTGATATGC CTACGGTTTT AAACAATGTT GAAACCTTTG CCAACGTTCC CCTTATTATC AAAAACGGAG CTGACTGGTA TAAATCCATC GGGACTGAAA AAAGTCCGGG AACAAAAGCT TTTGCGCTGA CGGGCAACAT AGAAAATACC GGTCTTATTG AGATTCCCAT GGGAACAACC TTGAGGGAAG TTATATTTGA CATCGGCGGA GGAATGAGGA ATGGCGCGGA TTTTAAAGCC GTGCAAATCG GAGGCCCGTC GGGAGGGTGC CTGTCGGAAA AAGATTTGGA CCTTCCACTG GATTTTGACT CACTGAAAAA AGCTGGTGCC ATGATTGGTT CCGGAGGATT GGTTGTAATG GACAGCAATA CTTGTATGGT AGAAGTTGCG CGCTTTTTTA TGAATTTTAC CCAGAATGAG TCCTGCGGAA AATGTGTTCC CTGCCGCGAA GGTACAAAGA GAATGCTGGA GATTTTGGAA AGGATTGTAG AAGGAAACGG CCAGGACGGT GACATAGAAC TTCTTTTGGA ATTGGCGGAC ACCATTTCAG CAACGGCACT TTGCGGACTT GGCAAAGCTG CGGCATTTCC TGTTGTAAGT ACGATTAAGA ATTTTAGAGA AGAATATGAA GCACATATTT ATGACAAGAG ATGTCCCACA GGAAATTGTC AGAAGCTTAA AACCATTACT ATTGATGCTT CTTTGTGCAA AGGCTGCTCA AAGTGTGCAA GAAGTTGTCC TGTGGGCGCA ATAACAGGAA AAGTTAAAGA GCCTTTTGTT ATAGATCAAA GCAAATGTAT CAAGTGCGGT GCATGTATTG AAACTTGTGC ATTCCACGCA ATATTGGAGG GCTGA
|
Protein sequence | MLKNREELRK AREMYSRYLK AEKRRVLVCA GTGCVSGGSM EIFERLSELV SKRGMDCQVE LKEEPHDNTI GMKKSGCHGF CEMGPLVRIE PEGYLYTKVK LEDCEEIVDR TIVAGEHIER LAYKQNGVVY KKQDEIPFYK KQTRLVLEHC GQIDSTSITE YLATGGYYAL EKALFDMTGD EIINEITEAN LRGRGGGGFP AGRKWAQVKR QNAKQKYVVC NGDEGDPGAF MDRSIMEGDP HRMIEGMIIA GIACGASEGY IYVRAEYPLA VSRLKRAIEQ AKEFGLLGEN ILGSNFSFNI HINRGAGAFV CGEGSALTAS IEGKRGMPRV KPPRTVEQGL FDMPTVLNNV ETFANVPLII KNGADWYKSI GTEKSPGTKA FALTGNIENT GLIEIPMGTT LREVIFDIGG GMRNGADFKA VQIGGPSGGC LSEKDLDLPL DFDSLKKAGA MIGSGGLVVM DSNTCMVEVA RFFMNFTQNE SCGKCVPCRE GTKRMLEILE RIVEGNGQDG DIELLLELAD TISATALCGL GKAAAFPVVS TIKNFREEYE AHIYDKRCPT GNCQKLKTIT IDASLCKGCS KCARSCPVGA ITGKVKEPFV IDQSKCIKCG ACIETCAFHA ILEG
|
| |