Gene Cthe_0429 details


Gene Information

Locus tag: Cthe_0429
Symbol:
ID: 4808357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 538363
End bp: 540237
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 46%
IMG OID: 640105843
Product: NADH dehydrogenase (quinone)
Protein accession: YP_001036860
Protein GI: 125972950
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
        [COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits
        [COG3411] Ferredoxin
TIGRFAM ID:
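
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    return gene_length // 3 - 1


length = gene_length_bp(538363, 540237)
print(length)                              # 1875
print(expected_protein_length_aa(length))  # 624
```

Both values agree with the Gene Length and Protein Length fields of the record.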


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAAAA ACAGAGAAGA GTTGCGAAAG GCCCGGGAGA TGTACTCAAG ATACCTTAAG 
GCGGAAAAGA GAAGAGTTCT TGTTTGTGCA GGCACCGGCT GCGTATCCGG CGGTTCCATG
GAGATTTTCG AGCGGCTTTC GGAGTTGGTT TCAAAGAGAG GGATGGATTG CCAGGTTGAA
TTGAAAGAAG AACCACACGA CAATACCATA GGCATGAAAA AAAGCGGGTG CCATGGTTTT
TGTGAAATGG GACCCCTTGT AAGAATTGAG CCTGAAGGTT ATCTGTACAC CAAAGTAAAG
CTTGAAGACT GTGAAGAGAT TGTGGACAGA ACGATAGTGG CGGGGGAACA TATAGAAAGG
CTTGCCTATA AACAAAACGG AGTTGTATAC AAAAAACAGG ATGAAATTCC CTTTTACAAG
AAGCAAACCC GTCTTGTACT GGAACACTGT GGTCAAATTG ACTCCACATC CATAACCGAA
TACCTTGCAA CCGGGGGATA TTATGCGCTG GAGAAAGCGT TGTTTGACAT GACCGGTGAT
GAAATTATAA ATGAAATAAC TGAAGCAAAT CTTCGTGGAC GCGGAGGAGG AGGTTTTCCG
GCAGGACGTA AATGGGCACA GGTAAAAAGG CAGAATGCAA AACAAAAGTA TGTTGTGTGC
AACGGAGATG AAGGCGATCC GGGAGCGTTT ATGGACAGGA GCATTATGGA GGGTGACCCC
CACAGAATGA TTGAGGGGAT GATAATTGCA GGCATTGCCT GCGGGGCGTC CGAGGGCTAT
ATTTATGTGC GTGCAGAATA TCCACTTGCT GTGTCCAGAC TTAAAAGGGC CATCGAGCAG
GCAAAAGAGT TTGGCTTGCT TGGTGAAAAC ATACTGGGAA GCAATTTTAG CTTCAATATT
CATATAAATC GGGGCGCAGG TGCGTTTGTT TGCGGTGAGG GAAGCGCGCT TACGGCATCG
ATTGAAGGAA AACGTGGCAT GCCCAGGGTA AAGCCGCCAA GAACCGTGGA GCAGGGCTTG
TTTGATATGC CTACGGTTTT AAACAATGTT GAAACCTTTG CCAACGTTCC CCTTATTATC
AAAAACGGAG CTGACTGGTA TAAATCCATC GGGACTGAAA AAAGTCCGGG AACAAAAGCT
TTTGCGCTGA CGGGCAACAT AGAAAATACC GGTCTTATTG AGATTCCCAT GGGAACAACC
TTGAGGGAAG TTATATTTGA CATCGGCGGA GGAATGAGGA ATGGCGCGGA TTTTAAAGCC
GTGCAAATCG GAGGCCCGTC GGGAGGGTGC CTGTCGGAAA AAGATTTGGA CCTTCCACTG
GATTTTGACT CACTGAAAAA AGCTGGTGCC ATGATTGGTT CCGGAGGATT GGTTGTAATG
GACAGCAATA CTTGTATGGT AGAAGTTGCG CGCTTTTTTA TGAATTTTAC CCAGAATGAG
TCCTGCGGAA AATGTGTTCC CTGCCGCGAA GGTACAAAGA GAATGCTGGA GATTTTGGAA
AGGATTGTAG AAGGAAACGG CCAGGACGGT GACATAGAAC TTCTTTTGGA ATTGGCGGAC
ACCATTTCAG CAACGGCACT TTGCGGACTT GGCAAAGCTG CGGCATTTCC TGTTGTAAGT
ACGATTAAGA ATTTTAGAGA AGAATATGAA GCACATATTT ATGACAAGAG ATGTCCCACA
GGAAATTGTC AGAAGCTTAA AACCATTACT ATTGATGCTT CTTTGTGCAA AGGCTGCTCA
AAGTGTGCAA GAAGTTGTCC TGTGGGCGCA ATAACAGGAA AAGTTAAAGA GCCTTTTGTT
ATAGATCAAA GCAAATGTAT CAAGTGCGGT GCATGTATTG AAACTTGTGC ATTCCACGCA
ATATTGGAGG GCTGA
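
A GC content figure such as the one listed in the gene information can be recomputed from a sequence printed in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length. The helper name `gc_content_percent` is hypothetical, and the demonstration uses only the first 10-base block, not the whole gene.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of the gene sequence, used only as a small demonstration.
print(gc_content_percent("ATGCTTAAAA"))  # 20.0
```

Passing the complete gene sequence, spaces and line breaks included, gives the value for the whole gene.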
 
Protein sequence
MLKNREELRK AREMYSRYLK AEKRRVLVCA GTGCVSGGSM EIFERLSELV SKRGMDCQVE 
LKEEPHDNTI GMKKSGCHGF CEMGPLVRIE PEGYLYTKVK LEDCEEIVDR TIVAGEHIER
LAYKQNGVVY KKQDEIPFYK KQTRLVLEHC GQIDSTSITE YLATGGYYAL EKALFDMTGD
EIINEITEAN LRGRGGGGFP AGRKWAQVKR QNAKQKYVVC NGDEGDPGAF MDRSIMEGDP
HRMIEGMIIA GIACGASEGY IYVRAEYPLA VSRLKRAIEQ AKEFGLLGEN ILGSNFSFNI
HINRGAGAFV CGEGSALTAS IEGKRGMPRV KPPRTVEQGL FDMPTVLNNV ETFANVPLII
KNGADWYKSI GTEKSPGTKA FALTGNIENT GLIEIPMGTT LREVIFDIGG GMRNGADFKA
VQIGGPSGGC LSEKDLDLPL DFDSLKKAGA MIGSGGLVVM DSNTCMVEVA RFFMNFTQNE
SCGKCVPCRE GTKRMLEILE RIVEGNGQDG DIELLLELAD TISATALCGL GKAAAFPVVS
TIKNFREEYE AHIYDKRCPT GNCQKLKTIT IDASLCKGCS KCARSCPVGA ITGKVKEPFV
IDQSKCIKCG ACIETCAFHA ILEG
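
The protein sequence follows from the gene sequence through translation table 11. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCTTAAAA ACAGAGAAGA GTTGCGAAAG"))  # MLKNREELRK
print(translate("ATATTGGAGG GCTGA"))                  # ILEG
```

The two outputs match the first ten and last four residues of the protein sequence listed above.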