Gene Cthe_1492 details


Gene Information

Locus tag: Cthe_1492
Symbol:
ID: 4810642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1812018
End bp: 1813640
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 41%
IMG OID: 640106912
Product: NAD(P)H dehydrogenase (quinone)
Protein accession: YP_001037913
Protein GI: 125974003
COG category: [R] General function prediction only
COG ID: [COG0655] Multimeric flavodoxin WrbA
TIGRFAM ID:
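
The coordinates and lengths listed above agree with each other, which a short Python sketch can confirm. It assumes 1-based, inclusive coordinates and a gene length that includes the stop codon; both are assumptions about this record's conventions, not documented facts, although they reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1812018, 1813640)
print(length, protein_length(length))  # prints "1623 540"
```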


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAC TTGTTATAAA CGGAAGTCCG AAAGGTGATG CCAGCAACTC TTTGAAACTT 
ACCAAAGCCT TTCTCGAAGG TATGGGAGAC AATGATGTAA GGGAAGTTAC GGTGTCCAGG
CTGAATTTAT CGCCCTGCAA GGGCTGTTTT TGCTGCTGGA GCAAAACGCC CGGAAAATGC
GTGATAAATG ATGATATGAG CCGGGTTATA GAGGATGAAT TGTGGGCTGA CATTATCATT
TGGAGTTTTC CTTTGTACTA TTTTAATGTT CCGGGCCCGC TTAAAAACTT GATTGACAGA
CAGCTTCCAA TGAACCTTCC CTTTATGACG GAGCGAGAAG ACGGTATGGG AAGCGGAAGT
CACCCTTCAA GATATGATAT GAGCGGCAAA AGATACATTT TAATTTCCAC CTGTGGGTTT
TATTCCGCCG AAAAAAATTA TGAAAGTGTA AAAAGTATGT TTGACTATAT TTGTGGCAAA
GGCAATTATG AGACAATTTT CTGCGGTCAG GGAGAACTGT TTCGTGCTCC GGAATTGAAA
AAAAGAACGG ACGAATACCT CGATATAGTA AGAAAAGCCG GCAGAGAATA TATATCAACA
GGTATTTCAA ATGAGACAAG AAGCAAGCTG AATGAACTTT TGTATCCTAA GGAAGTATTT
GAACAAATGG CTGATGCCAG TTGGGGAATT GACAAAGAAA CCGGCAATGA AGTTGACAAG
AGTCTTTCTT TCACCAGGCA AATGGCGGCT TTGTACAATA AGGGAAGTTA TGACGGAAAA
GACCGTGTGC TGGAGATATG TTACACGGAT TTGGGCAAAA CCTATCAAAT TTTATTGGGG
AAAGACGGCA GCAAAGTTTT TACCGCCGGC AGTTTGCCGG CAACAACAAG AATTGAGACA
CCGTGGGAAG TATGGACATC CATTGCCAGA GGTGAGATAA GAGGAGATGT GGCACTTTTT
AAAGGTATGT ATAAGGTTAC CGGTGATTTT TCTTTGATGA TGAATTGGGA TAAATATTTT
AGCAAAACCA AAGAACAACA GGAAAATGAG ATTGACAAAA GCCTGACATC AAAGAATAAA
AAGCCGTCAA TGATGACAAT GCTGATTCCG TGGATTACGT TTTGGATTGC CGTATCCATT
AACGCCAATA TAGGTGCAAT TATTACCCTT GCAGTTTGTG CATGCACTCC CATGGTTATG
GCACGAAAAG AGCTGACCGT TTATGACAAA ATTTCAATGG CGGTTGTATC ACTTTTATCG
GTTCTGACTT TACAGAATGA CATGAAAATA ATATCCATTG TGGCAGGATA CCTTGCATTC
GGGCTTATGT GGCTTCTGTC CTGCTTTACA CGAGAGCCTC TTTGCGCCGC GTATGTCAAG
TACGATTATA ACGGTGAAGA TGCGTTAAAC AACCCTATTT TCATGAAAAC AAATTATGTA
TTGGCCGTAG GCTGGGGAAT TTTATATATT TTAACAGCGA TTTGGTCGTG GTTTTTGATG
CGTTTGAACA TGATCGTGCT GTTGCAAATT CTGAATAATG CTGCGACCTG GGCTATGGGT
ATTTTTACGA TATGGTTTGT AAGATGGTAT CCGCAGCATA TTGCGTTAAA AGGCAAGCGT
TAA
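
The gene sequence above is printed in blocks of 10 bases, so the spacing has to be removed before computing anything from it. The following is a minimal Python sketch of that cleanup plus a GC-fraction calculation, such as could be compared against the listed GC content. The helper names `clean_sequence` and `gc_content` are hypothetical, and how the database rounds its percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGAAAATAC TTGTTATAAA CGGAAGTCCG AAAGGTGATG CCAGCAACTC TTTGAAACTT"
)
print(len(first_line), round(gc_content(first_line), 3))
```

Passing the full 1623 bp sequence to `gc_content` in the same way gives the fraction for the whole gene.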
 
Protein sequence
MKILVINGSP KGDASNSLKL TKAFLEGMGD NDVREVTVSR LNLSPCKGCF CCWSKTPGKC 
VINDDMSRVI EDELWADIII WSFPLYYFNV PGPLKNLIDR QLPMNLPFMT EREDGMGSGS
HPSRYDMSGK RYILISTCGF YSAEKNYESV KSMFDYICGK GNYETIFCGQ GELFRAPELK
KRTDEYLDIV RKAGREYIST GISNETRSKL NELLYPKEVF EQMADASWGI DKETGNEVDK
SLSFTRQMAA LYNKGSYDGK DRVLEICYTD LGKTYQILLG KDGSKVFTAG SLPATTRIET
PWEVWTSIAR GEIRGDVALF KGMYKVTGDF SLMMNWDKYF SKTKEQQENE IDKSLTSKNK
KPSMMTMLIP WITFWIAVSI NANIGAIITL AVCACTPMVM ARKELTVYDK ISMAVVSLLS
VLTLQNDMKI ISIVAGYLAF GLMWLLSCFT REPLCAAYVK YDYNGEDALN NPIFMKTNYV
LAVGWGILYI LTAIWSWFLM RLNMIVLLQI LNNAATWAMG IFTIWFVRWY PQHIALKGKR
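
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. The Python sketch below therefore uses the standard assignments and translates the first codon literally, which is enough here because the gene starts with ATG. The `translate` helper is hypothetical, and it raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
print(translate("ATGAAAATAC TTGTTATAAA CGGAAGTCCG AAAGGTGATG CCAGCAACTC TTTGAAACTT"))
# prints "MKILVINGSPKGDASNSLKL", the first 20 residues of the protein sequence
```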