Gene Cthe_0866 details

Gene Information

Locus tag: Cthe_0866
Symbol:
ID: 4810484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1041517
End bp: 1042557
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 44%
IMG OID: 640106282
Product: pyruvate flavodoxin/ferredoxin oxidoreductase-like protein
Protein accession: YP_001037293
Protein GI: 125973383
COG category: [C] Energy production and conversion
COG ID: [COG0674] Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit
TIGRFAM ID:
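
The Start bp, End bp, Gene Length, and Protein Length values in this record agree with each other if the coordinates are read as 1-based and inclusive. A minimal sketch of that arithmetic follows; the inclusive-coordinate convention and the single untranslated stop codon are assumptions, and the function names are illustrative rather than part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(1041517, 1042557)
print(length_bp, protein_length(length_bp))  # 1041 346
```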


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000123554
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGGAA ACGAAGCTAT TGCAGAAGCT GCTTTAAGAG CAGGATGCAG GCACTATTTC 
GGATACCCGA TTACACCACA AACAGAAATT GCACATTATT TGGCAAAGAA AATGCCGGAG
GTTGGCGGAA CTTTTATCCA GGCAGAGAGT GAGGTTGCCG CCATAAACAT GGTTTACGGT
GCTGCCAGTG CGGGAGCAAG GGTCTTAACT TCGTCATCCA GTCCGGGTAT AAGTCTGAAA
CAGGAAGGAC TGTCTTATCT TGCCGGTGCG GAGCTTCCGG CTGTTGTTGT CAATATCGTA
AGATGTGGAC CGGGTCTTGG AGGAATACTG CCTGCACAGG GAGATTATTT CCAGGCTGTG
AAAGGTGGAG GTCACGGAGA TTACAAGATG GTTGTACTGG CACCTTCCAG CGTTCAGGAA
CTTTATGAGC TTACTGTGGA GGCTTTTAAT ATTGCCGACA GATACAGAAT TGTATCAATG
ATTATGGGTG ACGGAATTTT AGGACAGATG ATGGAAGCCG TTGAGTTTAA AGATGTTGAG
AATATAGAAA AAATTGACAA GCCCTGGGCT ACAACAGGTA CACAGATGAA GAGAGAGCAT
AATACCATAA CCTCCATCTA TATTCAACCC GAAGTTCTGG AGAAGCACAA TCAAAAGCTG
CAGGCAAAAT ACAGATTGAT TGAGGAAAGG GAAACCCGTG TTGAAAGTTA CAATTGTGAA
AATGCGGATA TTATAGTGAC CGCTTTTGGT ACGGTTGCAA GAATAGTGAA AAATGTTATC
AAGATGGCCG AGAAGGAAGG AATAAAAGTT GGTTTGATCA GACCTATAAC TTTGTGGCCT
TTCCCGACAA AAGAGTATGA AAAATATGCG GATGTGCCGA AAGCATTTTT GACTGTGGAA
CTTAATGCCG GCCAAATGGT TGAGGATGTA AGGCTCGCGG TCAACGGCAA AAAACCTGTG
TATTTCCATG GAAGAATGGG CGGAATGATA CCGACACAAA AGGAAATATT GGACAAGATA
AAGGAAATTT TGAACAATTA A
 
Protein sequence
MKGNEAIAEA ALRAGCRHYF GYPITPQTEI AHYLAKKMPE VGGTFIQAES EVAAINMVYG 
AASAGARVLT SSSSPGISLK QEGLSYLAGA ELPAVVVNIV RCGPGLGGIL PAQGDYFQAV
KGGGHGDYKM VVLAPSSVQE LYELTVEAFN IADRYRIVSM IMGDGILGQM MEAVEFKDVE
NIEKIDKPWA TTGTQMKREH NTITSIYIQP EVLEKHNQKL QAKYRLIEER ETRVESYNCE
NADIIVTAFG TVARIVKNVI KMAEKEGIKV GLIRPITLWP FPTKEYEKYA DVPKAFLTVE
LNAGQMVEDV RLAVNGKKPV YFHGRMGGMI PTQKEILDKI KEILNN
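
The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch for checking the record against itself, the code below strips that layout, computes a GC fraction, and translates a CDS. It assumes the standard codon assignments (translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons) and unambiguous A/C/G/T input; all helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    return sum(base in "GC" for base in seq) / len(seq) if seq else 0.0


def translate(cds: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the gene sequence, copied with their display spacing.
first_codons = clean_sequence("ATGAAGGGAA ACGAAGCT")
print(translate(first_codons))  # MKGNEA
```

Applied to the full gene sequence, `translate(clean_sequence(...))` can be compared against the listed protein sequence, and `gc_fraction` against the GC content field.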