Gene Cthe_2392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2392 
Symbol 
ID4811044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2857007 
End bp2858191 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content43% 
IMG OID640107805 
Productpyruvate ferredoxin oxidoreductase, alpha subunit 
Protein accessionYP_001038787 
Protein GI125974877 
COG category[C] Energy production and conversion 
COG ID[COG0674] Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00104285 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAATAA GAGAAAGGCT TAGCGGTAAT GAGGCAACGG CGATTGCCAT GAGACAAATA 
AATCCTGATG TGGTTGCTGC TTTTCCGATA ACACCGTCAA CGGAAATTCC TCAATATTTC
TCGTCATATG TCGCTGACGG ACTTGTAGAT ACGGAATTTG TTGCTGTGGA ATCAGAGCAC
AGTGCAATGT CTGCATGTAT AGGTGCTCAG GCTGCAGGTG CAAGAGCAAT GACTGCCACA
TCCGCAAACG GTTTGGCATA TATGTGGGAG GCTTTGTATA TAGCGGCAAG TATGAGACTT
CCGATTGTAT TGGCGGCTGT AAACAGAGCA CTTTCAGGTC CTATCAATAT CCACAACGAC
CACAGCGATA CAATGGGAGC TAGGGATTCG GGATGGATCC AGTTATACAG TGAAAACAAC
CAGGAGGCTT ATGACAACAT GCTTATGGCT CACAGGATAG GTGAGCATCC TGATGTAATG
CTTCCTGTCA TGGTCTGCCA GGACGGATTT ATTACTTCTC ACGCAATAGA AAATATTGAA
CTGGTGGAAG ATGAGAAAGT TAAGGCTTTT GTAGGAGAAT ACAAACCGAC TCATTATCTT
CTCGACAGGG AAAATCCGAT TTCTGTGGGT CCTTTGGATT TGCAGATGCA TTATTTCGAG
CACAAGAGAC AGCAGGCACA GGCAATGGAA AACGCCAAAA AGGTAATTCT TGAAGTGGCG
GAAGAATTCT ACAAGCTTAC GGGAAGAAAA TACGGATTTT TTGAAGAATA CAAAACCGAT
GATGCCGATG TTGCCATTGT TGTTATGAAC TCCACTGCCG GTACTGTAAA ATATGTTATC
GACGAGTACA GGGCAAAAGG CAAAAAAGTT GGTTTGATAA AACCTAGAGT ATTCAGACCT
TTCCCTGTTG ATGAACTGGC ACAGGCTTTG TCAAAGTTTA AGGCAGTGGC CGTTATGGAC
AAGGCTGACA GCTTCAATGC AGCCGGAGGA CCTTTGTTTA CAGAGGTAAC AAGTGCACTC
TTCACAAAAG GAGTATTTGG TCCTAAGGTT ATTAACTATA AGTTTGGATT GGGTGGAAGA
GACGTTAAAG TTGATGATAT TGAAGTTGTT TGTGAGAAGC TTCTGGAAAT TGCAAGTACA
GGCAAGGTAG ACTCAGTATA CAATTACCTT GGTGTTAGAG AGTAG
 
Protein sequence
MGIRERLSGN EATAIAMRQI NPDVVAAFPI TPSTEIPQYF SSYVADGLVD TEFVAVESEH 
SAMSACIGAQ AAGARAMTAT SANGLAYMWE ALYIAASMRL PIVLAAVNRA LSGPINIHND
HSDTMGARDS GWIQLYSENN QEAYDNMLMA HRIGEHPDVM LPVMVCQDGF ITSHAIENIE
LVEDEKVKAF VGEYKPTHYL LDRENPISVG PLDLQMHYFE HKRQQAQAME NAKKVILEVA
EEFYKLTGRK YGFFEEYKTD DADVAIVVMN STAGTVKYVI DEYRAKGKKV GLIKPRVFRP
FPVDELAQAL SKFKAVAVMD KADSFNAAGG PLFTEVTSAL FTKGVFGPKV INYKFGLGGR
DVKVDDIEVV CEKLLEIAST GKVDSVYNYL GVRE