Gene Cthe_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1964 
Symbol 
ID4810747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2343495 
End bp2345024 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content45% 
IMG OID640107380 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_001038375 
Protein GI125974465 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3634] Alkyl hydroperoxide reductase, large subunit 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase
[TIGR03140] alkyl hydroperoxide reductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTTGG ACGCGGAAAT CATGCAGCAA CTGGAACAGT ATTTGACGTT GCTGGAAAAT 
GATATTGTTA TCAAGGTTAA CGCCGGAAAC GATAAAGTAT CCGTTGATAT GGTCGGGCTA
ATAGATGAAA TTGCAAAACT TACGCCAAAG ATTCACGTTG AAAAAGCAGA ACTCGACAGA
ACACCAAGCT TTAGCGTTAA CCGTCCGAAT GAGGATACGG GTATTGTCTT TGCCGGTATT
CCGTTGGGTC ATGAATTCAA CTCCTTAGTT TTGGCTCTCC TCCAGGTAGG AGGAAGAGCC
CCTAAGGTGG ATGAGGCTCT CATCAATCAG ATTAAGGGAA TTAAAGGAGA ATATCATTTT
GAAACTTACG TCAGCCTAAG CTGCCATAAT TGCCCTGATG TTGTCCAGGC ACTTAACATA
ATGAGCGTAC TCAATCCTAA TATCACCCAT ACCATGATCG ACGGAGCAGT CTTCAGAGAA
GAGGTTGAAA GCAAGGGCAT AATGGCGGTG CCTACGATAT ACTTAAATGG AAATTTCTTC
GAAAGCGGCC GATTGACGCT GGAAGAAATT CTTGCAAAAC TGGGGCAGGC GACGGAAAGT
CCATCCATTA ATGAAAAAGA GCCTTTTGAC GTTCTTGTCA TAGGCGGAGG ACCTGCCGGC
GTAAGTTCTG CCATCTATGC TGCCCGGAAA GGACTTCGTA CCGGTATTAT AGCTGAAAGA
TTTGGAGGGC AGATTTTAGA TACCTTGGGA ATTGAGAATT TTATCTCAGT TCCATACACC
GAAGGTCCCA AGCTTGCCGA AAATTTCAAG GAGCATGTTA AAAGGTATGA CATAGATGTA
ATGGAGCGTC AAAGGGCAAA AAGTATAAGA CGCAATGAAC TTTTGGAAGT AGAGCTGGAA
AAGGGAGCTG TTGTTAAAAG TAAGACGGTT ATTATTGCAA CGGGGGCGAG ATGGCGCAAT
GTGAACGTTC CCGGAGAGAA AGAGTTTAAA AACAAAGGCG TGGCTTATTG CCCCCACTGC
GACGGACCGC TGTTTGCAGG GAAGGATGTG GCCGTAATTG GCGGAGGCAA TTCCGGTATA
GAAGCTGCCA TTGATTTGGC GGGCATTGTA AGACACGTAA CGGTACTGGA ATTTCTGCCG
CAGCTGAAAG CTGACAAAGT GCTTCAGGAA CGTTTGTACA GGCTTCCCAA TGTAACTGTG
CTTACAAATG TCCAGACAAA AGAGTTTACC GGGAAGGAAA AGCTCGACGG GATTACTTAC
ATTGAGCGTG ACACTAATCA AGAAAAGCAC ATTGAGGTTC AGGGAGTGTT TGTTCAGATT
GGCCTTGTCC CAAATACCGA ATGGCTGGAA GGAACCATTG AACGCAATGC CATGGGTGAA
ATTATTGTCA ATGAAAAAAA TGAAACCTCA ATGCCCGGGG TATTTGCAGC AGGTGACTGT
ACCAACAGCC CCTATAAACA GATTGTTATA GCCATGGGTT CCGGTGCGAC GGCCGCACTC
AGCGCTTTCG ATTACCTAAT CAGGAATTAA
 
Protein sequence
MFLDAEIMQQ LEQYLTLLEN DIVIKVNAGN DKVSVDMVGL IDEIAKLTPK IHVEKAELDR 
TPSFSVNRPN EDTGIVFAGI PLGHEFNSLV LALLQVGGRA PKVDEALINQ IKGIKGEYHF
ETYVSLSCHN CPDVVQALNI MSVLNPNITH TMIDGAVFRE EVESKGIMAV PTIYLNGNFF
ESGRLTLEEI LAKLGQATES PSINEKEPFD VLVIGGGPAG VSSAIYAARK GLRTGIIAER
FGGQILDTLG IENFISVPYT EGPKLAENFK EHVKRYDIDV MERQRAKSIR RNELLEVELE
KGAVVKSKTV IIATGARWRN VNVPGEKEFK NKGVAYCPHC DGPLFAGKDV AVIGGGNSGI
EAAIDLAGIV RHVTVLEFLP QLKADKVLQE RLYRLPNVTV LTNVQTKEFT GKEKLDGITY
IERDTNQEKH IEVQGVFVQI GLVPNTEWLE GTIERNAMGE IIVNEKNETS MPGVFAAGDC
TNSPYKQIVI AMGSGATAAL SAFDYLIRN