Gene Cthe_2566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2566 
Symbol 
ID4809173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3037597 
End bp3038922 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content41% 
IMG OID640107981 
Productradical SAM family protein 
Protein accessionYP_001038960 
Protein GI125975050 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGACAT TATTAATGAT TACTCCTGAA AACCTGGAAA TAAAAAGATT CAGAGCTTAT 
CAGCTTAACA ACTTTATGCA GCTGACCATG CCATATCTTG CAGGCTTCGT CCCGAAAGAA
TATGACATAA GGCTGTTGGA TGAGCATACA GAACCAATTA TTTTCGAAAA ATTCGACTTG
GTTGCAATTA CTGTAAATAC TCCAAATGCT CCCTATGCTT ATGCTATCTC AAAAAAATTC
CGCGAGCTGG GATCGTGGGT GGCTTTAGGC GGACCGCATG TGACCTTGAA TCCCGAAGAA
GCAGCTTTGC ATTGCGATAC TATTTTCATA GGAGAAGCTG AAGAAACCTG GCCTCAATTC
TTAAACGACT TCCTTTGCAA AAAAGCCCAA AAAGTTTACA CAAGTTCACG TGCTCCCAGT
CTTGCCGGTC TTCCGATACC CCGCAGGGAT TTGATTGTCG GCAATAGATT TATAAGCGGC
GCTGTTTTTG CTTCCCGCGG TTGCCCGCAT AATTGCAGCT ATTGCAGTCT TAAGAAAATC
TATCACAATA CGTTTAGAAC TCGTCCGGTT GACGAGGTAA TAAAGGATAT AGCCAGTATG
CCAAATAAGT ATTTCGTATT TTGGGATGAT AATTTCTTTG CCGATCCAAG CTACACAAAA
GCCCTGCTAA AAGATTTGGC AAAGCTAAAC AAAAAATGGG CTGCACAAGT AACAGCGCAC
AGCTGTGAGG ATGAAGAATT ACTCCGCCTG GCCAAATCCG CAGGGTGTCT GTATTTGTTC
CTGGGCTTGG AATCGTTTTC CGAGCAGGGT CTTAAAGATG CGAATAAGAG TTTTAATAAA
ATTGAGCAGT ATGGGAAGAT TGCAGATAAA CTTCACCGAC ATGGAATCAG CATCCAAGCC
GGTATTGTTT TTGGGTTTGA CTCCGACACG CCGGATGTAT TCGAAACAAC CTTAAAAGCA
TGTAATGACA TCGGAATCGA CGGAGTAACT GCAAGTATAC TAACTCCTTT TCCCGGAACC
GGGATATATG ACCAATATAA AAATGAAGGC CGGCTACTTG ATGTTGGTTG GAATTATTAC
AACGGCAAAA CAAGGGTTGC ATATGTACCG AAGCAAATGA GTCCCGAGGA ATTGCTAAAC
GGCTATAACG AATTTCGAAG AAAGTTTTAT TCGTGGAAAA GCATAATAAA GCGAATATGT
AAATCACGAG TAAATATCTT TTATAATCTG GCCATGAATT ATGGTTACAG GCAAGCATAT
CGTAACTTTG CCAAATTGGA TATTGGAGAT AGTAAGGGGA TGAATATTTT TGAGCAGCTT
ATATAA
 
Protein sequence
MPTLLMITPE NLEIKRFRAY QLNNFMQLTM PYLAGFVPKE YDIRLLDEHT EPIIFEKFDL 
VAITVNTPNA PYAYAISKKF RELGSWVALG GPHVTLNPEE AALHCDTIFI GEAEETWPQF
LNDFLCKKAQ KVYTSSRAPS LAGLPIPRRD LIVGNRFISG AVFASRGCPH NCSYCSLKKI
YHNTFRTRPV DEVIKDIASM PNKYFVFWDD NFFADPSYTK ALLKDLAKLN KKWAAQVTAH
SCEDEELLRL AKSAGCLYLF LGLESFSEQG LKDANKSFNK IEQYGKIADK LHRHGISIQA
GIVFGFDSDT PDVFETTLKA CNDIGIDGVT ASILTPFPGT GIYDQYKNEG RLLDVGWNYY
NGKTRVAYVP KQMSPEELLN GYNEFRRKFY SWKSIIKRIC KSRVNIFYNL AMNYGYRQAY
RNFAKLDIGD SKGMNIFEQL I