Gene Cthe_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1688 
Symbol 
ID4808863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2012330 
End bp2013310 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content32% 
IMG OID640107102 
Productradical SAM family protein 
Protein accessionYP_001038103 
Protein GI125974193 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.52361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGAA TAATGTTTGG AGAAAATTTA GAATTTCAGA GTAGTATAGG TGCACCGCTA 
TATGTTGTCC TAAACATAAC CAATAAGTGT AACTTAAGAT GTTTGCATTG CTTTAATGAT
AGTCCAACAT CGACAAAATC AGCTTGTAAG GAGCTGGAGG ATGATCAGAT AATTAAAATA
GTAAAGGAAC TGGGAGAAAT GAAAGTTGCT AACGTTTGTT TTTCTGGTGG AGAGCCCTTG
GTTAGAAAGC AGGTACTATT TAATTGTCTG ATGCTGTTAG GACAGAGAAA TGTAAGAACA
TCTATAGTTA CAAATGGGAC TTTAATTGAT GAACAAACGG CTGAAATGCT AAATGTTTTA
GGAGTAAAAG AAGTACAAGT TAGCTTAGAT GGATGTAATG AGGATACCCA CGAAAAACTA
AGACAAGTAA AAGGATGTTT TAATTTAGCG TTAAAGGGAA TAAGAAATCT GTGCTATGCT
GGAGTTACAA CATCTGTATC CTATACTCTA AATAAATGGA ATGTAAATGA CGTGGAGCCG
ATAATTGAAA AATTGGAGGA TATGCAAATT AGTGCGCTTA ATATAAGACC ATTACTGGAA
ATCGGAGCAG CGGCAGTAAA GAATGAATTG AGAGCTCCAA CCTCAAAAGA TTATAGACAA
GTTGTTAAAA CTATAAATAA ATATAAAAGA AAGGGAATAG GGTTTCAAAT TGGATTTAAC
GATCCTATAA GTCATATCTA TTATTACAGA GAAAATAAAG CGAATACAGT AATAGAGATA
CAAAGTGATG GAAACATATT TCCATCTTAT TGCATTCCTA TTTCAGTAGG GAATGTAAAA
GTAAAGTCAT TAAGAGAATA TTGGGATTCT GGATTAAATT CTCTATGGTC TAATAAAAAA
ATCCAGGAAA TTGCTAAGGA AATCTATAGC TGTAGAGATT TAAGTGAGAT TATTAATAAA
ATCAATCAGG AAGATATATA A
 
Protein sequence
MERIMFGENL EFQSSIGAPL YVVLNITNKC NLRCLHCFND SPTSTKSACK ELEDDQIIKI 
VKELGEMKVA NVCFSGGEPL VRKQVLFNCL MLLGQRNVRT SIVTNGTLID EQTAEMLNVL
GVKEVQVSLD GCNEDTHEKL RQVKGCFNLA LKGIRNLCYA GVTTSVSYTL NKWNVNDVEP
IIEKLEDMQI SALNIRPLLE IGAAAVKNEL RAPTSKDYRQ VVKTINKYKR KGIGFQIGFN
DPISHIYYYR ENKANTVIEI QSDGNIFPSY CIPISVGNVK VKSLREYWDS GLNSLWSNKK
IQEIAKEIYS CRDLSEIINK INQEDI