Gene Cthe_0505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0505 
Symbol 
ID4808305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp616237 
End bp618465 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content43% 
IMG OID640105918 
Productformate acetyltransferase 
Protein accessionYP_001036935 
Protein GI125973025 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01255] formate acetyltransferase 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00288175 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCAT GGCGCGGATT TAATAAAGGC AACTGGTGCC AGGAAATTGA CGTTCGTGAT 
TTTATAATTA GAAATTATAC TCCTTATGAA GGCGATGAAA GCTTTCTTGT AGGACCTACG
GATAGAACGC GGAAACTTTG GGAGAAGGTT TCCGAACTGT TAAAGAAAGA ACGGGAGAAC
GGCGGGGTAT TGGATGTTGA TACCCATACA ATTTCAACGA TTACGTCTCA TAAACCTGGA
TATATAGATA AAGAACTTGA AGTTATTGTC GGGCTTCAGA CGGATGAGCC TTTAAAAAGA
GCCATAATGC CGTTTGGCGG TATACGTATG GTGATTAAGG GAGCCGAAGC TTATGGCCAC
AGTGTGGACC CTCAGGTTGT TGAAATATTC ACAAAGTACA GAAAGACTCA TAACCAGGGA
GTTTATGATG TATATACTCC CGAAATGAGA AAAGCCAAAA AAGCCGGGAT TATTACAGGA
CTTCCCGACG CATACGGCAG AGGAAGAATA ATTGGCGATT ACAGAAGGGT TGCACTTTAT
GGCGTTGACA GGCTGATTGC TGAAAAAGAG AAAGAAATGG CAAGTCTTGA AAGAGATTAC
ATTGACTATG AGACTGTTCG AGACAGAGAA GAAATAAGCG AGCAGATTAA ATCTTTAAAA
CAACTTAAAG AAATGGCTTT AAGTTACGGT TTTGACATAT CTTGTCCTGC AAAGGATGCC
AGAGAAGCCT TTCAATGGTT GTATTTTGCA TATCTTGCAG CAGTCAAGGA ACAGAACGGC
GCGGCAATGA GTATTGGAAG AATTTCGACT TTCCTTGACA TATACATTGA AAGGGATCTC
AAAGAAGGAA AACTCACGGA GGAGTTGGCT CAGGAACTGG TTGACCAGCT GGTTATAAAG
CTGAGAATTG TGAGATTTTT GAGAACTCCT GAGTATGAAA AGCTCTTCAG CGGAGACCCC
ACTTGGGTAA CCGAAAGTAT CGGAGGTATG GCGCTGGATG GAAGAACGCT GGTTACAAAA
TCTTCGTTCA GGTTTTTGCA CACTCTTTTC AACCTGGGAC ATGCACCGGA GCCCAACCTT
ACAGTACTTT GGTCCGTCAA TCTTCCCGAA GGCTTTAAAA AGTACTGTGC AAAGGTATCA
ATTCATTCAA GCTCCATCCA GTATGAAAGC GACGACATAA TGAGGAAACA CTGGGGAGAC
GATTATGGAA TAGCATGCTG TGTTTCTGCT ATGAGAATTG GAAAACAGAT GCAGTTCTTC
GGTGCAAGAT GCAATCTTGC AAAAGCTCTT CTTTACGCTA TTAACGGCGG AAAGGATGAA
ATGACGGGAG AACAGATTGC TCCGATGTTT GCACCGGTGG AAACCGAATA CCTTGATTAC
GAGGACGTAA TGAAGAGGTT TGACATGGTG CTTGACTGGG TGGCAAGGCT TTATATGAAC
ACCCTCAATA TAATTCACTA CATGCATGAC AAATATGCCT ATGAGGCGCT GCAGATGGCA
TTGCATGACA AAGACGTGTT CAGGACGATG GCATGCGGAA TAGCCGGTTT GTCTGTGGTG
GCAGACTCCC TTAGCGCGAT AAAATATGCA AAGGTTAAAC CGATACGCAA TGAAAACAAC
CTCGTTGTTG ACTACGAAGT TGAGGGTGAT TATCCTAAAT TCGGAAATAA CGACGAACGT
GTTGATGAAA TTGCAGTGCA AGTAGTAAAA ATGTTCATGA ACAAGCTTAG AAAGCAAAGG
GCTTACAGAA GTGCCACTCC GACCCTTTCC ATACTTACCA TAACTTCAAA CGTGGTATAT
GGAAAGAAAA CCGGAAACAC TCCTGACGGC AGAAAAGCTG GAGAACCTTT GGCGCCGGGA
GCAAATCCGA TGCATGGAAG GGATATAAAC GGAGCATTGG CTGTACTGAA CAGTATTGCG
AAGCTTCCCT ATGAATATGC CCAGGACGGC ATTTCATATA CTTTCTCCAT AATTCCAAAA
GCTCTGGGAA GAGACGAGGA AACCAGAATA AACAATCTTA AATCAATGCT TGACGGATAT
TTCAAGCAGG GCGGCCACCA CATAAATGTA AATGTGTTTG AAAAAGAGAC ACTGTTAGAT
GCCATGGAAC ATCCGGAAAA ATATCCACAA CTTACCATAA GAGTGTCCGG GTATGCAGTG
AACTTTATAA AGCTTACACG GGAGCAACAG CTGGATGTTA TTAACAGAAC GATTCACGGA
AAGATTTAA
 
Protein sequence
MDAWRGFNKG NWCQEIDVRD FIIRNYTPYE GDESFLVGPT DRTRKLWEKV SELLKKEREN 
GGVLDVDTHT ISTITSHKPG YIDKELEVIV GLQTDEPLKR AIMPFGGIRM VIKGAEAYGH
SVDPQVVEIF TKYRKTHNQG VYDVYTPEMR KAKKAGIITG LPDAYGRGRI IGDYRRVALY
GVDRLIAEKE KEMASLERDY IDYETVRDRE EISEQIKSLK QLKEMALSYG FDISCPAKDA
REAFQWLYFA YLAAVKEQNG AAMSIGRIST FLDIYIERDL KEGKLTEELA QELVDQLVIK
LRIVRFLRTP EYEKLFSGDP TWVTESIGGM ALDGRTLVTK SSFRFLHTLF NLGHAPEPNL
TVLWSVNLPE GFKKYCAKVS IHSSSIQYES DDIMRKHWGD DYGIACCVSA MRIGKQMQFF
GARCNLAKAL LYAINGGKDE MTGEQIAPMF APVETEYLDY EDVMKRFDMV LDWVARLYMN
TLNIIHYMHD KYAYEALQMA LHDKDVFRTM ACGIAGLSVV ADSLSAIKYA KVKPIRNENN
LVVDYEVEGD YPKFGNNDER VDEIAVQVVK MFMNKLRKQR AYRSATPTLS ILTITSNVVY
GKKTGNTPDG RKAGEPLAPG ANPMHGRDIN GALAVLNSIA KLPYEYAQDG ISYTFSIIPK
ALGRDEETRI NNLKSMLDGY FKQGGHHINV NVFEKETLLD AMEHPEKYPQ LTIRVSGYAV
NFIKLTREQQ LDVINRTIHG KI