Gene Information |
Locus tag | Cthe_0505 |
Symbol | |
ID | 4808305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 616237 |
End bp | 618465 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105918 |
Product | formate acetyltransferase |
Protein accession | YP_001036935 |
Protein GI | 125973025 |
COG category | [C] Energy production and conversion |
COG ID | [COG1882] Pyruvate-formate lyase |
TIGRFAM ID | [TIGR01255] formate acetyltransferase 1 |
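The coordinate and length fields above can be cross-checked against each other. The sketch below is only a consistency check, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 616237
END_BP = 618465
GENE_LENGTH_BP = 2229
PROTEIN_LENGTH_AA = 742


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: 618465 - 616237 + 1 = 2229 bp, and 2229 / 3 - 1 = 742 aa.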
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00288175 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCAT GGCGCGGATT TAATAAAGGC AACTGGTGCC AGGAAATTGA CGTTCGTGAT TTTATAATTA GAAATTATAC TCCTTATGAA GGCGATGAAA GCTTTCTTGT AGGACCTACG GATAGAACGC GGAAACTTTG GGAGAAGGTT TCCGAACTGT TAAAGAAAGA ACGGGAGAAC GGCGGGGTAT TGGATGTTGA TACCCATACA ATTTCAACGA TTACGTCTCA TAAACCTGGA TATATAGATA AAGAACTTGA AGTTATTGTC GGGCTTCAGA CGGATGAGCC TTTAAAAAGA GCCATAATGC CGTTTGGCGG TATACGTATG GTGATTAAGG GAGCCGAAGC TTATGGCCAC AGTGTGGACC CTCAGGTTGT TGAAATATTC ACAAAGTACA GAAAGACTCA TAACCAGGGA GTTTATGATG TATATACTCC CGAAATGAGA AAAGCCAAAA AAGCCGGGAT TATTACAGGA CTTCCCGACG CATACGGCAG AGGAAGAATA ATTGGCGATT ACAGAAGGGT TGCACTTTAT GGCGTTGACA GGCTGATTGC TGAAAAAGAG AAAGAAATGG CAAGTCTTGA AAGAGATTAC ATTGACTATG AGACTGTTCG AGACAGAGAA GAAATAAGCG AGCAGATTAA ATCTTTAAAA CAACTTAAAG AAATGGCTTT AAGTTACGGT TTTGACATAT CTTGTCCTGC AAAGGATGCC AGAGAAGCCT TTCAATGGTT GTATTTTGCA TATCTTGCAG CAGTCAAGGA ACAGAACGGC GCGGCAATGA GTATTGGAAG AATTTCGACT TTCCTTGACA TATACATTGA AAGGGATCTC AAAGAAGGAA AACTCACGGA GGAGTTGGCT CAGGAACTGG TTGACCAGCT GGTTATAAAG CTGAGAATTG TGAGATTTTT GAGAACTCCT GAGTATGAAA AGCTCTTCAG CGGAGACCCC ACTTGGGTAA CCGAAAGTAT CGGAGGTATG GCGCTGGATG GAAGAACGCT GGTTACAAAA TCTTCGTTCA GGTTTTTGCA CACTCTTTTC AACCTGGGAC ATGCACCGGA GCCCAACCTT ACAGTACTTT GGTCCGTCAA TCTTCCCGAA GGCTTTAAAA AGTACTGTGC AAAGGTATCA ATTCATTCAA GCTCCATCCA GTATGAAAGC GACGACATAA TGAGGAAACA CTGGGGAGAC GATTATGGAA TAGCATGCTG TGTTTCTGCT ATGAGAATTG GAAAACAGAT GCAGTTCTTC GGTGCAAGAT GCAATCTTGC AAAAGCTCTT CTTTACGCTA TTAACGGCGG AAAGGATGAA ATGACGGGAG AACAGATTGC TCCGATGTTT GCACCGGTGG AAACCGAATA CCTTGATTAC GAGGACGTAA TGAAGAGGTT TGACATGGTG CTTGACTGGG TGGCAAGGCT TTATATGAAC ACCCTCAATA TAATTCACTA CATGCATGAC AAATATGCCT ATGAGGCGCT GCAGATGGCA TTGCATGACA AAGACGTGTT CAGGACGATG GCATGCGGAA TAGCCGGTTT GTCTGTGGTG GCAGACTCCC TTAGCGCGAT AAAATATGCA AAGGTTAAAC CGATACGCAA TGAAAACAAC CTCGTTGTTG ACTACGAAGT TGAGGGTGAT TATCCTAAAT TCGGAAATAA CGACGAACGT GTTGATGAAA TTGCAGTGCA AGTAGTAAAA ATGTTCATGA ACAAGCTTAG AAAGCAAAGG GCTTACAGAA GTGCCACTCC GACCCTTTCC ATACTTACCA TAACTTCAAA CGTGGTATAT 
GGAAAGAAAA CCGGAAACAC TCCTGACGGC AGAAAAGCTG GAGAACCTTT GGCGCCGGGA GCAAATCCGA TGCATGGAAG GGATATAAAC GGAGCATTGG CTGTACTGAA CAGTATTGCG AAGCTTCCCT ATGAATATGC CCAGGACGGC ATTTCATATA CTTTCTCCAT AATTCCAAAA GCTCTGGGAA GAGACGAGGA AACCAGAATA AACAATCTTA AATCAATGCT TGACGGATAT TTCAAGCAGG GCGGCCACCA CATAAATGTA AATGTGTTTG AAAAAGAGAC ACTGTTAGAT GCCATGGAAC ATCCGGAAAA ATATCCACAA CTTACCATAA GAGTGTCCGG GTATGCAGTG AACTTTATAA AGCTTACACG GGAGCAACAG CTGGATGTTA TTAACAGAAC GATTCACGGA AAGATTTAA
|
Protein sequence | MDAWRGFNKG NWCQEIDVRD FIIRNYTPYE GDESFLVGPT DRTRKLWEKV SELLKKEREN GGVLDVDTHT ISTITSHKPG YIDKELEVIV GLQTDEPLKR AIMPFGGIRM VIKGAEAYGH SVDPQVVEIF TKYRKTHNQG VYDVYTPEMR KAKKAGIITG LPDAYGRGRI IGDYRRVALY GVDRLIAEKE KEMASLERDY IDYETVRDRE EISEQIKSLK QLKEMALSYG FDISCPAKDA REAFQWLYFA YLAAVKEQNG AAMSIGRIST FLDIYIERDL KEGKLTEELA QELVDQLVIK LRIVRFLRTP EYEKLFSGDP TWVTESIGGM ALDGRTLVTK SSFRFLHTLF NLGHAPEPNL TVLWSVNLPE GFKKYCAKVS IHSSSIQYES DDIMRKHWGD DYGIACCVSA MRIGKQMQFF GARCNLAKAL LYAINGGKDE MTGEQIAPMF APVETEYLDY EDVMKRFDMV LDWVARLYMN TLNIIHYMHD KYAYEALQMA LHDKDVFRTM ACGIAGLSVV ADSLSAIKYA KVKPIRNENN LVVDYEVEGD YPKFGNNDER VDEIAVQVVK MFMNKLRKQR AYRSATPTLS ILTITSNVVY GKKTGNTPDG RKAGEPLAPG ANPMHGRDIN GALAVLNSIA KLPYEYAQDG ISYTFSIIPK ALGRDEETRI NNLKSMLDGY FKQGGHHINV NVFEKETLLD AMEHPEKYPQ LTIRVSGYAV NFIKLTREQQ LDVINRTIHG KI
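The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the pipeline that produced this record. It assumes the sequences are split into 10-character blocks separated by whitespace, as shown above, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and `gc_fraction` assumes GC content means the share of G and C bases.

```python
# Standard codon assignments, built with the usual TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    first_blocks = "ATGGATGCAT GGCGCGGATT TAATAAAGGC"
    print(translate(first_blocks))    # MDAWRGFNKG
    print(gc_fraction("ATGGATGCAT"))  # 0.4
```

The first 30 bases translate to MDAWRGFNKG, which matches the first block of the protein sequence. Passing the full gene sequence to the same functions would allow the complete protein sequence and the reported GC content to be checked in the same way.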
|
| |