Gene Information |
Locus tag | Cthe_2792 |
Symbol | |
ID | 4810109 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3292404 |
End bp | 3293726 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640108212 |
Product | phenylacetate--CoA ligase |
Protein accession | YP_001039184 |
Protein GI | 125975274 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1541] Coenzyme F390 synthetase |
TIGRFAM ID | |
|
|
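The coordinate, gene length, and protein length fields above are mutually consistent. A minimal sketch of the check follows, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. The helper names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Cthe_2792.
length_bp = gene_length(3292404, 3293726)
print(length_bp)                  # 1323
print(protein_length(length_bp))  # 440
```

Both results agree with the Gene Length (1323 bp) and Protein Length (440 aa) fields of the record.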
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTATGA GAAGATATTG GAATGAAGAA ATAGAGACCA TGTCAAGAAA GGACCTGGAG GATTATCAGT TTAGGCTTTT ATCGGAGCAT CTTGCGCTGG CATACGAAAA ATCTCAATAT TACAGACAGT CTTTTGACGA GGCGGGGGTA AAACCGTCGG ATTTTAAAAA GCTTTCTGAC ATTAGCAAAT TTCCTTTTGT GAACAAACAT ATAGAGCGGG AAAGACAGCA AAAAAAGCCT TTGCTTGGCG ACATGACGGC TGTGGCCGAG GAGGAAGTGG TGTTTGTATC CGCTTCCAGC GGCTCAACGG GAGTTCCTAC GCTAAGTCCC TTTACAAAGA AGGATTTTGA AGAATTTCAG GATGTTCAAA GCAGGTTGTT TTGGGCGGCA GGAATGAGAC CCAACGACCG TTATGTTCAT GCCCTCAATT TCACATTATT TGTGGGAGGT CCGGACGTTA TAGGCGCTCA AAATCTAGGG GCTTTGTGCA TTTGGGCAGG AGCCATTCCT TCCGACAGGC TGCTCTTTAT CCTTAAAGAG TTTCAGCCTA CCGTTATATG GACGACACCT TCCTATGCAT GGTACCTGGG GGAAACTGCG AAAAAACAGG GAATTGACCC TGCAAAGGAC CTTTCCATCA ACAAAATCAT TGTGGCAGGA GAGCCGGGAG GCTCTATTGA TGCCACAAGG CAAGCCATTG AGGAGCTTTG GGATGCAAAA GTCTACGATT TCTACGGAAT TTCGGACATT TTCGGAGCAT GCGCGGGAAT GTGCAGCGAG AGAAACGGTC TTCATTTGGT GGAGGACCAT ATTCTGGTTG AAGTAATCAA TCCCGATACT TTAGAGCCGG TTGCGGAAGG AGAAAGAGGG GAACTGGTAT TTACCACTTT AAGAAAAACT GCAAGGCCGA TGATTCGATT CCGGACGGGA GATATCGGCA CGGTAAACAG GGAGAAATGC GCCTGCGGAC GTACCCATGC CCGCATAAAC ATTACAGGGC GCCTGGATGA TATGCTGATT GTATCTGGAG TAAATGTGTT CCCCAGTGAT ATTGAGTATG TTGTACGCAA CATGGAAGAA CTTTCGGGAG AATACAGGAT TACTGCCATA ACAGAAAACT TTACCACAAA ATTTAAGCTT GAAGTGGAGA GGGCGCTCGG AAACCAGGAG CCCAAAGAAG TGCTTGCAGA GAAAGTATCA GCCAGAATAA AGGCGCGCTT AGGTGTCAGG CCAAGAGAAG TCATTGTTCT GGAGAACGGT GAACTTCCCA GGGCCACCCA CAAAGCAAAA AGGTTGATTG ATGAGAGAAA CGGGGGATTT TAA
|
Protein sequence | MSMRRYWNEE IETMSRKDLE DYQFRLLSEH LALAYEKSQY YRQSFDEAGV KPSDFKKLSD ISKFPFVNKH IERERQQKKP LLGDMTAVAE EEVVFVSASS GSTGVPTLSP FTKKDFEEFQ DVQSRLFWAA GMRPNDRYVH ALNFTLFVGG PDVIGAQNLG ALCIWAGAIP SDRLLFILKE FQPTVIWTTP SYAWYLGETA KKQGIDPAKD LSINKIIVAG EPGGSIDATR QAIEELWDAK VYDFYGISDI FGACAGMCSE RNGLHLVEDH ILVEVINPDT LEPVAEGERG ELVFTTLRKT ARPMIRFRTG DIGTVNREKC ACGRTHARIN ITGRLDDMLI VSGVNVFPSD IEYVVRNMEE LSGEYRITAI TENFTTKFKL EVERALGNQE PKEVLAEKVS ARIKARLGVR PREVIVLENG ELPRATHKAK RLIDERNGGF
|
| |
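The protein sequence can be related back to the gene sequence through translation table 11, the bacterial, archaeal and plant plastid code. The sketch below is a small hand-rolled translator rather than the pipeline that produced this record. It assumes that table 11 uses the same amino acid assignments as the standard code, and that an alternative start codon such as GTG is read as methionine when it is the first codon. The function name `translate_table11` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in TCAG order; table 11 shares these with the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons permitted by table 11; at position 1 they are translated as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a CDS given 5'->3' on the coding strand; stops at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_table11("GTGAGTATGA GAAGATATTG GAATGAAGAA"))  # MSMRRYWNEE
```

The output matches the first ten residues of the protein sequence, including the leading M encoded by the GTG start codon.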