Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2353 |
Symbol | |
ID | 4808987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2806134 |
End bp | 2807282 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107760 |
Product | PGAP1-like protein |
Protein accession | YP_001038748 |
Protein GI | 125974838 |
COG category | [R] General function prediction only |
COG ID | [COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.808395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTGC GCCAAGTACG TAACCGCAAA ATTTTTATAA TGATTGCCCT GTCACTTTTT CTTTTATTTG TGCATTTCAG GCAAATTAAT GCTTTCTTTA TTGGTGCCTT TTACAATAAA AACAATCTGA ACATTTACCT TGACCTTATA AACAGCCAGG TTCACACAAT TTCATCAGAG CCGTCAAACA AAGTATATCT AAAAGCTGTA GTTAAAGATG CAAACGGAAA ACTCATTCCC CATATTGAAG TTAACTTTGA AGCTTCAAAA GGCATGGGTA CGGTACAGCC TGCAAAAGCC GCAACCGACA GCCGGGGTGA ATGTTTTGTC ACTTACGTCC CCGAATACTA TTACAACCTT AGTCCTGATG CGAATCCACG GCATGTTGTC ATTACTGCTT CAATTGCCGG CACCGACACA AACTCAACGG TAAAACTGAA CCTTGTTCCG GCACCCGTCG TCTTTGTGCA TGGATACAGG GAAACTGCGG ATGTTTTTGA CAATTTGAAT GAATTTATTT CATCAAAAGG GTATACTTGC ATTTCCCTGA ATTATGACTC AACTTTGGGA ATAGAGCATA GTGCCAAAGA ACTGGAGCTG TTTTTGCAAA AGCAAAAAAA GGATTTTTTA AGTCAGGGAA TCCTTGTAAA CAAATTTGAC CTGATTACCC ACAGTATGGG GGGATTGGTG GCAAGGTACT ATTCAGCAAG TCAGAACTAT CTTAAAAATG ACGATATCAA TAAAATAATT TTTCTTTCAG TGCCTCACAA AGGCTCGGTT TTGGCATCAA TAGGCGAGGA ATATTTCAAA GACAAATCTA TTAAAGAACT GGTTCCTGAC AACGAATTGT TCGTAAGCAT ATTCCCCAAT ACAATTAACG GCGGGCTCAA CAATTCAATA CAAACAGGTA ATCTTTTAAG CCAGTACGAT GAAGTGGTTA CAAATGAAAG TGCCGCTCTT GACAAATGGG GGATTAAGAC TGAAATATTC AACGTGGGGG AAAACAGTTT CACTGTGCAC AATCTGCTAA GCGGCAACAT TCTTGATGCT CCGAACCATA AAGGCATATT AAACAACAGC ACAGTCTTTA ACCGCATCGC TGAAATGTTA AATACCAATC TTCCTTATCC TGCCGTTATA AACAAATAA
|
Protein sequence | MSVRQVRNRK IFIMIALSLF LLFVHFRQIN AFFIGAFYNK NNLNIYLDLI NSQVHTISSE PSNKVYLKAV VKDANGKLIP HIEVNFEASK GMGTVQPAKA ATDSRGECFV TYVPEYYYNL SPDANPRHVV ITASIAGTDT NSTVKLNLVP APVVFVHGYR ETADVFDNLN EFISSKGYTC ISLNYDSTLG IEHSAKELEL FLQKQKKDFL SQGILVNKFD LITHSMGGLV ARYYSASQNY LKNDDINKII FLSVPHKGSV LASIGEEYFK DKSIKELVPD NELFVSIFPN TINGGLNNSI QTGNLLSQYD EVVTNESAAL DKWGIKTEIF NVGENSFTVH NLLSGNILDA PNHKGILNNS TVFNRIAEML NTNLPYPAVI NK
|
| |