Gene Information |
Locus tag | Cthe_0798 |
Symbol | |
ID | 4810416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 964391 |
End bp | 965977 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106215 |
Product | lipolytic enzyme, G-D-S-L |
Protein accession | YP_001037226 |
Protein GI | 125973316 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAAAT CGAAAAATTT AGCAGCAAAA GTATTATCAA CTCTATTAAT CTTCATTACG GTTATATCGT TAATAAATAT GGAAGCACCG GCTGCGTCAA AAACGATAAA AATCATGCCT GTCGGAGATT CCTGCACCGA AGGTATGGGT GGAGGAGAGA TGGGTTCTTA CCGTACGGAA TTGTACAGAC TGTTGACACA AGCCGGGCTC AGTATTGACT TTGTTGGTTC GCAAAGAAGC GGGCCAAGCA GTTTGCCCGA CAAAGATCAT GAAGGTCACT CAGGATGGAC GATTCCCCAG ATAGCAAGCA ATATCAACAA CTGGCTTAAT ACCCATAACC CCGACGTGGT ATTTCTGTGG ATTGGAGGAA ATGACCTGCT TTTGAACGGA AACTTGAACG CAACAGGCCT TAGTAATCTT ATAGACCAGA TTTTCACGGT GAAACCCAAT GTAACACTGT TTGTGGCCGA TTATTATCCG TGGCCTGAAG CGATTAAGCA ATACAATGCG GTGATTCCGG GAATAGTTCA GCAGAAGGCC AATGCCGGCA AGAAAGTTTA TTTTGTAAAG CTTAGTGAGA TTCAGTTTGA CAGGAACACC GATATTTCAT GGGATGGTTT GCACTTGAGC GAAATAGGAT ACAAAAAGAT TGCAAATATT TGGTACAAGT ATACGATTGA CATACTGAGA GCTTTGGCTG GAGAAACACA GCCAAACCCA AGTCCAAGCT CAACTCCGAA TACGACGAAA ACGATAAAAA TCATGCCTGT CGGAGATTCC TGCACCGAAG GTATGGGTGG AGGAGAGATG GGTTCTTACC GTACGGAATT GTACAGACTG TTGACACAAG CCGGGCTCAG TATTGACTTT GTTGGTTCGC AAAGAAGCGG GCCAAGCAGT TTGCCCGACA AAGATCATGA AGGTCACTCA GGATGGACGA TTCCCCAGAT AGCAAGCAAT ATCAACAACT GGCTTAATAC CCATAACCCG GATGTGGTAT TTCTGTGGAT TGGAGGAAAT GACCTGCTTT TGAGCGGAAA CGTGAATGCA ACAGGCCTTA GTAATCTTAT AGACCAGATT TTCACAGTGA AACCCAATGT AACACTGTTT GTGGCCGATT ATTATCCGTG GCCTGAAGCG GTCAAGCAAT ACAATGCGGT GATTCCGGGA ATAGTTCAAC AGAAGGCCAA TGCCGGCAAG AAAGTTTATT TTGTAAAGCT TAGTGAGATT CAGTTTGACA GGAACACCGA TATTTCATGG GATGGTTTGC ACTTGAGCGA AATAGGATAC ACAAAGATTG CAAATATTTG GTACAAGTAT ACGATTGACA TACTAAAAGC TTTGGCAGGA CAAACGCAGC CAACTCCAAG TCCGTCTCCG ACTCCCACAG ATTCTCCTCT GGTTAAAAAA GGTGATGTTA ATTTGGACGG TCAGGTCAAT TCGACAGATT TCAGCCTTTT GAAAAGATAT ATACTGAAAG TTGTGGATAT AAATTCAATA AATGTGACAA ATGCTGATAT GAACAATGAT GGCAATATCA ACTCTACAGA CATTTCAATA CTAAAGAGAA TACTTCTTAG AAATTAG
|
Protein sequence | MYKSKNLAAK VLSTLLIFIT VISLINMEAP AASKTIKIMP VGDSCTEGMG GGEMGSYRTE LYRLLTQAGL SIDFVGSQRS GPSSLPDKDH EGHSGWTIPQ IASNINNWLN THNPDVVFLW IGGNDLLLNG NLNATGLSNL IDQIFTVKPN VTLFVADYYP WPEAIKQYNA VIPGIVQQKA NAGKKVYFVK LSEIQFDRNT DISWDGLHLS EIGYKKIANI WYKYTIDILR ALAGETQPNP SPSSTPNTTK TIKIMPVGDS CTEGMGGGEM GSYRTELYRL LTQAGLSIDF VGSQRSGPSS LPDKDHEGHS GWTIPQIASN INNWLNTHNP DVVFLWIGGN DLLLSGNVNA TGLSNLIDQI FTVKPNVTLF VADYYPWPEA VKQYNAVIPG IVQQKANAGK KVYFVKLSEI QFDRNTDISW DGLHLSEIGY TKIANIWYKY TIDILKALAG QTQPTPSPSP TPTDSPLVKK GDVNLDGQVN STDFSLLKRY ILKVVDINSI NVTNADMNND GNINSTDISI LKRILLRN
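The numeric fields in this record can be cross-checked against one another. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions are consistent with the values listed above. The function names are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the record above.
length = gene_length(964391, 965977)
protein = expected_protein_length(length)
print(length, protein)  # 1587 528
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the listed GC content; the sequence is not repeated here for brevity.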