Gene Cthe_0798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0798 
Symbol 
ID4810416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp964391 
End bp965977 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content42% 
IMG OID640106215 
Productlipolytic enzyme, G-D-S-L 
Protein accessionYP_001037226 
Protein GI125973316 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAAT CGAAAAATTT AGCAGCAAAA GTATTATCAA CTCTATTAAT CTTCATTACG 
GTTATATCGT TAATAAATAT GGAAGCACCG GCTGCGTCAA AAACGATAAA AATCATGCCT
GTCGGAGATT CCTGCACCGA AGGTATGGGT GGAGGAGAGA TGGGTTCTTA CCGTACGGAA
TTGTACAGAC TGTTGACACA AGCCGGGCTC AGTATTGACT TTGTTGGTTC GCAAAGAAGC
GGGCCAAGCA GTTTGCCCGA CAAAGATCAT GAAGGTCACT CAGGATGGAC GATTCCCCAG
ATAGCAAGCA ATATCAACAA CTGGCTTAAT ACCCATAACC CCGACGTGGT ATTTCTGTGG
ATTGGAGGAA ATGACCTGCT TTTGAACGGA AACTTGAACG CAACAGGCCT TAGTAATCTT
ATAGACCAGA TTTTCACGGT GAAACCCAAT GTAACACTGT TTGTGGCCGA TTATTATCCG
TGGCCTGAAG CGATTAAGCA ATACAATGCG GTGATTCCGG GAATAGTTCA GCAGAAGGCC
AATGCCGGCA AGAAAGTTTA TTTTGTAAAG CTTAGTGAGA TTCAGTTTGA CAGGAACACC
GATATTTCAT GGGATGGTTT GCACTTGAGC GAAATAGGAT ACAAAAAGAT TGCAAATATT
TGGTACAAGT ATACGATTGA CATACTGAGA GCTTTGGCTG GAGAAACACA GCCAAACCCA
AGTCCAAGCT CAACTCCGAA TACGACGAAA ACGATAAAAA TCATGCCTGT CGGAGATTCC
TGCACCGAAG GTATGGGTGG AGGAGAGATG GGTTCTTACC GTACGGAATT GTACAGACTG
TTGACACAAG CCGGGCTCAG TATTGACTTT GTTGGTTCGC AAAGAAGCGG GCCAAGCAGT
TTGCCCGACA AAGATCATGA AGGTCACTCA GGATGGACGA TTCCCCAGAT AGCAAGCAAT
ATCAACAACT GGCTTAATAC CCATAACCCG GATGTGGTAT TTCTGTGGAT TGGAGGAAAT
GACCTGCTTT TGAGCGGAAA CGTGAATGCA ACAGGCCTTA GTAATCTTAT AGACCAGATT
TTCACAGTGA AACCCAATGT AACACTGTTT GTGGCCGATT ATTATCCGTG GCCTGAAGCG
GTCAAGCAAT ACAATGCGGT GATTCCGGGA ATAGTTCAAC AGAAGGCCAA TGCCGGCAAG
AAAGTTTATT TTGTAAAGCT TAGTGAGATT CAGTTTGACA GGAACACCGA TATTTCATGG
GATGGTTTGC ACTTGAGCGA AATAGGATAC ACAAAGATTG CAAATATTTG GTACAAGTAT
ACGATTGACA TACTAAAAGC TTTGGCAGGA CAAACGCAGC CAACTCCAAG TCCGTCTCCG
ACTCCCACAG ATTCTCCTCT GGTTAAAAAA GGTGATGTTA ATTTGGACGG TCAGGTCAAT
TCGACAGATT TCAGCCTTTT GAAAAGATAT ATACTGAAAG TTGTGGATAT AAATTCAATA
AATGTGACAA ATGCTGATAT GAACAATGAT GGCAATATCA ACTCTACAGA CATTTCAATA
CTAAAGAGAA TACTTCTTAG AAATTAG
 
Protein sequence
MYKSKNLAAK VLSTLLIFIT VISLINMEAP AASKTIKIMP VGDSCTEGMG GGEMGSYRTE 
LYRLLTQAGL SIDFVGSQRS GPSSLPDKDH EGHSGWTIPQ IASNINNWLN THNPDVVFLW
IGGNDLLLNG NLNATGLSNL IDQIFTVKPN VTLFVADYYP WPEAIKQYNA VIPGIVQQKA
NAGKKVYFVK LSEIQFDRNT DISWDGLHLS EIGYKKIANI WYKYTIDILR ALAGETQPNP
SPSSTPNTTK TIKIMPVGDS CTEGMGGGEM GSYRTELYRL LTQAGLSIDF VGSQRSGPSS
LPDKDHEGHS GWTIPQIASN INNWLNTHNP DVVFLWIGGN DLLLSGNVNA TGLSNLIDQI
FTVKPNVTLF VADYYPWPEA VKQYNAVIPG IVQQKANAGK KVYFVKLSEI QFDRNTDISW
DGLHLSEIGY TKIANIWYKY TIDILKALAG QTQPTPSPSP TPTDSPLVKK GDVNLDGQVN
STDFSLLKRY ILKVVDINSI NVTNADMNND GNINSTDISI LKRILLRN