Gene Cthe_3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3141 
Symbol 
ID4809704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3709573 
End bp3712068 
Gene Length2496 bp 
Protein Length831 aa 
Translation table11 
GC content43% 
IMG OID640108574 
Productlipolytic enzyme, G-D-S-L 
Protein accessionYP_001039529 
Protein GI125975619 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGGAAAA GAAAATTAGG TTTTTCGGCT GTTATTTTTA TTACGATAAT AGCAATATTC 
GGATGTTTTT TAGACTTGCG TATCAGTGCA GCCCCGAATG AATACAAGTT TGATTTTGGA
GCCGGTCCTG TAGAGCCGGG TTACATAGGT GTCAGTGCTT CTACGGCTTA TAGCAAGTCA
AGAGGCTATG GTTTTAATAC TCCCTGGAAT ATGAGGGATG TTGCGGCATC AGGCAGCGGT
CTTACAAGTG ATGCTGTTCA ATTTCTGACA TATGGTACAA AGAGTGAAAA TACTTTTAAT
GTCGACCTTG ACAACGGTCT GTATGAGGTA AAAGTCACTT TGGGAAACAC TTCGAGGGCA
AGTGTGGCGG CAGAAGGTGT TTATCAAATA ATAAATATGA CCGGTAACTG TGCAACGGAC
AAGTTTCAAA TTCCTATTAC GGACGGACAG TTAAATATAC TGGTGACTGC GGGAAAAGAA
GGTACACCTT TTACCCTTAG TGCCCTTGAA ATAAGAAAAA TATCGGATGT TCCCGTAACA
AACAGAACTA TATATATCGG AGGAGATTCG ACAGTCTGCA ACTATTATCC GCTGGATACA
AGTGCTCAGG CCGGTTGGGG ACAGATGCTG CATAAATTCG TAGACACCAA TGTTTTTCAA
ATTAGAAATA TGGCTTCATC AGGCCAATTT GCAAGAGGTT TCAGGGATGA CGGACAGTTT
GAGGCAATAA TGAAGTACCT CAAGCCCGGT GACATATTCA TATTGCAATT TGGTATAAAT
GATACCAATT CCAAAAACTC AACAACCGAA GCGCAATTTA AGGAAATTAT GACCGACATG
GTGGTAAAGG CCAAAGCTAC CGGTGCAACT GTGGTATTGT CGACACCACA AGGCCGGGCA
ACCGACTTTA ATTCATCAAA TGTGCATGAT TCCCAAGGCA GATGGTACAG AAGGGCTACT
ATAGAAGTGG CAAGAGAACA AGGTGTCCGA TTGGTGGACC TTAATGTATT AAGCTCGGCA
TATTTCACAT CAATAGGTCC GGAAGCAACG CTTGCATTAT ATATGCCGGG AGACACCTTA
CACCCGAATC GAGAGGGAGC TACCCAGCTT GCCCGTATTG TAGCGGAGGA ATTGGCAGAT
CTCTTAAAAG CACCTGTTGC AACTCCTACA AGTGGGCCCT CAGCAACTCC CACACCCATT
CCAAACAGCT TTATTTACGG CGATGTTAAT GGCAATGGTT CTATAGAGTC CACAGACTGT
GTATGGGTGA AAAGATATTT ATTGAAGCAA ATAGATTCTT TCCCAAACGA AAACGGTGCC
AGAGCGGCTG ATGTAAACGG AAACGGCACA ATAGACTCCA CGGACTATCA ACTTCTGAAA
AGATTCATAT TAAAAGTTAT TAATGAGTTT CCGGTGCAAA AGCAAAAAAA TGAACCGGTA
ATATATCAGG CTGAGGATGC GATAATATAC AATGCCATTC TGGAAACCGT CAATGCAGGG
TATACGGGAA GCTGTTATGT GAATTATCAC AATGAAGTCG GAGGCTATAT TGAGTGGAAT
GTTAATGCAC CGTCTTCAGG CTCATATGCC CTTATATTCA GGTATGCAAA CGGAACAACT
GCCAACAGAC CTATGCGGAT AACGGTTAAC GGTAATATAG TTAAGCCGAG CATGGATTTT
GTTTCAACAG GGGCATGGAC CACTTGGAAC GAAGCTGGTA TAGTGGCAAA TCTTAATCAA
GGGAATAACG TTATCAGGGC AACAGCTATT GCATCCGATG GCGGTCCAAA TGTTGACTAT
CTTAAAGTAT TTTCGGCAAA TGCCTTTCAG CCGGTATCTG AAGAAAAAAT TACAATTTAC
ATAGCCGGCG ATTCTACTGT ACAGACATAT AATGCATCAT ATGCGCCGCA AGCCGGATGG
GGACAGTTTT TAGGCCAATA TTTTACTTCC AACGTTGTTA TTGAAAACAG GGCAATTGCG
GGAAGAAGTT CCAGAAGCTT TGTGGAAGAA GGAAGATTGG ACAGCATTCT GAGTGTCATA
AAGCCCGGCG ACTATTTATT TATTCAGTTT GGACATAACG ATGCGGACAT AAGCAAGCCC
GAACGCTATT CCGCTCCGTA TACAACATAC AAGGAGTATC TTCGCAAATA CGTGGACGGG
GCCCGGCAAA AAGGCGCAAT ACCGGTATTG ATAACTCCAG TCGCAAGGCT TAATTATAAA
AACAATGCCT TTGTCAATGA TTTTCCGGAT TATTGCACGG CTATGAAGCA GGTGGCCGAA
GAGAAGAATG TAAAGCTCAT TGACCTTATG ACAAAGAGCC TGAATTACTA TAATTCAATA
GGCTATAATG AAACATACAA ACTTTTTATG GTATCTGTGA ACAATACGGA TTATACGCAT
TTTACGGAGA AGGGAGCCCA GCAGATTGCC CGGTTGGTGG CACAAGGCGT TAAAGAGGCA
AATTTGGATA TTGCGAAGTA TTTGAAAACC AATTAA
 
Protein sequence
MRKRKLGFSA VIFITIIAIF GCFLDLRISA APNEYKFDFG AGPVEPGYIG VSASTAYSKS 
RGYGFNTPWN MRDVAASGSG LTSDAVQFLT YGTKSENTFN VDLDNGLYEV KVTLGNTSRA
SVAAEGVYQI INMTGNCATD KFQIPITDGQ LNILVTAGKE GTPFTLSALE IRKISDVPVT
NRTIYIGGDS TVCNYYPLDT SAQAGWGQML HKFVDTNVFQ IRNMASSGQF ARGFRDDGQF
EAIMKYLKPG DIFILQFGIN DTNSKNSTTE AQFKEIMTDM VVKAKATGAT VVLSTPQGRA
TDFNSSNVHD SQGRWYRRAT IEVAREQGVR LVDLNVLSSA YFTSIGPEAT LALYMPGDTL
HPNREGATQL ARIVAEELAD LLKAPVATPT SGPSATPTPI PNSFIYGDVN GNGSIESTDC
VWVKRYLLKQ IDSFPNENGA RAADVNGNGT IDSTDYQLLK RFILKVINEF PVQKQKNEPV
IYQAEDAIIY NAILETVNAG YTGSCYVNYH NEVGGYIEWN VNAPSSGSYA LIFRYANGTT
ANRPMRITVN GNIVKPSMDF VSTGAWTTWN EAGIVANLNQ GNNVIRATAI ASDGGPNVDY
LKVFSANAFQ PVSEEKITIY IAGDSTVQTY NASYAPQAGW GQFLGQYFTS NVVIENRAIA
GRSSRSFVEE GRLDSILSVI KPGDYLFIQF GHNDADISKP ERYSAPYTTY KEYLRKYVDG
ARQKGAIPVL ITPVARLNYK NNAFVNDFPD YCTAMKQVAE EKNVKLIDLM TKSLNYYNSI
GYNETYKLFM VSVNNTDYTH FTEKGAQQIA RLVAQGVKEA NLDIAKYLKT N