Gene Cthe_3168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3168 
Symbol 
ID4809618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3743985 
End bp3744863 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content42% 
IMG OID640108601 
Productputative lipid kinase 
Protein accessionYP_001039556 
Protein GI125975646 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID[TIGR00147] lipid kinase, YegS/Rv2252/BmrU family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAG CTTTATTGGT TTACAATCCT TTTTCGGGAG ACAGGGGAAT AGCAAATAAA 
TTGGATTATA TTCTGGAGAG ATTTCAGCAA AAAGATATTT TGCTTCAGCC TTACAGGATT
ATGGAAGGAT GCGGTAAGAA CATCTCCGAC CTTTTGACAG AAGGTTCCTA TAAGTTTGTA
ATATCCTCCG GAGGGGACGG TACGCTGAAC TTTATCTGTA ATATTATTAT GAAGAATAAC
CTCTCTGTGC CCATGGGAAT AATTCCTGCA GGAACCTGCA ATGATTTTGC TTCGATATTG
AATATCCCGA CATCGGTTGA AGAGTGCGTG GACATCATTT TAAAAGGCAG GACGGTGGAT
GTGGACGTAG GGGTTGTGGA TGACAGGATT TATTTTTTGA GCTCTTGCGC CGGAGGTGTT
TTTGTGGATG TTTCTTTCAG TACGGACGGC GAGCTTAAAA AGAACCTGGG TGCCCTGGCT
TATTATCTGA AGGCGCTTAC CGAAATGGCA AGCATGAAGC CCTTTAGAGT AACCATTGAA
ACCGAAGAGG AAATTTTTGA AGATGACATA CTTCTTTTCT GCATTCTGAA CGGCAACCAG
GCCGGCGGCT TTCACAACCT TATGGACGCG GTTTACGATG ACGGGCTTAT GGATATTGTC
ATTATCAAAG ACTGCAGAAA AATAGAACTT CCGGCTATTT TCTATAAAGT TATAAACAAT
GAGCTGCAAA ACGACAAGAA TGTGGTTACC ATAAGAACGA ACCGGTGTAC CATAAAGAGC
TCGAAAGAAA TAGTACTTAG CATTGACGGG GAAAAGGGAC CGACCCTGCC GGTCGAGGTA
AAATTTATAA ACAAGGCATT AAAAGTATTT GCGGCATAG
 
Protein sequence
MDKALLVYNP FSGDRGIANK LDYILERFQQ KDILLQPYRI MEGCGKNISD LLTEGSYKFV 
ISSGGDGTLN FICNIIMKNN LSVPMGIIPA GTCNDFASIL NIPTSVEECV DIILKGRTVD
VDVGVVDDRI YFLSSCAGGV FVDVSFSTDG ELKKNLGALA YYLKALTEMA SMKPFRVTIE
TEEEIFEDDI LLFCILNGNQ AGGFHNLMDA VYDDGLMDIV IIKDCRKIEL PAIFYKVINN
ELQNDKNVVT IRTNRCTIKS SKEIVLSIDG EKGPTLPVEV KFINKALKVF AA