Gene Cthe_1364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1364 
Symbol 
ID4809359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1657460 
End bp1658638 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content35% 
IMG OID640106788 
Producthypothetical protein 
Protein accessionYP_001037789 
Protein GI125973879 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000024275 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGAAAA ACGCAATTTA TCTCTTTAGT TCAATTCTGA TTGCAAGCCT GGTTATACTT 
ATATCCGGGA GATACCTCGA TGGGATAAGC GGGTCTTACT TGGGAGAACA AAAACAACGG
CTCGAAGTTC AGACAGAGTA TCAAACTGAG CTTGAAAAAG TCCAGGCAGG GAATGTGTAC
GAAAAGCTTG TGAATAAAAA AGAAATCAGT GTTTTGATTA TTGGAGACGA TATTGCTCAA
GGAGGTTTGG AAACCGAGGA CGAAAAGAAA TGGTATAATC TTTTGGCGAA AAGAATAAAA
GAGGAGTATG GAGCTGATTT AACTTGTAAA AATATTGCAA CACCCGGTGG AACAGCATTT
GATGGATGGA TTGACTATAT TACCGACAGA GAAAGGCAAG AGTATGATCT TGTATTTTTA
TGTTTTGGTG CAAACGACGA AAGAGAGATG AATTTCAATC AAAAAGTTTT CGGCGCTATT
GTGGAAGGAT TGATTAGAAA TATAAAGAAA GCAAAAGCGA GTACGGAAAT AATCACGATT
ATTGAGAACA GTATAAGGAG CCAGTCATAT GTGGATACTC TAAAGCAAGT ATCGGAATAT
TACGAAATAA CTTATGCAGA CATAATAAAA GCTTTTATAG ACTCACGGCT GCCGTTTAAT
GATATCACTG AGGATGGCAG AAAACCAAAT GAACAAGGTT ACTCAATTTA CGTCAATACA
ATATTTGATT TAATCAAGTC GAATATTAAC AGCAAAAGAG AACCTGGTTT TGATGGGAAA
AAACCGTTAC TTTATGAGGA AAGCAATGCT TTTGAGAACG GAAAAATTAC AACGGAATTT
TTGACAATTC AAGGTTTTTA TAACAGTGTG GTTGCTTTTG ACAAGATTTT TATGAAGAGT
AGTCACAGTA ATGACTCTAT AACATATGAA GTAAGCAACA GCCATATGCT GGGAGTAACA
TTGATGGCGG GTCCTAATTG TGGAATTGTG GATATATATC TAAATAACAG ATTGATTCAA
ACCTATGATT GTTATGCACC ATACGAAGCT TTGAGGCATG TGTTGATAAG TGATAATATT
GGAATGGGGA CTCATAAAAT AAGAATTGAA GTGTCAAGTA TAAAAAATGC CAAAGCAAGC
AATTCAAATG TTTATATTCA CGGGATAATA ACTAACTAA
 
Protein sequence
MWKNAIYLFS SILIASLVIL ISGRYLDGIS GSYLGEQKQR LEVQTEYQTE LEKVQAGNVY 
EKLVNKKEIS VLIIGDDIAQ GGLETEDEKK WYNLLAKRIK EEYGADLTCK NIATPGGTAF
DGWIDYITDR ERQEYDLVFL CFGANDEREM NFNQKVFGAI VEGLIRNIKK AKASTEIITI
IENSIRSQSY VDTLKQVSEY YEITYADIIK AFIDSRLPFN DITEDGRKPN EQGYSIYVNT
IFDLIKSNIN SKREPGFDGK KPLLYEESNA FENGKITTEF LTIQGFYNSV VAFDKIFMKS
SHSNDSITYE VSNSHMLGVT LMAGPNCGIV DIYLNNRLIQ TYDCYAPYEA LRHVLISDNI
GMGTHKIRIE VSSIKNAKAS NSNVYIHGII TN