Gene Cthe_2055 details

Gene Information

Locus tag: Cthe_2055
Symbol:
ID: 4810651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2444007
End bp: 2445926
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 37%
IMG OID: 640107460
Product: hypothetical protein
Protein accession: YP_001038455
Protein GI: 125974545
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID:
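
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted as a residue.

```python
# Values copied from the record above.
start_bp = 2444007
end_bp = 2445926

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1920 639"
```

Under those assumptions the computed values agree with the listed gene length (1920 bp) and protein length (639 aa).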


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGACAGTG TGCTTTCGGT TTTAAGTCCG GAAAACATTC AAAAGAATGA ATATGAAGAA 
ATAATAAGAG ACATTCGCAG TCAATTGCAA TCTCTTTCTG ACAGCCGAAG ATTCAGACAG
TTGCTGGACA AATACGCGGA CAAGCCTTAT TATTTTAAAA GAAACGGAAA TTATCTGGAA
AATAAAAGAT GTACTCCTAC GATAAGAGAT GCGGCTTGTT TTTATGTATT AATCAAATAT
CTTGAATATT ATAAATTGAA AAACAATCCC GACGATTTGG AGAAAATTTT CTCAAACAGC
AGTTTTGATG CCGGAAAGTA TTACATGGAA TTCTTTGGCG ATGACTGGGA CGAAGAAAAG
CAGTTCATTT ATAGTTTCAT ATGCAATGAA CCCGATATGA ATTCAAATTT GTTGTCTGAT
ATAGAAATTG ATATAGTTTC GGGTTCTGTT TACAAGATTA AGAAGTATTT TCTTGAAAAT
TCCAGAATAA AAGATATCCG GGGAGGAAGT GCATTAATTA AATATGTAAA TGAAGATGTC
ACGTTTGATT ATCTGAGAGA CAATTATACG GAGGAATGCG CTGTATACTG CGGAGGAGGA
AATGTGCTGA TAATAGCACC CGGCGGTGCC GGAGAAAAGA TATGCTCGGC TCTTGAAGAA
AAATATACAA GAATTACTCT TACGGCACAG AATGCCTTTG AATACGTCAG TACAAACCTT
AATACTTTCA TTAAGCATTA TAAAAACATA ATGGGAGATT TAAACCAGAA GCTTGATGCC
CGGAAAAAAC TGAAGATTTA CAGTATAAAT CCTGACGGCA GGCTTGAAAC GATAGAAATG
GGAAAAGAGA AAATAAGTTT TGACGATGTT GAGGAAATAA AACAAAGAGG AACAGTATGT
TACCTTTGTG GTGTTCGGGA CGGAAGATAC AAAATTAAAA TGCTTGATGT GGAAACAGCG
ATGGTTTGCC TCTCATGTTT AAAAAAGCAC AAGGTTGGGA AAGACAAGAC GGTGTTTTAC
GATGAGTATG AGGAATTTAC CGGTTTCGCT GTAAAAAAGA AGATCGACAG TATTATTGAG
CTCGAAGATG AAAACGGGCA TATAGCTGTT ATATATGCAG ATGGAAATAA CATGGGAAAT
GTGGTTAAAA ATATTGAAAC TCCTTTTCAG CATATGTACT TCAGCCGTGC TTTGGACAGA
ATTACAAAAA GATGTGTTTA TCAATCTATA AATGAAGTGA TGGGAAATGA TGCAATGTTT
GAGGCAATAG CTCTGGGCGG TGATGATATA TTTATTATTG TCCCGGGCAA CAGAAGTTTG
GAAATCACAA ACAAAATTAT TGAAAAGTTC GACGGCGCTT TTGAAAATAA AATGACCATG
TCTGCCGGAA TATGTATTGC CAAATCCAGC ACTCCGATAA GGACTTTGTT TGAAATTGCA
CAGTATATGC TTAAAAGTGC AAAAAGGTAT TCAAGGAAAA ACAACAGTTC TGAAGGAACA
GTGGATGTTC AGTTTATTCG CAGCAATGTC GGTGTCGATT TGCTGGAATC CGAAAGCAGT
TTGTTCCCTG CTGCAAATTC TGAACTTTCA GCCTACCTGG ATATTATCAG AAGGCTTAAA
AACGACGTAA ATATAAAAAC TGCCCAATTA TACAAATTCA GTAATGCATG GCGCATATTA
AAAAATCCAA TGGAATTCCA GTTATTTTAT CTTTACCAGA CAGGCAGGCT ATCTTGCAAA
TATAATGACT ATGCCATGGA ATTCCTTGGC AATATGAAAA ATGTTGACAA GGATGCTTAT
TGTTATTGCG GACTTGTAAA GAAAAAGCCG GGTTATGCGG GTTATGATTC TGTAAAAGGA
AACGATTATG TATCTTTGTG GGATGACGTC ATCCTTTTAA TGGATGCGGT AGGGAGGTGA
 
Protein sequence
MDSVLSVLSP ENIQKNEYEE IIRDIRSQLQ SLSDSRRFRQ LLDKYADKPY YFKRNGNYLE 
NKRCTPTIRD AACFYVLIKY LEYYKLKNNP DDLEKIFSNS SFDAGKYYME FFGDDWDEEK
QFIYSFICNE PDMNSNLLSD IEIDIVSGSV YKIKKYFLEN SRIKDIRGGS ALIKYVNEDV
TFDYLRDNYT EECAVYCGGG NVLIIAPGGA GEKICSALEE KYTRITLTAQ NAFEYVSTNL
NTFIKHYKNI MGDLNQKLDA RKKLKIYSIN PDGRLETIEM GKEKISFDDV EEIKQRGTVC
YLCGVRDGRY KIKMLDVETA MVCLSCLKKH KVGKDKTVFY DEYEEFTGFA VKKKIDSIIE
LEDENGHIAV IYADGNNMGN VVKNIETPFQ HMYFSRALDR ITKRCVYQSI NEVMGNDAMF
EAIALGGDDI FIIVPGNRSL EITNKIIEKF DGAFENKMTM SAGICIAKSS TPIRTLFEIA
QYMLKSAKRY SRKNNSSEGT VDVQFIRSNV GVDLLESESS LFPAANSELS AYLDIIRRLK
NDVNIKTAQL YKFSNAWRIL KNPMEFQLFY LYQTGRLSCK YNDYAMEFLG NMKNVDKDAY
CYCGLVKKKP GYAGYDSVKG NDYVSLWDDV ILLMDAVGR
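
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a minimal sketch, assuming the NCBI codon ordering (bases in the order T, C, A, G) and the amino-acid assignments shared by tables 1 and 11, the helper below translates a whitespace-separated DNA string and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not handle alternative start codons, which table 11 permits; this gene starts with ATG, so that case does not arise here.

```python
from itertools import product

# NCBI ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
# Amino-acid assignments for translation table 11 (identical to table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGACAGTG TGCTTTCGGT TTTAAGTCCG"))  # prints "MDSVLSVLSP"
```

The output matches the first ten residues of the protein sequence. Applied to the final twelve bases of the gene (GTAGGGAGGTGA), the same function returns VGR, the last three residues, with the terminal TGA read as the stop codon.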