Gene Cthe_0143 details

Gene Information

Locus tag: Cthe_0143
Symbol: eno
ID: 4808701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 182307
End bp: 183608
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 45%
IMG OID: 640105554
Product: phosphopyruvate hydratase
Protein accession: YP_001036577
Protein GI: 125972667
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0148] Enolase
TIGRFAM ID: [TIGR01060] phosphopyruvate hydratase
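
The Start bp, End bp, Gene Length, and Protein Length fields above agree with each other if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. The Python sketch below illustrates that arithmetic. Both conventions are assumptions made here, not something the record states, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(182307, 183608)
print(length, protein_length_aa(length))  # 1302 433
```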


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000522536
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAGT ACTTGGAAAT TGAGAGCGTA TTTGCGAGAG AAATACTTGA TTCAAGGGGT 
AATCCCACAG TAGAGGTGGA GGTTATAGCA GAAGGCGGAT TTGTAGGCAG AGCGGCAGTT
CCGTCAGGTG CATCCACAGG AGCTTTTGAA GCCGTAGAAC TGAGGGACAA CGACAAAAAC
AGATATTTGG GCAAAGGTGT CCAGAAAGCA GTGGAAAACG TAAACAATAT TATTGCGCCT
GAAGTGGAAG GCATGAATGT GTTTGACCAG GCGGCTGTTG ACAACCTCAT GATAAGCCTG
GACGGTACTC CAAACAAGTC AAAGCTCGGT GCCAATGCCA TCCTGGGAGT TTCCCTTGCG
ACAGCAAAAG CTGCTGCTGA AGCTCTTGGG TTAAGCCTTT ACCAATATAT CGGAGGAGTA
AATGCAAAGA CCCTTCCGGT ACCTATGATG AACATCATAA ACGGAGGAAA ACACGCCGAC
AACAGTGTAA ATATCCAGGA GTTTATGATA ATGCCGGTGG GTGCTTCTTC CTTCAGACAT
GCACTTCAAA TGTGCGCAGA AGTTTTCCAT AACCTGAAGA AAGTGTTAAA GGATAAAGGT
TACAGTACGG CTGTGGGAGA CGAAGGAGGA TTTGCTCCGA ATCTTAAAAC TGATGAAGAG
GCAATCCAGG TTATATTGGA AGCTGTTGAG AAAGCGGGCT ACAAACCGGG TGATGATTTC
AGACTCGCAA TAGACGCTGC TTCCACGGAA ATGTACCAGG AAGACGGAAC ATATCTTTTC
TGGAAATCCG GTGTGAAAAA GACAAAAGAG GAAATGATAA ACTACTGGGA AGAGCTTGTT
AATAAATACC CGATTATTTC TCTGGAAGAC GGTGTTGCGG AAGAAGACTG GGAAGGCTGG
AAAATGCTTA CCGAAAGACT CGGAAAAAGA ATTCAGCTTG TGGGAGACGA CCTCTTTGTT
ACAAACACAA CCAGACTTAA AAAAGGTATA GAATTGGGAG TTGCCAATTC CATACTTATT
AAAGTAAACC AAATAGGAAC TCTTACTGAA ACCCTTGATG CCATAGAAAT GGCAAACCGT
GCGGGATATA CCGCAGTTGT ATCCCACAGA TCGGGAGAGA CTGAAGATGC AACAATTGCG
GACATAGCTG TGGCAACCAA TGCCGGACAG ATAAAGACCG GAGCTCCCTC AAGAACTGAC
CGTGTGGCAA AATACAATCA GCTGTTAAGA ATAGAGGAAG AAATAGGAGC AGTCAGCCGT
TATCCCGGAC TTGATGCCTG GTTCAACCTT AAAAAGAAAT AA
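
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined into one string before it can be measured or compared with the Gene Length and GC content fields. A minimal Python sketch of that cleanup follows. The helper names are hypothetical, and the GC calculation assumes the sequence contains only unambiguous A, C, G, and T.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of spaces or newlines.
    return "".join(block_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; assumes only A, C, G, T are present."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Small illustrative input, not the full gene.
example = clean_sequence("ATGC ATGC\nGG")
print(example, gc_fraction(example))
```

Pasting the full gene sequence into `clean_sequence` and taking `len()` of the result gives a value to compare against the Gene Length field.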
 
Protein sequence
MKQYLEIESV FAREILDSRG NPTVEVEVIA EGGFVGRAAV PSGASTGAFE AVELRDNDKN 
RYLGKGVQKA VENVNNIIAP EVEGMNVFDQ AAVDNLMISL DGTPNKSKLG ANAILGVSLA
TAKAAAEALG LSLYQYIGGV NAKTLPVPMM NIINGGKHAD NSVNIQEFMI MPVGASSFRH
ALQMCAEVFH NLKKVLKDKG YSTAVGDEGG FAPNLKTDEE AIQVILEAVE KAGYKPGDDF
RLAIDAASTE MYQEDGTYLF WKSGVKKTKE EMINYWEELV NKYPIISLED GVAEEDWEGW
KMLTERLGKR IQLVGDDLFV TNTTRLKKGI ELGVANSILI KVNQIGTLTE TLDAIEMANR
AGYTAVVSHR SGETEDATIA DIAVATNAGQ IKTGAPSRTD RVAKYNQLLR IEEEIGAVSR
YPGLDAWFNL KKK