Gene Cthe_1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1200 
Symbol 
ID4810153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1430523 
End bp1431767 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content47% 
IMG OID640106623 
Productadenosylhomocysteinase 
Protein accessionYP_001037625 
Protein GI125973715 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0499] S-adenosylhomocysteine hydrolase 
TIGRFAM ID[TIGR00936] adenosylhomocysteinase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.51661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAGTG TTATTCGTGA TATCAGCCTT GCAAAATCCG GCAGGCAGAA AATAAATTGG 
GTAAGGAAGA ATATGCCGCT TTTAAGAAGC CTTGAGGAAG AGTTCGCAAA AACAAAGCCT
TTTGACGGAA TAAGAATTAC GGTATCGGTG CATCTCGAGG CAAAAACGGC GTATCTTGCA
AAGCTTTTTG CAATAGGCGG CGGACAAGTT TCCGTTACGG GAAGCAATCC GCTGTCCACC
CAGGATGACG TTGCCGCGGC TCTTGCGGAA GACGGGCTTA ATGTTTACGC ATGGTACAAT
GCAACGGAAA AAGAATATAA GGAACATTTG AATCTTGCTT TGGATGGCAG GCCGAACATT
GTTATTGATG ACGGAGGCGA CCTTGTACAT CTGCTGCACA CCGAAAGAAC TGAGCTTCTT
TCCGATGTCA TGGGTGGATG CGAGGAGACC ACGACTGGGG TTATAAGGCT TAAGGCTATG
GAAAAAGAAG GTGCCTTGAA ATTCCCCATG GTGGCGGTAA ACAATGCTTA CTGCAAATAT
TTGTTTGATA ACCGATACGG TACAGGCCAG TCGGTGTGGG ACGGTATAAA CAGAACCACC
AATCTCATCG TGGCCGGAAA AAATGTTGTT GTGGTAGGTT ACGGATGGTG CGGCAAGGGT
ATTGCCATGA GAGCCAAGGG ATTGGGCGCA AGTGTTATTG TGTGCGAAGT TGATCCGATA
AAAGCTGCCG AGGCCGTCAT GGACGGATAC AAGGTTATGC CCATGATGGA GGCGGCAAAG
ATTGGTGATT TGTTTATTAC CGCAACCGGC TGCAGCAGGG TGATACACCG TGAACATTTC
AAGGTAATGA AGGACGGCGC CATACTATGC AACGCCGGAC ATTTTGATGT GGAAGTAAGT
GTTAAGGATT TGGAAGAAAT GGCTGTCCGC AAGGAAGAAC AGCGCAAAAA CATTATGGGA
TACATGATGG AGGACGGCAG GTGGATAAAT CTTCTTGCGG AAGGAAGACT GGTAAACCTT
GCGGCCGGAG ACGGACATCC GGCGGAAATC ATGGACATGA GTTTTGCCCT TCAGGCCCTG
AGTGCAAAAT ATGTGCTTGA AAACCATTCA AGGCTTGGAA AAAAGGTGAT TGACGTGCCT
GAGGAAATTG ACAGAAGGGT TGCGTTAATG AAACTTGAAT CCTGGGGAAT TACAATAGAT
GAGCTTACCG AGGAGCAGAA AAAATATCTT GACAGCTGGT GCTAA
 
Protein sequence
MGSVIRDISL AKSGRQKINW VRKNMPLLRS LEEEFAKTKP FDGIRITVSV HLEAKTAYLA 
KLFAIGGGQV SVTGSNPLST QDDVAAALAE DGLNVYAWYN ATEKEYKEHL NLALDGRPNI
VIDDGGDLVH LLHTERTELL SDVMGGCEET TTGVIRLKAM EKEGALKFPM VAVNNAYCKY
LFDNRYGTGQ SVWDGINRTT NLIVAGKNVV VVGYGWCGKG IAMRAKGLGA SVIVCEVDPI
KAAEAVMDGY KVMPMMEAAK IGDLFITATG CSRVIHREHF KVMKDGAILC NAGHFDVEVS
VKDLEEMAVR KEEQRKNIMG YMMEDGRWIN LLAEGRLVNL AAGDGHPAEI MDMSFALQAL
SAKYVLENHS RLGKKVIDVP EEIDRRVALM KLESWGITID ELTEEQKKYL DSWC