Gene Cthe_1058 details

Gene Information

Locus tag: Cthe_1058
Symbol: glyA
ID: 4811356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1264583
End bp: 1265821
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 43%
IMG OID: 640106480
Product: serine hydroxymethyltransferase
Protein accession: YP_001037483
Protein GI: 125973573
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0112] Glycine/serine hydroxymethyltransferase
TIGRFAM ID:
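
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record itself, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1264583
end_bp = 1265821

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1239
print(protein_length)  # 412
```

Both results agree with the Gene Length (1239 bp) and Protein Length (412 aa) fields.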


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00108739
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAATT TGAATGAAAT ATCAAAAATC GATCCCGAAG TTGCGAAGGC AATTGAATTG 
GAGGTTAATC GTCAGAGAAA CAAGATAGAG CTTATTGCAT CTGAAAATTT TGTCAGTAAA
GCCGTAATAG AAGCAATGGG TACACCTCTG ACCAACAAGT ATGCTGAAGG ATATCCGGGA
AAAAGGTATT ACGGAGGCTG TGAGTTTGTT GACATAATTG AAAATCTTGC GATTGAACGG
GCAAAGAAAA TATTCGGAGC TGAGCATGCG AATGTGCAGC CGCATTCAGG GGCTCAGGCA
AATATGGCTG TGTTTTTTGC AGTGTTAAAT CCCGGAGATA CGATTCTTGG AATGAATCTT
TCCCATGGAG GGCATTTGAG CCATGGAAGC CCTGTCAACA TGTCCGGAAA ATATTATAAT
GTCATATCCT ACGGAGTAAG GAAGGAAGAC TGCAGAATAG ACTATGACGA AGTGAGAAAG
CTTGCAAAGG AACACAGGCC GAAACTTATA GTGGCGGGAG CCAGTGCATA TCCAAGAATA
ATAGATTTTA AGGCTTTCAG AGATATTGCG GATGAAGTCG GAGCATATTT GATGGTGGAT
ATTGCACATA TAGCAGGTCT TGTTGCAGCA GGACTGCACC CGAATCCTGT TCCTTATGCA
CATTTTGTTA CCACCACCAC TCACAAGACT TTGAGAGGTC CGAGAGGCGG ACTGATATTG
TGCGGCAATG AGCATGCAAA AATGATTGAC AAGGCTGTTT TCCCGGGAAT ACAGGGCGGT
CCTCTGATGC ATGTTATTGC GGCAAAAGCG GTAAGCTTTG CCGAAGTATT GACCGATGAA
TTCAAGCAGT ATCAGCAGCA GATAGTAAAA AATGCGAAAA CTCTTGCCAA CGCTTTGATG
GAGAAAGGCA TTGACCTTGT TTCCGGTGGA ACGGACAACC ATCTCATGCT GGTTGATTTA
AGAAATAAAG GTCTTACGGG TAAATACGTT CAGCATATTC TTGATGAGGT TTGCATTACC
GTAAATAAAA ACGGAATTCC TTTTGACCCT GAAAGTCCGT TTGTTACCAG CGGTATCAGA
ATAGGAACAC CTGCGGTGAC GGCACGGGGT ATGAAAGAAG AGGATATGGT TGAGATAGCG
GATCTTATCA ATCTCACCAT TACGGATTAT GAGAATTCGA AAGAGAAAGT AAAGGAAAGA
GTAAGAATGC TATGCGAAAA ATATCCTTTG TATCAGTAA
 
Protein sequence
MFNLNEISKI DPEVAKAIEL EVNRQRNKIE LIASENFVSK AVIEAMGTPL TNKYAEGYPG 
KRYYGGCEFV DIIENLAIER AKKIFGAEHA NVQPHSGAQA NMAVFFAVLN PGDTILGMNL
SHGGHLSHGS PVNMSGKYYN VISYGVRKED CRIDYDEVRK LAKEHRPKLI VAGASAYPRI
IDFKAFRDIA DEVGAYLMVD IAHIAGLVAA GLHPNPVPYA HFVTTTTHKT LRGPRGGLIL
CGNEHAKMID KAVFPGIQGG PLMHVIAAKA VSFAEVLTDE FKQYQQQIVK NAKTLANALM
EKGIDLVSGG TDNHLMLVDL RNKGLTGKYV QHILDEVCIT VNKNGIPFDP ESPFVTSGIR
IGTPAVTARG MKEEDMVEIA DLINLTITDY ENSKEKVKER VRMLCEKYPL YQ
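
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record: it builds the codon-to-amino-acid mapping shared by the standard code and translation table 11 (the two differ in permitted start codons, which this sketch does not model), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
first_block = "ATGTTTAATT TGAATGAAAT ATCAAAAATC"
print(translate(first_block))  # MFNLNEISKI

# Last 9 bases of the gene sequence: two codons followed by the TAA stop.
print(translate("TATCAGTAA"))  # YQ
```

The two outputs match the first ten residues and the final two residues of the protein sequence listed above.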