Gene Cthe_0217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0217 
Symbolpgi 
ID4808635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp265432 
End bp266778 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content43% 
IMG OID640105630 
Productglucose-6-phosphate isomerase 
Protein accessionYP_001036651 
Protein GI125972741 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGAA TAAAATTTGA CTATTCAAAA GCATTGCCTT TTGTAAGTGA ACGTGAAGTT 
GCATATTTCG AGAATTTTGT AAGGTCTGCC CATGACATGC TCCATAACAA AACCGGAGCG
GGAAATGACT TTGTAGGCTG GGTTGATCTT CCTGTAAATT ATGACAGGGA AGAATTTGCG
AGAATCAAGG CTGCGGCAGA AAAGATAAAA TCTGATTCTG ATGCTTTGGT TGTAATTGGA
ATCGGAGGTT CCTATCTGGG AGCAAGGGCG GCAATAGAGA TGCTTTCCCA CTCATTCCAC
AATCTCATGC CCAAATCAAA GAGGAATGCT CCTGAGATAT ATTTTGTGGG AAACAATATC
AGCTCTACAT ACATTGCTGA TTTGCTGGAA GTAATAGAAG GCAAAGAGAT TTCGGTAAAC
GTTATATCAA AATCCGGTAC TACAACGGAG CCTGCCATTG CTTTCAGAAT CTTTAAAGAG
TACATGGAAA ACAAATACGG AAAAGACGGA GCAAGTAAAA GAATATATGC CACTACCGAC
AAGGAGAAAG GAGCACTCAG GAAGCTGGCA ACCGAAGAGG GATATGAAAC ATTTGTAGTT
CCTGATGACA TAGGTGGAAG ATTCTCCGTT CTGACGGCAG TTGGCTTGCT TCCCATTGCA
GTGGCCGGAA TTGACATCGA CAGCATGATG AAGGGAGCTG CTGACGCCCG TGAGCTTTAC
AGCAATCCAA ACCTGATGGA AAACGACTGC TACAAATATG CGGCTGTAAG AAACGCTCTC
TACAGAAAGA ACAAGACAAT TGAGATAATG GTAAACTATG AACCTTCACT CCATTACTTC
ACAGAATGGT GGAAACAGCT CTACGGAGAA AGTGAAGGAA AGGATCAAAA AGGTATATTC
CCGGCCGGAG TTGACTTCAC TACGGACCTT CATTCCATGG GACAGTATAT ACAGGATGGA
CTCAGGAACA TATTTGAAAC GGTAATCAGG GTTGAAAAGC CCAGAAAGAA TATTGTTATA
AAGGAAGAAA AGGACAACCT TGACGGATTG AACTTTATTG CCGGAAAAGA CGTGGACTAT
GTAAACAAGA AAGCAATGGA AGGAACGGTA CTTGCCCATA CCGACGGCGG TGTTCCGAAT
CTTGTGGTAA CCGTGCCTGA GCTTAGTGCT TATTACTTTG GAAATATGGT ATACTTCTTT
GAAAAAGCCT GCGGTATAAG CGGATACCTC CTTGGTGTAA ATCCTTTTGA CCAGCCGGGA
GTTGAGGCTT ACAAGAAAAA CATGTTTGCC CTTCTTGGAA AACCGGGATA TGAAGAACAA
AGAAAGAAAC TTGAAGAGCG TTTGTAA
 
Protein sequence
MERIKFDYSK ALPFVSEREV AYFENFVRSA HDMLHNKTGA GNDFVGWVDL PVNYDREEFA 
RIKAAAEKIK SDSDALVVIG IGGSYLGARA AIEMLSHSFH NLMPKSKRNA PEIYFVGNNI
SSTYIADLLE VIEGKEISVN VISKSGTTTE PAIAFRIFKE YMENKYGKDG ASKRIYATTD
KEKGALRKLA TEEGYETFVV PDDIGGRFSV LTAVGLLPIA VAGIDIDSMM KGAADARELY
SNPNLMENDC YKYAAVRNAL YRKNKTIEIM VNYEPSLHYF TEWWKQLYGE SEGKDQKGIF
PAGVDFTTDL HSMGQYIQDG LRNIFETVIR VEKPRKNIVI KEEKDNLDGL NFIAGKDVDY
VNKKAMEGTV LAHTDGGVPN LVVTVPELSA YYFGNMVYFF EKACGISGYL LGVNPFDQPG
VEAYKKNMFA LLGKPGYEEQ RKKLEERL