Gene Cthe_1361 details


Gene Information

Locus tag: Cthe_1361
Symbol:
ID: 4809356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1653882
End bp: 1654925
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 33%
IMG OID: 640106785
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001037786
Protein GI: 125973876
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
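
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1653882
end_bp = 1654925

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a
# stop codon, which does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the reported gene length of 1044 bp and protein length of 347 aa.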


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAA TAGACTTAAG AAATAAGATT ATATTTATTA CAGGTGTTGC TGGTTTTATT 
GGAGCTTATT TTGCTAAGCA ATTACTTGAT ACAGTGGATG GTATTACAAT TATAGGTATT
GACAATATGA ATGATTATTA TGATGTTAAA TTAAAAGAAA GTCGTTTAGA AAGTTTGTGT
AATAATTCAA AATTCATTTT TGTGAAAGGA AACATTGCAG ATAAAGAATT AATTAATAAT
ATTTTTAATA CATACCATCC ACAGATTGTA GTTAATTTAG CTGCGCAGGC TGGAGTTAGA
TATAGCATTA CTAACCCAGA TGCTTATATT GAATCGAATA TAATAGGTTT TTACAATATA
CTTGAGGCCT GCCGTCATTC ATATGACGAA GGTAAAGTTC CAGTTGAACA CCTTGTTTAT
GCGAGCAGTT CATCTGTTTA TGGTTCAAAC AAGAAAGTAC CATATTCTAC TGAAGACAAA
GTAGACTATC CTGTTTCACT GTATGCAGCA ACAAAAAAGT CTAATGAGTT AATGGCTTAT
ACATATAGTA AATTGTACAA TATACCATCG ACAGGATTAC GCTTCTTTAC TGTTTATGGT
CCTGCGGGAC GACCGGATAT GGCGTATTTT AGCTTTACAA ATAAGTTAGC ACAAGGTAAA
AAGATTCAGA TTTTTAACTA TGGTGACATG TACCGTGATT TTACATATAT TGATGATATA
GTTAAGGGTA TTGTGCTTGT GCTCCAAAAG GTTCCTGAAC CTATGGAAGA TGGAGTAAGA
TACAAGATTT ACAATATCGG AAATAACAAA CCAGAAAATT TAATGCATTT CGTGGAAGTA
TTGGAGAAAT GTCTGATGGA GGAAGGCATT ATTACAAAGC CTGGTGAAAA AGAACTACTT
CCTATGCAGC CTGGTGATGT ATACCAAACA TATGCTGATG TAGATGATTT AGTTCGAGAT
TTTGGATTTA AGCCAAGCAC AAGTCTTGAA GAAGGATTGA GTAAGTTTGC TAAGTGGTAT
CGTGAATTTT ATATGCAGAA ATAA
 
Protein sequence
MSKIDLRNKI IFITGVAGFI GAYFAKQLLD TVDGITIIGI DNMNDYYDVK LKESRLESLC 
NNSKFIFVKG NIADKELINN IFNTYHPQIV VNLAAQAGVR YSITNPDAYI ESNIIGFYNI
LEACRHSYDE GKVPVEHLVY ASSSSVYGSN KKVPYSTEDK VDYPVSLYAA TKKSNELMAY
TYSKLYNIPS TGLRFFTVYG PAGRPDMAYF SFTNKLAQGK KIQIFNYGDM YRDFTYIDDI
VKGIVLVLQK VPEPMEDGVR YKIYNIGNNK PENLMHFVEV LEKCLMEEGI ITKPGEKELL
PMQPGDVYQT YADVDDLVRD FGFKPSTSLE EGLSKFAKWY REFYMQK
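
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene sequence only. It assumes the amino-acid assignments of NCBI translation table 11, which for sense and stop codons are the same as in the standard table. The helper name `translate` and the table-building code are illustrative, not taken from the source database.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (bases 1-60 of the CDS).
first_line = "ATGTCAAAAA TAGACTTAAG AAATAAGATT ATATTTATTA CAGGTGTTGC TGGTTTTATT"
# Last line of the gene sequence (bases 1021-1044, still in frame).
last_line = "CGTGAATTTT ATATGCAGAA ATAA"

print(translate(first_line))
print(translate(last_line))
```

The two fragments translate to the first 20 residues and the last 7 residues of the protein sequence listed above. The last fragment stays in frame because the 1020 bases before it are a multiple of three.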