Gene Cthe_0747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0747 
Symbol 
ID4810365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp913436 
End bp914500 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content40% 
IMG OID640106164 
Productextracellular solute-binding protein 
Protein accessionYP_001037175 
Protein GI125973265 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.389252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTGCGTTGCT ATTGGCTGTG TTGATGTTTG TATTTACAGG CTGTGCCGGC 
AATGTTAAAA ATGACCCCAG CAAACCTCTT GCGGGTACGA CTATTTATGT CTATAACTGG
GGAGATTATA TAGCTGAAGA TACTATTGAA CGTTTTACCA AGGAAACGGG CATTAAGGTT
ATTTATGAAA CTTTTGATTC CAATGAAACC ATGTATGCTA AATATAAATC CGGTGCGGTA
AATTACGATG TTTTGATTCC GTCCGATTAT ATGATTGAGA AATTGATTGC GGAAAATGAA
CTGTTGCCTT TGAATTTTGA CAATATTCCC AATGCAAAAT ATATTGACGA ATCATTCAGA
AATTTGGGTT ATGACCCGGA AAACAAGTAT TCTGTGCCAT ATTTCTGGGG AACTCTCGGA
ATTCTTTACA ACAAAAAAAT GGTGCAGGAG GAAGTTGACT CCTGGGATAT TCTGTGGGAC
AGCAAGTACA AGGGTCAAAT TATAATGATG GATTCGGTAA GAGACACTTT TGCTGTTGCC
CTCAAAAGGT TGGGATATTC CCTGAACAGC ACTGACAAAG CGCAGGTGGA TGAAGCAGTG
GCAAGCCTGA TAGAGCAGGC TCCGTTGGTT CAGGCTTACC TGATGGACCA GGTAAAGGAC
AAGATGATAG GGGAAGAGGC GGCTTTGGCG GTAATTTATT CCGGAGAAGC TGTTTATACT
TCTGAATACA ACGAGAATTT GGAATATGCA GTTCCAAAGG AAGGTACAAA CTTCTTTGTT
GACGCCATGG TTATACCAAA GACTTGCCAA AACAAAGAAG CAGCAGAAGC TTTTATCAAT
TTTATGAATG ATCCTCAGAT TGCTTACAAC AATACCAAGT ATGTTGGATA TTCCACACCG
CATACGGAAG CCAGAGACAT GTTGGATGAA GAAATAAAAA ACAATCCTGC GGCTTACCCT
CCACAGGAAA TTATAGACAA GTGTGAAGTG TTTGTGGATC TTGGACCGGA GATGACGGTT
TACTACAATG ACAAATGGAA TGAATTGAAA GCATCGTTGC GTTAA
 
Protein sequence
MKKIALLLAV LMFVFTGCAG NVKNDPSKPL AGTTIYVYNW GDYIAEDTIE RFTKETGIKV 
IYETFDSNET MYAKYKSGAV NYDVLIPSDY MIEKLIAENE LLPLNFDNIP NAKYIDESFR
NLGYDPENKY SVPYFWGTLG ILYNKKMVQE EVDSWDILWD SKYKGQIIMM DSVRDTFAVA
LKRLGYSLNS TDKAQVDEAV ASLIEQAPLV QAYLMDQVKD KMIGEEAALA VIYSGEAVYT
SEYNENLEYA VPKEGTNFFV DAMVIPKTCQ NKEAAEAFIN FMNDPQIAYN NTKYVGYSTP
HTEARDMLDE EIKNNPAAYP PQEIIDKCEV FVDLGPEMTV YYNDKWNELK ASLR