Gene Cthe_1019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1019 
Symbol 
ID4811313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1219127 
End bp1220110 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content37% 
IMG OID640106437 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001037444 
Protein GI125973534 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.103701 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATAC AATTAAAGAA ATCTGGTATA GGAGTTAAGG AAAAAAAGTC AAAAAATCAT 
CTGTTATATT CAATAAAGCA GAATTTATTT GCGTATGCAA TGTTAATACC TACTTTTGTT
TGCATGATGT GCATTCACTT TATTCCCATG CTTCAGGGAA TATATCTGTC TTTGCTGGAT
CTTAACCAGT TGACAATGAC TAAGTTTTTG AATGCACCGT TTATAGGTCT GAAAAATTAT
TATGAAATTC TTTTTGATGA AAAGAGTTTG ATTAGAAGAG GTTTCTGGTT TGCTCTTAGA
AATACGGCCA TTTATACGGT GGTAGTTACT TTTGCAACAT TTGCCCTGGG AATTATACTG
GCTATGCTTG TAAACAGGGA ATTTAAAGGG AGAGGTATTG TAAGAACCGC GCTCCTTATG
CCTTGGGTTG TACCTTCCTA TGTTGTTGGT ATGACATGGG GCTTTTTATG GAGACAGGAT
TCAGGTTTAA TAAACATTAT TTTGTGTGAC ATACTGCATA TATTACCCGA AAAGCCGTAT
TGGCTGGTAG GGTCCAACCA GATTTGGGCA ATAATTATAC CTACAATATG GAGAGGTCTT
CCTCTTTCTA TGATTCTTAT GCTGGCCGGT TTGCAGAGTA TATCACCGGA TTATTATGAA
GCAGCTGATA TTGATGGTGC CAACGGTTGG CAGAAGTTCT GGCATATAAC TTTGCCTCTG
TTGAAACCTA TTCTTGCCAT CAATGTTATG TTCTCATTAA TTTCAAATAT TTATTCTTTC
AATATTGTTT CAATGATGTT TGGTAATGGT GCCGGTATAC CGGGTGAATG GGGAGATCTT
CTGATGACAT ACATTCAGAG AAATACATTC CAGATGTGGA GGTTTGGCCC GGGTGCGGCG
GCTTTAATGA TTGTAATGTT CTTTGTACTT GGTATTGTTG CTTTATGGTA TACACTCTTT
AAAGATGATT TGGTGGTGAA GTAA
 
Protein sequence
MDIQLKKSGI GVKEKKSKNH LLYSIKQNLF AYAMLIPTFV CMMCIHFIPM LQGIYLSLLD 
LNQLTMTKFL NAPFIGLKNY YEILFDEKSL IRRGFWFALR NTAIYTVVVT FATFALGIIL
AMLVNREFKG RGIVRTALLM PWVVPSYVVG MTWGFLWRQD SGLINIILCD ILHILPEKPY
WLVGSNQIWA IIIPTIWRGL PLSMILMLAG LQSISPDYYE AADIDGANGW QKFWHITLPL
LKPILAINVM FSLISNIYSF NIVSMMFGNG AGIPGEWGDL LMTYIQRNTF QMWRFGPGAA
ALMIVMFFVL GIVALWYTLF KDDLVVK