Gene Cthe_1046 details

Gene Information

Locus tag: Cthe_1046
Symbol:
ID: 4811344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1248913
End bp: 1250253
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 39%
IMG OID: 640106468
Product: extracellular solute-binding protein
Protein accession: YP_001037471
Protein GI: 125973561
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
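
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon (neither is stated in the record itself). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length(1248913, 1250253)
print(length)                  # 1341
print(protein_length(length))  # 446
```

Under these assumptions both values match the Gene Length and Protein Length fields.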


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAACA CCGCAGTGAA GTTGTTGCTG GTTTTTCCGG TTTTGCTCGC TTACATATTT 
TTTACCGGAT GTTCTAAAAA GCCTGCGAAG GCAGAGGAAA ATACACAAAT TCAGATAAGC
AGCACACCTC AGCAAGATTT TGACCTGGGC GGTTACACCG TTAGAATTGC TCAATGGTGG
GACGCCAGTC CAAACGATAG AAGCTCAATA GCCCGTCACA AGGCAGCGGA AGAAAAATAT
AATTGCAAAA TTGAATATAT AACCATTACA TGGGATCAAA TTGTAAGCAA ATTTACCTCA
TCAGTATTAT CAGGAGAACC TATAGCCGAT ATTGTTTTGT TTGAAATGAC CAGGGCGCTT
CCCGTGCTTG CTGAATCAGA CCTTATAATT CCGGTGGACG ACTACTTTGA CTTTAATGAT
CGGAAATGGC CCCCTATAAT TAGGCAAATA GGAAGATATA AAGGAAAACA ATACGGATTT
ACGAACTATT GCTGGACTGT AACGGGGATT TTTTACAACA AAGTGTTGTT TGACAGATTG
GGATTGCCTG ACCCGTATAT GCTTCAGGAG AACGGAGATT GGACTTGGGA AAAATTTGCT
GAGATAGCTC AAATGGCAAC CAGAGATGAA GACGGCGACG GAGAGAATGA CCTGTGGGGA
TTGGCAATAC AAGGTCATAA TCTTTATTCT CCTCTTATTT TATCAAACAA TGCCAATATC
ATAAATTTTG ATGAAAACGG CAGAGCTATC TATGCTCTTG ATGACCCAAA TGCCATTGAA
GCTCTTCAGT TTTTTGAGGA TTTGCACAAT AAATATAAAG TTGTGGCGCC TGTTGAAGAT
CCAACTGACT GGTATGAGGC TCCTCGAAAA TTTTCAGAAG GCAATATTGC CATGTTTTTT
GGACATGGCT GGGACGGACA GGAACTCAAA AATACGATGA AAGATGATTT TGGTTTTGTA
TTTTTTCCAA AAGGACCTAA AGCTTCCGAT TACATAGTTC CTGTTCAGCA GGAGTGCAAA
ATTTATGTAA TGCCCAAATA TGCAAAGCAC CCAAGAGAAG TGGCTAAAGT TTTTGAAGAA
ATATCTCCCT TCTATAATGA CAATGTAGGA TTCGAAAGCT GGATTAATAC ATTTCTGGAC
ACAGATGGAG AGAAGAATAC GGCGAGAATG ATGCTTGAAA AAGGGAAGGT ATCGTTGCAT
CAAGCGTATC CTACCTTTGA TAATCTTCTT TTCAATAAAA TAGCAAGAGA AATTATAATA
GACAACATTT CGGTTGAAGA TTTTGTAAAA AAATTCAAAG ATGAGGCCCA AAAGGCTATT
GATTCTGAAT ATGAAAGATA A
 
Protein sequence
MKNTAVKLLL VFPVLLAYIF FTGCSKKPAK AEENTQIQIS STPQQDFDLG GYTVRIAQWW 
DASPNDRSSI ARHKAAEEKY NCKIEYITIT WDQIVSKFTS SVLSGEPIAD IVLFEMTRAL
PVLAESDLII PVDDYFDFND RKWPPIIRQI GRYKGKQYGF TNYCWTVTGI FYNKVLFDRL
GLPDPYMLQE NGDWTWEKFA EIAQMATRDE DGDGENDLWG LAIQGHNLYS PLILSNNANI
INFDENGRAI YALDDPNAIE ALQFFEDLHN KYKVVAPVED PTDWYEAPRK FSEGNIAMFF
GHGWDGQELK NTMKDDFGFV FFPKGPKASD YIVPVQQECK IYVMPKYAKH PREVAKVFEE
ISPFYNDNVG FESWINTFLD TDGEKNTARM MLEKGKVSLH QAYPTFDNLL FNKIAREIII
DNISVEDFVK KFKDEAQKAI DSEYER
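
The gene and protein sequences can be related to each other by translating the nucleotide sequence codon by codon. The sketch below is a minimal, dependency-free Python example, not the method used to produce this record. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence listed above.
print(translate("ATGAAAAACA CCGCAGTGAA GTTGTTGCTG"))  # MKNTAVKLLL
print(translate("GATTCTGAAT ATGAAAGATA A"))           # DSEYER
```

The two excerpts reproduce the first ten and last six residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.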