Gene Ccel_0150 details


Gene Information

Locus tag: Ccel_0150
Symbol:
ID: 7309060
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 172536
End bp: 174239
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 38%
IMG OID: 643607079
Product: extracellular solute-binding protein family 1
Protein accession: YP_002504518
Protein GI: 220927609
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
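
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 172536
end_bp = 174239
gene_length_bp = 1704
protein_length_aa = 567

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
residues = codons - 1

print(span, codons, remainder, residues)
```

Under these assumptions the span agrees with the reported gene length, and the codon count minus the stop codon agrees with the reported protein length.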


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGTTAA GAAAGAGAAA TTGGCTAGTA GTACTATTAA CAGTTTTATT GGCAGTAAGC 
TTTACTTTAA CAGGTTGTGG CGGTTCTGAT AGCAGCTCTT CTACTTCAAC TTCCGGAACA
GCAACAGCTA CAGGTTCAGC AGCAGGTGAA AAGCTGGATC CTATAGAGAT AAGCTTCTTT
ATTTCTGATC CAGGTCAGGC TCCGACACCT GATAACAAGA TTTACAAGAA AATCAAGGAA
GAACTTGGAG TTACTTGTAA TTTTGAGTTT TTAGTAGGTG ACAAAAACCA GAAGATAGGA
GTTATGATTG CTGGTGGCGA ATATCCTGAT GTAGTAACTG TTGGATCTGA TACAGTTAGT
AAATTTACTG GAGCTGGTGC TTTAGTTCCT TTAGAGGACA TTATCGAAAA GAGTGCTCCA
AACCTCAAGA AACATTTTGA TCCATTTAAG AATAAGGTAA AGGATGTTGA AGACGGACAT
TTTTATGCAA TGCCTGGCTA CGGTGTATAT TATAATGATT TTAGTATAAA TGTTAATGAA
GGACCTGCTT TCTTTATACA AAAAGCAGTG TTAAAGGATG CTGGTTACCC GAAGGTTAAA
ACTCTTGACC AGTACTTCGA TCTTATAGAA AAATATAAAG CAAAAAATCC TACAATAGAC
GGACAACCAA CAATAGGTTT TGAAATTCTT TCAGAAGGCT GGAGAGATTT CTGCCTCAAG
AATCCACCTC AACATCTGAT CGGTCATCCA AACGATGGCG GTGTTGTAGT TAATAGTGAG
ACTAATACTG CAGAATTCTT CTGGGATAAG GATTATGCAA AGAGATACTA TAAGAAAATT
AATGAAATTA ATGCTAAGGG CTTACTTGAC CCTGAAACAT TTACTATGAA CTTTGACCAA
TATATCGCTA AGCTTTCAAG CGGTAGAGTT CTTGGTATGT TCGACCAGCA CTGGAACTTC
AATAATGCTG AATTAACATT AAAGACTCAG AAGAAGTTTG AAAGAACATA TGCTCCATTG
CCTTTAGTAT TCGACGAAGA TACTAGGGAT TACTACATGG ACAGACCTGT TCTCAATGTT
AACACAGGTT ATGCTATTAC AAAGAGTGCA AAAAATCCTG AGAGAATTGT CAAATTCTTC
GATGCATTGT TAACAGAAGA ATGGCAGACG ATTCTCGGAT GGGGAATCAA AGACGAAGAC
TACAAGGTTG GCGATGACGG CATGTTCTAC ATGACTCCTG AACAACGTGT AAACTACAAT
GACCAGACAT GGAGACTTGC AAACATGGCT CACACTCTCT GGTACTATGC TCCTAAGATG
GAAGGTACAT TTAGCGATGG AAATGCTACA GGCCCCGGCG GACAGCCAAA AGAATACTAT
GATGCACTTG ACCAGTACGA CAAAGATTTC TTTAAGGCTT ACGGATATGA TCAGCAGTCT
GATTTCTTTA GCCCAGCTCC TGAAAACAGA ATATCATACC CAGCATGGCA GATAGATCTT
GTTGACGGTT CACCTGCAAG CATGGCAAAC ACAAAAGTTG GAGATATTGC AACTAAATAT
TTACCAAAGG CAATACTTGC TAAACCAGAT AAATTCGACA GTGTATGGGA TGAATATGTA
AATCAACTCC ACAAAGAAGA TATTCAAGCT TATGTTGATA GAATAAATGA TCAACTCAAG
TGGAGAGCTG AGAACTGGAA GTAA
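
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input, so its value need not match the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as sample input only.
first_line = "ATGATGTTAA GAAAGAGAAA TTGGCTAGTA GTACTATTAA CAGTTTTATT GGCAGTAAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

To compare against the reported GC content, pass the full listing (all lines of the gene sequence) to `clean_sequence` instead of the single sample line.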
 
Protein sequence
MMLRKRNWLV VLLTVLLAVS FTLTGCGGSD SSSSTSTSGT ATATGSAAGE KLDPIEISFF 
ISDPGQAPTP DNKIYKKIKE ELGVTCNFEF LVGDKNQKIG VMIAGGEYPD VVTVGSDTVS
KFTGAGALVP LEDIIEKSAP NLKKHFDPFK NKVKDVEDGH FYAMPGYGVY YNDFSINVNE
GPAFFIQKAV LKDAGYPKVK TLDQYFDLIE KYKAKNPTID GQPTIGFEIL SEGWRDFCLK
NPPQHLIGHP NDGGVVVNSE TNTAEFFWDK DYAKRYYKKI NEINAKGLLD PETFTMNFDQ
YIAKLSSGRV LGMFDQHWNF NNAELTLKTQ KKFERTYAPL PLVFDEDTRD YYMDRPVLNV
NTGYAITKSA KNPERIVKFF DALLTEEWQT ILGWGIKDED YKVGDDGMFY MTPEQRVNYN
DQTWRLANMA HTLWYYAPKM EGTFSDGNAT GPGGQPKEYY DALDQYDKDF FKAYGYDQQS
DFFSPAPENR ISYPAWQIDL VDGSPASMAN TKVGDIATKY LPKAILAKPD KFDSVWDEYV
NQLHKEDIQA YVDRINDQLK WRAENWK
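
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative and not the database's own method. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last lines of the gene listing are used as sample input.

```python
import itertools

# NCBI codon ordering: bases in the order T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both start on a codon boundary.
first_line = "ATGATGTTAA GAAAGAGAAA TTGGCTAGTA GTACTATTAA CAGTTTTATT GGCAGTAAGC"
last_line = "TGGAGAGCTG AGAACTGGAA GTAA"
print(translate(first_line))
print(translate(last_line))
```

The two printed fragments can be compared with the first twenty and last seven residues of the protein sequence listed above. Translating the full gene listing in the same way should give the complete protein.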