Gene Ccel_0145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0145 
Symbol 
ID7312064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp164518 
End bp166110 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content38% 
IMG OID643607074 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002504513 
Protein GI220927604 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC TAAGTACCAG AATTGTAAGC CTGGCACTGT TAACATCAAT GAGCGTCACT 
CTTTTTGCTG GGTGCGGTTC AAACGGCGGA ACAGACGCAG ATAGTTCAAA GAGTGCTTCA
TCTTCAGGTG CAGCAAAATC GGGCCCAGCG GTAGAATTAA CCGTAGAAGT TTTTGACAGA
GCTACACCAG GATATAAGGC TGATGATAAT TTCCAGATAA AATGGATTCA AGAGAACTTT
GGTAAGCCAA ACAATATTAA TGTTAAGTTT GTACCTGTTT TAAGACAACA GGAAGTAGAA
AAGCTGAACG TTCTTATGGC ATCAAACCAA GCACCGGATA TCAGCTTTAC TTATAATGAC
GGTATTATTT ACAACTATGT TAAGAGCGGA GGACTTACTG ATTTAGGAGA CCTTTTAACA
AAGAATGCTT CCAATCTTAC AAAATATCTT GGGCAAACTT TACTTGATTA TGGCAAATTT
GACGGAAAGC AGCTTGCAGT TCCTGCAAAG AGAGTTATAG AAGGCTGCTT CTCGGCATAT
ATCCGTAAAG ACTGGCTGGA TGCAGTCGGA ATGTCAGTTC CTACTACAAC TGATGAATGG
TATCAGGTAA TGAAGGCATT TAAGGAAAAG GATCCCGGAA AACTTGGAGA CAAGAACTAT
CCTTTTAGTA CATTTGTTGA TCCAAACAAT ATAAACTGGA CTACATCAAT GTTGTTAGAG
TCCTTCAAAC AGCCTATTTC AGAAGAACAA AGAATGACAT TGCCAAACTG GGTAATTCCT
GGATTTAAAG ATGGTATGAA ATTCTTGAAT AAGCTATACA ACGAGGGTAT ATTGAATCCT
CAGTTCGCCT TGGATAAAGA CGGAAAGCAG TATGAAAAAG ATGTATCACA GGGTAGAATT
GGTTTCATGA TACATAATTA TGACTTCCCC ATCAGAGTTA CTCCCGGATT ATTATCTGAG
TTGAAAAAGC AGGTTCCTGG AGCAGATATG GTTCCATGTG ATCCGTTCAC AAACTCGGAT
GGTAAACATC CAAAAATGAA ATATAATCCT AATGGATTGT ATATAATAGT TCCAAAAGTA
AGCAAACATG CGGAAGAAGC CGTTAAGTAC CTTGAATGGC AATCAAAACC AGAAGTAATT
AAGTTCCTGC AAAATGGTAT TAAAGGTGAC CAGTATACTG ATGAAGTTGA CGGTATCCCG
GCTAACTTTA TACAGAATGA TCAGCTTTCT GATGACAAGA AAGCTAACTT CACTGATTTG
GCATTGATTG TTAACGGTAA AGAATTCGGA GATCCTGCAA AGAACATTCA GGCGGCATCT
TTCGGATACC CCGGATTTGA AGATACATTT AAGAAGGCAT ATGACATTTC TCTTACAGAT
GCAAACTATA TTCCACACTT TGATACTGTT ATAGAAGCAC AGGCAAAATA TCAGAAGGCT
CTTTCCGACA AAGAAGCAGA AATATTTGTT AAGAGTATCA CTTGCAAACC TGCTGATTTC
GATAAGACTT ATGACAAACT TGTTGCTGAG TACATGAAGT CAGGCGGACA GGAAATTGTT
GATGAGAAAT TAGCTGCATT GAAGAAGAAA TAA
 
Protein sequence
MKKLSTRIVS LALLTSMSVT LFAGCGSNGG TDADSSKSAS SSGAAKSGPA VELTVEVFDR 
ATPGYKADDN FQIKWIQENF GKPNNINVKF VPVLRQQEVE KLNVLMASNQ APDISFTYND
GIIYNYVKSG GLTDLGDLLT KNASNLTKYL GQTLLDYGKF DGKQLAVPAK RVIEGCFSAY
IRKDWLDAVG MSVPTTTDEW YQVMKAFKEK DPGKLGDKNY PFSTFVDPNN INWTTSMLLE
SFKQPISEEQ RMTLPNWVIP GFKDGMKFLN KLYNEGILNP QFALDKDGKQ YEKDVSQGRI
GFMIHNYDFP IRVTPGLLSE LKKQVPGADM VPCDPFTNSD GKHPKMKYNP NGLYIIVPKV
SKHAEEAVKY LEWQSKPEVI KFLQNGIKGD QYTDEVDGIP ANFIQNDQLS DDKKANFTDL
ALIVNGKEFG DPAKNIQAAS FGYPGFEDTF KKAYDISLTD ANYIPHFDTV IEAQAKYQKA
LSDKEAEIFV KSITCKPADF DKTYDKLVAE YMKSGGQEIV DEKLAALKKK