Gene Ccel_0200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0200 
Symbol 
ID7309104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp224941 
End bp226266 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content40% 
IMG OID643607129 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002504567 
Protein GI220927658 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAG TGTCAGTGAC AAAAAGAGGA GTAGCACTTA TACTGGCTGG TGCATTAACC 
GTTGGAATGG CGGCTTGTGG CAAGAATACA ACCAATAATA ATGCTGCGGG CACAACTAAT
AATGCAGGTG GAAACAAGGC TAAAAATGTT GAAATTAAAT TCAGTCATAT TTGGGGATCA
GCAGCAGACC CCTTTACACC AGCTGCTAAG AAAGTTATTG AGGACTACCA GACTGCAAAT
CCAAATGTTA AGATCGCTGT CGATACAAAC GAAAATGAAG CATATAAGAC AAAAATCAAA
GCAATGGCAG CAGCTAATGA ACTGCCTGAT ATTTTTTCCA CATGGGGCGG CGGATTCTCA
CAGCCATTCA TTCAGTCAAA ATCAGTAGTA CAACTTGATC AATATCTGAC GGATGATATA
AAGAACAAGC TTGTTAACGG TGCATTGACC AATGTGACTT ATGACGGGAA GGTGTATGGA
TTACCCTTCT TCTTGTCAGT AGGAGCAATG TTTGTCAATA CAGAGCTTTT TGATAAAAAT
GGAGTCAAGG TTCCTACTAC ATGGGAAGAG CTTCTTACCG CAGTAAAAAC CTTTAAAGCT
AAAGGAATAA CACCTATGGC TGTATCAGGT AAGGACAAAT GGACAATAGC AATGTACTTT
GACGTAATGG CACTAAGAGC TGCAGGCCCT GAAAAGGTAA CCAAAACTCT TACAAAGCAA
GGTTCATTCA AGGACCCAGA ATTCCTCAAT GCTGCTAACA GATTTAAAGA GTTAGTTGAT
GCAGGAGCAT TCTCAAAGGG TGCTGCAGGT GTTTCAAATG ATGAAGCTGA AGTACCATTT
TTTGAAGGAA AAATTCCAAT GATGTTCAAA GGTAGCTGGA CAGCAGGAAA AGCAGGTTCA
AAGGATTCCA AGGTTGCAGG AAAGGTCAAG GCAATATCCT TCCCATCAAT ACCTGGCGGT
CTGGGAAATC CAAAGCAGTT TACAGGCGGA GCTGTTGATG CTGTAATGGT AAGCGAAAAT
TCTAAGAACA AGGAAGAAGC AATTAAATTC CAGATATATT TCGCTGAAAA TATGGCTAAA
GAATCATACT TGTCCGGTGC ATCAATGCCT GCATGGAAAA CAGATGTTGA CGAAAGCAAG
GTTAATCCTT CGCTTGTTGA CGTTGTTAAT CTGACTAAGG ACGCTGAATC ATATACAATC
TGGTGGGATA CACTTCTTGC CGGCAAAGAT ACAGAAACTT ATCTCAATGC TTTGCAGGAA
TTATTTATGG GTACAAAAAC ACCTCAACAG TTTGTTAACA GTTTGCAGAC AATTTATGGT
AAGTAA
 
Protein sequence
MNKVSVTKRG VALILAGALT VGMAACGKNT TNNNAAGTTN NAGGNKAKNV EIKFSHIWGS 
AADPFTPAAK KVIEDYQTAN PNVKIAVDTN ENEAYKTKIK AMAAANELPD IFSTWGGGFS
QPFIQSKSVV QLDQYLTDDI KNKLVNGALT NVTYDGKVYG LPFFLSVGAM FVNTELFDKN
GVKVPTTWEE LLTAVKTFKA KGITPMAVSG KDKWTIAMYF DVMALRAAGP EKVTKTLTKQ
GSFKDPEFLN AANRFKELVD AGAFSKGAAG VSNDEAEVPF FEGKIPMMFK GSWTAGKAGS
KDSKVAGKVK AISFPSIPGG LGNPKQFTGG AVDAVMVSEN SKNKEEAIKF QIYFAENMAK
ESYLSGASMP AWKTDVDESK VNPSLVDVVN LTKDAESYTI WWDTLLAGKD TETYLNALQE
LFMGTKTPQQ FVNSLQTIYG K