Gene Ccel_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0054 
Symbol 
ID7308973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp62580 
End bp63638 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content40% 
IMG OID643606983 
Productperiplasmic sugar-binding protein 
Protein accessionYP_002504422 
Protein GI220927513 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000350797 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGATTA AGCAACTAAA AATATTTCTA CTACTTGCAA GTCTGATTAC TATGGTTCTT 
TTAGCCGGTT GCTCTTACAA AAACCCAATC GATAAACCTG ATAAAATTAC TATTGGTCTC
TCCATGGCAA CCCTCCAAGA GGAACGTTGG CACAGAGACA TAGAAGCTCT CAGGGCCAAG
GCCCAAGCCA AAGGAGCAGA AATTCTTTTC CGAAATGCAA ACAATAATAT AAATGACCAA
ATTTCCCAGG TTAAAAGTCT GTTGTCAAAG GATATTGACA TTCTTGTAAT CGTTCCACAG
GACGCCGAAA AGTCTCAGCA GGCTGTTCAG CTTGCCAGAA ATAAGGGAAT AAGGGTAATC
TGCTATGACA GACTGATTAA AAATTCAAAC ACTGATTTCT ATGTATCTTT TGATAACATA
AGAGTTGGAG AATATATGGC ATCCCTGATG GTATCCAAAG TACCTAAAGG GAACTACATA
TTAATTAATG GAGCTAAAAC AGATTACAAT AGCTTTATGT ATAACAAGGG TTTTAAAAAT
ATTTTAGGCA AATACTTATA CGAAGGGTCC ATAAAAATCG TTGACGAAGT GTGGGCAAAT
GACTGGAAAC CGGAGGATGC CTTTAAGTGT GTGGACAAGG CTCTTCGGGA CGGAAAAAAG
ATTGATGCTA TTATTGCCGC CAATGACAGT CTTGCCGGCG CAGCAATCAA GGCTCTTTCT
CAAAGGCGAT TGGCAGGAAA AGTTCCTGTT GCCGGGCATG ACGCCGATAT TTCAGGTTGT
CAGAGAGTAG CTGAAGGTAC TCAGTTGCTG ACCGTTTACA AACCTATAGA TCAATTAGCC
GAAAAGGCAA TTGACGTTGT ACTGGGACTT TTGAATAATG ATTATTATGC CTGCAATAAA
TTTATTAATG ACGGTGAAAG TGATATTCCC TATGAGATGG TGGAACCTGT CGTAGTCACA
AAGGATACCC TTGTTGACAC TGTTATAAGT GCAGGCTTTC ATAAACTTGA AGATGTATAC
CGTAATGTGC CTGAAAGTAA ATGGCCTCGA AAAAAATAG
 
Protein sequence
MWIKQLKIFL LLASLITMVL LAGCSYKNPI DKPDKITIGL SMATLQEERW HRDIEALRAK 
AQAKGAEILF RNANNNINDQ ISQVKSLLSK DIDILVIVPQ DAEKSQQAVQ LARNKGIRVI
CYDRLIKNSN TDFYVSFDNI RVGEYMASLM VSKVPKGNYI LINGAKTDYN SFMYNKGFKN
ILGKYLYEGS IKIVDEVWAN DWKPEDAFKC VDKALRDGKK IDAIIAANDS LAGAAIKALS
QRRLAGKVPV AGHDADISGC QRVAEGTQLL TVYKPIDQLA EKAIDVVLGL LNNDYYACNK
FINDGESDIP YEMVEPVVVT KDTLVDTVIS AGFHKLEDVY RNVPESKWPR KK