Gene Ccel_2997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2997 
Symbol 
ID7311607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3547922 
End bp3549310 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content37% 
IMG OID643609901 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002507271 
Protein GI220930362 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000366828 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAA ATAGGATTTT ACCATTAGTT CTGAGTGTGG CAATGATAGC TATAGCCTTT 
ACTGCATGCG GATCAAAGGA GAAGCCTAAT GACACAGATT TAACCTCGTC AGCGTCATCA
ACCGCAGCAG GTACAACTTC AAAAGCAAAG GATATTAAAG GAGAAATTCT GTTCTATAAT
ACCAGAACTG ATATGGAACT GGACAGTTAT GACAAAAACT GGGAATATTA TATTGGAGAA
TTTAATAAAA ATTATCCTAA TATAGAAGTA AATATTGAGA CATCAAAGGA TTATGAAGGT
GATCTTGCAA TCCGAATGAA TTCAAACGAA TACGGTGATG TTTTATTCAT GTCTGCAAAA
ATGAAGGATT CGGATCTTCC TAGCTTTTTC ATACCGTTAG GAAAAAAAGC AGATTTGGAA
AAGAAGTATG ATTTTGTTCA GGACAGATAT GTAGGTGAGG ATATATACGG AATTCCACCT
AACGGAAACG GACAGGGTAT AGTATATAAC AAGGCTGTAT TTGCAAAGGC TGGTATTACT
TCTTTGCCGA AGTCTGAGGA TGAATTCCTT TCTGATTTGA AACTTATTAA AGAAAAAACA
GACGCTATTC CTTTGTATAC AAACTACAAG GACAGCTGGG CCCTCAATGC ATGGGAAGGA
TACATAGATA GCGTGTCAGG CAGTGATACA TACACTAATC AGGTAATGCT TCACGAGGAT
GATCCGTTTG CACCCGGGAA ACCTCACTAC ATAGTTTACA AGTATTTATT TGATGTTGTA
AGTCAGAAAT TGGTTGAGGA CGATCCGATG ACTACTGACT GGGAAAGTTC AAAGCAACTA
CTGGCAGATG CTAAAATTGC AACTATGCCT TTAGGCTCAT GGGCAATTCC ACAGATTAAA
TCAAAAGCAA AAAATCCGGA TGACGTGTCA TACATGCCAT TCCCATATAA TATTGACGGA
AAAATGTATA CACAGGCTGC AGGCGACTAC AAGCTCTGCA TAAGTAAGAA CAGTAAGAAT
ATAGAAGCAG CAAAGGCATT TTTATGGTGG TTCCTTGATG AATCAAACTA TGCTCAGAAT
GAAGGGCTTA TACCATCACT TAAAGGCTCC GCATATCCTG ATACATTAAA AAACTTCCAA
GATATGGGAG TAACACTCCT TATTGATAAG GGAGCTATCA ATGATGTAAC CAAAAACGAA
AACGGATGGC TGGATGCTAT AGATAAAGAG TCAGAAGTAG GCTTGTGGAA TGAAAACTTT
AAGAAGGATA TTGTAGATAC AGCACTAGGT AATAAAAAAG GTACATATAA TGATGTTATG
AATGGCCTTA ACAAAAAATG GGCTGATACC AGAGCAAAGC TTATAAAAGA AGGAACTATA
GGTAAGTAA
 
Protein sequence
MIKNRILPLV LSVAMIAIAF TACGSKEKPN DTDLTSSASS TAAGTTSKAK DIKGEILFYN 
TRTDMELDSY DKNWEYYIGE FNKNYPNIEV NIETSKDYEG DLAIRMNSNE YGDVLFMSAK
MKDSDLPSFF IPLGKKADLE KKYDFVQDRY VGEDIYGIPP NGNGQGIVYN KAVFAKAGIT
SLPKSEDEFL SDLKLIKEKT DAIPLYTNYK DSWALNAWEG YIDSVSGSDT YTNQVMLHED
DPFAPGKPHY IVYKYLFDVV SQKLVEDDPM TTDWESSKQL LADAKIATMP LGSWAIPQIK
SKAKNPDDVS YMPFPYNIDG KMYTQAAGDY KLCISKNSKN IEAAKAFLWW FLDESNYAQN
EGLIPSLKGS AYPDTLKNFQ DMGVTLLIDK GAINDVTKNE NGWLDAIDKE SEVGLWNENF
KKDIVDTALG NKKGTYNDVM NGLNKKWADT RAKLIKEGTI GK