Gene Ccel_2115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2115 
Symbol 
ID7310813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2477197 
End bp2478495 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content35% 
IMG OID643609049 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002506440 
Protein GI220929531 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.845793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAGCT TAAAAAAACT TTTATGTTTA GTACTATTTA CACTAGTAAG TATAATATCA 
GTTTCTTGCA GCAGTGACAA TTCGTATGTT TCAAATGCTC GTAACAGCAA AACTACTCAG
GCTGAAACCA GTACAATAAG TTTAATGACA AGTTGGGGAG GTGTTGATAG TAAAGCCGGC
TGCTTGATGG ATTTGCTTGA TAGATTTGAA AATGGAAATC CATCTATAAA AGTTTCTAAT
CAATCTATTT TTGGAGACGA ATATCTACCT ACTCTTAAAA CCAGGTTTGC ATCAGGAAAT
GAACCCGATG TTTTCGGATT ATGGCCATGT TCGGATATAA AGTATATGAT CATGGCAAAT
AAGCTTGCTG ACCTGACGGA CATGCTTACG AAAGACAGTG AGTGGATGGA TAGCTTTAAG
GGAAATTACT TTGACTTGAC TACCTATAAC AACAGGATAT ATGGGATACC TTTTGAGCTT
GTATTTGAAG GGATGTTTAT TAATAAGGAC TTATTTCAGC AATTTAATGT AAAAATACCT
CAAAACTATG AGGAGCTTAA AAACGCTATA AATATCTTTA ATAAACATAA TATAACGCCT
ATAGCCTATA ATGCTACCGC CGAAGGTTCA TATATATATC AGAATATGAT TGCCAGTCTA
GGCGGCAATG ATGGAGTAGA AAATTATATG GTTAATAACC AGATAAACAA ATGTTATATT
GATGCTATGA AGTATATGAA GGAACTTCAC AAGATGCATG CATTTCCGAC TGATTTAATC
TCTATTACCA GCGAAGAAAG AAACAACCTT TTTATAAAAA AACAGGCAGC CATGATAGTT
CAGGGTTCAT GGTTTGCAGC ATACTTCGGA AAGTTTGATA AGACTGTTGA GATGATACCT
TTTCCGTCCA TGGGAAATGG AAACAGAAAA ATACCAGCAG GTTTAGGCGG GGGAACCTTT
TATATAAGTA AAAGTGCCTG GGGTACACCC AATAGCAAAG AAAATACCGT AAAACTTTTG
AAATTCCTTA CGTCAAAAGA GACATCAGAT TATTTATACA AGGAATCCGG CCTATTCAGC
ACACTAAATA TATCAAGGGA AACCCCTTTT AATGCATTGG CAAAACAAAG TATAGATATA
TATGAGAATA CTCCTGAACA AGACCGGTGT GCAATCCCTG ATCATGTTAT AGATAGAAGT
ACTTGGGAAA AAATTATTGT CAAGGAGTTT CCAGATTATC TTGATGATAA AATATCTGCG
GAACAAATAT GGGAAAAAGC ATTAGACAAA TTGCAGTAA
 
Protein sequence
MQSLKKLLCL VLFTLVSIIS VSCSSDNSYV SNARNSKTTQ AETSTISLMT SWGGVDSKAG 
CLMDLLDRFE NGNPSIKVSN QSIFGDEYLP TLKTRFASGN EPDVFGLWPC SDIKYMIMAN
KLADLTDMLT KDSEWMDSFK GNYFDLTTYN NRIYGIPFEL VFEGMFINKD LFQQFNVKIP
QNYEELKNAI NIFNKHNITP IAYNATAEGS YIYQNMIASL GGNDGVENYM VNNQINKCYI
DAMKYMKELH KMHAFPTDLI SITSEERNNL FIKKQAAMIV QGSWFAAYFG KFDKTVEMIP
FPSMGNGNRK IPAGLGGGTF YISKSAWGTP NSKENTVKLL KFLTSKETSD YLYKESGLFS
TLNISRETPF NALAKQSIDI YENTPEQDRC AIPDHVIDRS TWEKIIVKEF PDYLDDKISA
EQIWEKALDK LQ