Gene Ccel_2111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2111 
Symbol 
ID7310809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2470953 
End bp2471873 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content36% 
IMG OID643609045 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002506436 
Protein GI220929527 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.157384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATC TCTCAAAGAA ATCAGCACCT TATCTTTTCA TGGCACCTTA CTTCCTTCTT 
TTTCTAACAT TTTCTCTAAT ACCTATAATA TTTACTTTGT ATATGAGCCT TAATGAATGG
AACGGATACG AGGGAATAAA ATTTGTAGGA TTAGCAAATT ATCAAAGAAT GTTTGAAAAT
GGGGAGTTTC TTCTAGCTTT GAAAAATACT GCGATTATCA TGTTGCTGTT GATACCGTTG
CAGTTGATAT TTGCAGTAAT TATTGCCTAT ATGATTAATT CAAAGCTTGT CAGGCACAAA
GAAATTTTTA AGACTGCGGT TTTTACTCCA TATCTTGTTA TACCAATAGC TGCAGGTTTG
TTGTGGGCAT TTTTCTTCGA TGGAGGTTCA TCAGGAACAA TCAACGCAAT CTTAATGAAA
TTGCATATTC TTAAAGAGCC GGTTGATTAT CTTGCAAGTC CCAAATTAGC TAAAGTAGTT
ATTGCAGTTA TCATGTTGTG GAGATACACC GGATATTGTG TACTGTTCTT TATAGCCGGT
TTTGTATCAG TACCGGAGGA ATTGTATGAG GCTGCACGTG TTGATGGAGC CAGAGCATGG
CACAATTTCT GGAACATTTC TCTTCCAATG ATAAAGCCTA TAATAATATA TATGGTTATC
ACATCCCTTA TAGGTGGTTT CCAGACATTT GATGAACCAA ATATTATTTA TACTCAGGGA
AATTATCAAA CGGGACCATA CTCAGGGGGA CCTGATGGTG CAGTTTTGAC TTTGGTAATG
CTTATGGGTA AGGGTGCGTT CCAGAACTCT CAATATGGTT ATGGTAGTGC TGTTGCATAT
GGAATGTTTG TAGTAATAGC AATATTCTCA TTTGCTTCAT TAAAGTTCAT GAATAGGGGG
GATAAAGATG GAGCAATCTA A
 
Protein sequence
MKNLSKKSAP YLFMAPYFLL FLTFSLIPII FTLYMSLNEW NGYEGIKFVG LANYQRMFEN 
GEFLLALKNT AIIMLLLIPL QLIFAVIIAY MINSKLVRHK EIFKTAVFTP YLVIPIAAGL
LWAFFFDGGS SGTINAILMK LHILKEPVDY LASPKLAKVV IAVIMLWRYT GYCVLFFIAG
FVSVPEELYE AARVDGARAW HNFWNISLPM IKPIIIYMVI TSLIGGFQTF DEPNIIYTQG
NYQTGPYSGG PDGAVLTLVM LMGKGAFQNS QYGYGSAVAY GMFVVIAIFS FASLKFMNRG
DKDGAI