Gene Ccel_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1016 
Symbol 
ID7309841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1264277 
End bp1266034 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content40% 
IMG OID643607943 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002505358 
Protein GI220928449 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGT TTATATCCAA AGCTATTATT TGTGCCACGG TTACAGCCCT GTTATTAACT 
GGCTGCGGAT CGGGGACTAA TACTGAAAGC ACTGCAAGTT CTTCTGCCGG AAACTCAAGT
GTTCAGGCTC AAAAGTCTGA TATTTCATTT CCATTAAAGG AGAAAGCTAC ATTAACAGCA
TTCGTAATGA CTCCTTACTC TGGTGAAAAC GGTGACTATA CCAACAACTA CGTTACCAAC
TACCTAGAGG AAAAGCAGAA TATTAAAATT GATTTCAAGT ACTCCGTAAC CGGCGATGAC
GGTAAAACCA AGCTGAACTT GCTTATGGCC AGCGGAGAAA AATTGCCTGA CATATTTTTA
TCAACGAAAT GGTCCAAAGC CGAAACTATG CTTTATGGTA AGCAAGGACT GATTATACCG
TTGAATGATT ACTTAAAAGA TGCACCTAAC TGGAATGAAT TAAATAGGGT TAGTCCGTTA
AGATTGGGAG ATATCACAAT GCCTGACGGA AACATATATT GCTATGGAGA CGATAATGAG
TGCTTCCATT GTATGTTCCA GTCAAGAATG TGGATTTACA AACCTTGGGT TGATAAACTA
ATGGGCGGAA AAATGCCTTC CACTACAGAT GAACTGTATA CGTTTTTGAA GGCTGTCAAA
GAAAAAGATC CTAACGGCAA CGGAAAGGCT GACGAGATTC CCTTCACCGG TAATATTGCT
GCCGGAGGTT GGGCAACTGA TCCGACAACC TTTATAACAA ATGCGTTTAT TCAGAATAAT
AACATATTGT CAAATACAAA CCCTGTAGTA GGGGCAGGAT TTGTTGTAAA TAACGGTAAA
GTTGAATATC AGTTTACAAA AGACGAATAC AAGGATGCCT TAGTATACTT AAACAAGCTT
TATAAGGAAG GCTTACTGGA TTCACAGACC TTTACACAGA ATGCGGATCA GCAGAAAGCT
ACTGTACAAG GAACTCCTCA ACTGGCTGCC ATGGCACCGG GAGGTTGGTG GCCGTGTAAC
ACAGATGAAC TTTTGAAGGA GCAGGAAGGT TCATATCAAG ATTGGGTGGT TTTAGAGCCT
ATAAAAGGAC CAAACGGAGT ACAGCTATCT GCCTACTATC CAACAAACTA TTTCCAGAGC
AACTATGGTC TAGTATCTGC TGACTGTAAA AATCCTGAAC TAGCCGTTAA ATTCTTTGAT
TTGCTTGCAT CACAAGAAAT GACTCTTATT ACACAAAATG GACCACAGGG TATAGCATGG
GATTATGTAA CAGAAGGTAC TTCAATTGCA GGCGGAGAAG CTAAATGGAA GAAAATACCT
GCCAAGAAGT TAAGAAGCAG CCAGATTCCG GATTATTCCG ATGAAGGCTT GGATTTTGTA
AAATATGTTT GGGATCCAGA TGCAGTTATG ACTCATAATA CAAATGAGTT CAGACTTAGC
CAGTACTGTG CTAATCCTGA GACCAGCGTT GAGGCATTGT TGTATCAATG CGGTAAAGCA
TATTCAAAAT ACAAACCTGA CGACGCTACA ATGCTTCCAA ACCTCGCCTA CTCGGAAGAG
GATGCTAAGA AAATTGCCGA CTACACAGTT TCAATAGGTA AATTTGTAAA TCAGGCTACT
GTTCAGTTTA TTACAGGTGA CCTGGATATT AATACATGGC AAAATTATGT AGACAAGATT
AATAGTATGG ATTTGAAGGG ATACCTAGCT ATTCAACAAA ATGCATACGA CCAATATGCA
AAAAGTTTAA ACAAATAG
 
Protein sequence
MKKFISKAII CATVTALLLT GCGSGTNTES TASSSAGNSS VQAQKSDISF PLKEKATLTA 
FVMTPYSGEN GDYTNNYVTN YLEEKQNIKI DFKYSVTGDD GKTKLNLLMA SGEKLPDIFL
STKWSKAETM LYGKQGLIIP LNDYLKDAPN WNELNRVSPL RLGDITMPDG NIYCYGDDNE
CFHCMFQSRM WIYKPWVDKL MGGKMPSTTD ELYTFLKAVK EKDPNGNGKA DEIPFTGNIA
AGGWATDPTT FITNAFIQNN NILSNTNPVV GAGFVVNNGK VEYQFTKDEY KDALVYLNKL
YKEGLLDSQT FTQNADQQKA TVQGTPQLAA MAPGGWWPCN TDELLKEQEG SYQDWVVLEP
IKGPNGVQLS AYYPTNYFQS NYGLVSADCK NPELAVKFFD LLASQEMTLI TQNGPQGIAW
DYVTEGTSIA GGEAKWKKIP AKKLRSSQIP DYSDEGLDFV KYVWDPDAVM THNTNEFRLS
QYCANPETSV EALLYQCGKA YSKYKPDDAT MLPNLAYSEE DAKKIADYTV SIGKFVNQAT
VQFITGDLDI NTWQNYVDKI NSMDLKGYLA IQQNAYDQYA KSLNK