Gene Ccel_1136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1136 
Symbol 
ID7309946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1396496 
End bp1397791 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content38% 
IMG OID643608058 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002505473 
Protein GI220928564 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TATTCAAAGC CCTTATTATA CTGATGGTAG TAGTTATTTC CGGGTGCTTT 
TATGACGGAG ATAACGTGCT GGTAAAATCA AACCCGCCAA AAAAGCAATT AACTTTGTAT
ACTATACAGG GTGATTCATC TGTAAACCAA GTGATTGCCG ATTCTGTATA CAGATTTGAA
AAGGACAATA AAAGCTTTAA AGTTATAAAC GAGCTTATTC CCAATGACTT ATACAAGAAC
CGACTATCAG TCTGTGTAGC AACGAATCAG ATGCCTGATG TTTTCCCTAC CTGGTCTGGA
GGGATTTTGA AACAGTATAT AAGTATTGGT GGGGTAGTTA ATCTTGATAA ATATATGAAA
AATGACAATT ACAGTTCACG GTTTAATGAC AAGGCGTTAA ATATGGTTAC AGATGAGAAT
GGAATATGGG GTGTCCCTGT GGAAAATATG TCAATTGCTC TAGTATTCTA TAACAAAGAC
ATTTTTAATG CCTTGAAGTT ATCCGAACCA AAGACTTTTG ATGAACTCAA GAATATTATA
GTTAAGTTAA AACAAAGGAA TTATATTCCG TTTGCTCTTG CAAACAGGAC GGCTTGGACC
GGGTCGATGT TTTATATGTA TTTTGTAGAT CGCGTGGGGG GGCCATCTGT TTTTGATAAT
GCTGCAAACA GGAAGAATAA CGGTTCTTTT GATGACGATG TATTTGTGCA GGCTGGTAAA
ATGGTACATG AGCTCGTAAA TATGGGTGCT TTCCCGAAGG GCTTCAACTG GATGGATGAG
GATGCCGGGG ACTCCAGAAA CCTTTTATAT AATAATTCGG CAGGTATGCT ATTGGCTGGT
AGTTGGTTTG TTAGTAATGT CATGTACGAG AAACCTGATT TCGCAGAAAA GATAGGTGTG
TTTCCATTTC CTTCAATTTC AGGGGGAAAG GGTGATCCCC GTAATACTAT TGGAACACTT
GGGGACAACT TTTACTCCGT TGCAAGTTCA TGCGGGTACC CTGACAAAGC ATTCGAACTT
ATAAAATACT TAATTGATGA TACTGCTGAA AAGAAGCGTA TTGATGCGGG AAAAATACCG
CCTGTAAAGG ATCCCGACGT AGAGAATCCT TTGATTAAAG AAATATTGGG TTATATAAAT
CAGTCTCCCA ATGTTCAATT CTGGTATGAC CAATACCTTC CTCCAAAACT GTCGGAAGCC
CATTTAATGC TTTCACGGAG TATATTTGGA GGTGAAGACC CAAAAAAAGC TGCCGAGGAG
ATGGAAAAGA TTACCAAACA ATACTATAAT CAATGA
 
Protein sequence
MKKIFKALII LMVVVISGCF YDGDNVLVKS NPPKKQLTLY TIQGDSSVNQ VIADSVYRFE 
KDNKSFKVIN ELIPNDLYKN RLSVCVATNQ MPDVFPTWSG GILKQYISIG GVVNLDKYMK
NDNYSSRFND KALNMVTDEN GIWGVPVENM SIALVFYNKD IFNALKLSEP KTFDELKNII
VKLKQRNYIP FALANRTAWT GSMFYMYFVD RVGGPSVFDN AANRKNNGSF DDDVFVQAGK
MVHELVNMGA FPKGFNWMDE DAGDSRNLLY NNSAGMLLAG SWFVSNVMYE KPDFAEKIGV
FPFPSISGGK GDPRNTIGTL GDNFYSVASS CGYPDKAFEL IKYLIDDTAE KKRIDAGKIP
PVKDPDVENP LIKEILGYIN QSPNVQFWYD QYLPPKLSEA HLMLSRSIFG GEDPKKAAEE
MEKITKQYYN Q