Gene Ccel_1133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1133 
Symbol 
ID7309943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1393138 
End bp1394493 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content41% 
IMG OID643608055 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002505470 
Protein GI220928561 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TAATTGGAAA AACCCTTTCT GCCGCTTTAG CACTAGCTAT GACAGTATCA 
ATTGCGGCTT GTGGTTCAGG GAACCAATCA GAATCATCAT CCTCTTCTGC AGCAGGTAGT
TCTTCATCAG CTGTTGCGAC AAATACTGCG GCATCAGGCG ATCCTGTAAA ATTTACTTTG
TGGCATGTTC AGACAACCGA TCCTATGCCG ACTAATATCC AGTCAGACAT TGATCGTTTC
ACTAAAGACA ATCCAAAGTA TTCAGTTGAT GTTCAGGTTA TGCAGAACGA TGCATACAAA
ACAAAGCTAA AAATTGCGTT GAGTTCAAAT ACTGCACCGG ATATATTCTT TAGTTGGAGC
GGCGGCCCAA TGAACGAATA TGTTGACGCA GACAAGATTG TAGATTTAAC ACCTTATATG
AATAAAGATG ATTATAAGGG ACGCTTTATG GATGCATCTA TCAATCAGGC TACATACAAA
GATAAAATCT GGGGTGTTCC AGTAGAAAAC ACAGCTGTTG CAATGTTCTT CTACAACAAG
GACTTATTTG CTAAATACAA TCTGCAGGTT CCTAAAACAA TAATAGAACT TGAGGCTGTA
AGTGATACAT TAAAGAAAAA CGGAATTATT CCTTTCTCAC TTGCAAACAA GACTCAATGG
ACAGGTTCAA TGTACTATAT GTACCTTGTT GACCGTATTG GCGGAGCAGA TGCCTTCAAC
AATGCAGCCG GACGTACCGG ATCATTTGAA GACGATGCAT TTACACAGGC AGGAAATATT
ATACAGGATT GGGTTAAGAA GGATTACTTC AACAAGGGAT TCAATGGTCT TGATGAAGAT
TCCGGTCAAT CCCGTACACT TCTGTACACT GAAAAAGCAG CTATGACTCT TATGGGTTCA
TGGTTCCTTT CAACAGCAGC GGGTGAAAAT AAAGACTTCA TGAAAAAAGT TGGTTCATTC
CCATTCCCTG CTTATGAGGG TGGTAAAGGT GATGCTAACT CAGTTGTTGG TACTGTAGGG
GATAACTTCT ATCACATAGC AAAGACATGT AAAGACCCAG AAGGTGCATT CAAGGCTATT
CAGTATATGA TAGACGAAAC AGCTGTTCAA AAACGTATTG AAGCAGGAAG AGTTCCTCCT
GTAAAGGGTG TAAAGGTTAG CGATCCTCTT CTCCAGAACG TTTTAGATGC AGTTGAAAAG
GCTCCTTCCG TTCAGTTGTG GTATGACCAA TATCTGTCAC CTGAATTGTC TGACCTCCAC
AAGAGTACGT CACAAGCTAT CTTCGGATTG TCAAAGACAC CTGATCAGGT TAACAAGGAA
ATGGAAGCAA AGGCTAAAGA GTTAGCAGGT AAATAA
 
Protein sequence
MKKVIGKTLS AALALAMTVS IAACGSGNQS ESSSSSAAGS SSSAVATNTA ASGDPVKFTL 
WHVQTTDPMP TNIQSDIDRF TKDNPKYSVD VQVMQNDAYK TKLKIALSSN TAPDIFFSWS
GGPMNEYVDA DKIVDLTPYM NKDDYKGRFM DASINQATYK DKIWGVPVEN TAVAMFFYNK
DLFAKYNLQV PKTIIELEAV SDTLKKNGII PFSLANKTQW TGSMYYMYLV DRIGGADAFN
NAAGRTGSFE DDAFTQAGNI IQDWVKKDYF NKGFNGLDED SGQSRTLLYT EKAAMTLMGS
WFLSTAAGEN KDFMKKVGSF PFPAYEGGKG DANSVVGTVG DNFYHIAKTC KDPEGAFKAI
QYMIDETAVQ KRIEAGRVPP VKGVKVSDPL LQNVLDAVEK APSVQLWYDQ YLSPELSDLH
KSTSQAIFGL SKTPDQVNKE MEAKAKELAG K