Gene Ccel_2458 details


Gene Information

Locus tag: Ccel_2458
Symbol:
ID: 7311127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2971590
End bp: 2973356
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 42%
IMG OID: 643609388
Product: extracellular solute-binding protein family 1
Protein accession: YP_002506767
Protein GI: 220929858
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
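
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2971590
end_bp = 2973356
gene_length_bp = 1767
protein_length_aa = 588

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions the three fields are mutually consistent: 1767 bp is 589 codons, of which 588 encode amino acids.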


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGT GGCTTATCAA ATCCCTAAGT GCTGCCATTA TCCTCAGTTT TACCGTAAGT 
ATGGCAGCTT GTACCAAACC GGGTACCGAC AGCAGCTCGT CATCCGGTTC CGGCAGTTCT
ACATCCGGCA GTAACCCTTA TAAGGACACT GTGACTTTAG ATGTTTACAC AATGACAGGC
AATTTTCAGG GTGAACAGAT AGGATGGTTC GGAAAGGTAA TCAAGGACAA ATTTAATCTG
AAACTAAACA TTATTTCTGC TCAGACAGAC GGGAATGCCG ATCAGTCCTT TCAGACACGT
TCTGCTTCCG GAGACCTCGG TGATATCGTG GTATATGGTG CCATAGATAC AAAATTCACC
GATTCTTTAA AGGCGGGTCT TTTAATGAAG CTGTCTGATA ATGATTTGCT GGCAAAGCAC
GGAGATAACA TTGTAAAGAA TTTTTCAGGT GCTATTAAGC GTATTTCCGC TAAATACGGC
GATTATGCAA TTCCCAATAA TGTATCTAAC GAGTCTCCCA TTACTCCTTC CGAGGATTCT
GACCTTACAT TTGGTGTATA CACCCGTTAC GATTACTATC AGGAAATCGG TTCTCCTAAG
CTTAATAGCT TCGATGATGT ATATCCGATG TTAAAAGCAA TGGCAAAAAA GCATCCTACC
AACTCAAATG GTCAGAAGCA GTATGCTTTT TCACTGTTTA AAGATTGGGA CGGCTGCATG
ATGATGTTCG GTAAAATGAT TGCAGAGCTT TACGGCTATG AGGAAGCTCC CTTCGGCGGT
TTCCTGCTGA CAAACAATAT GGCTACCGAG TATCAGAGTA TTATTGATCC GGACGGTTAC
TATGTCAAGG CTCTTAAAGT TTATAATCAA GCATACCGTG ACGGTTTGCT TGACCCAGAT
TCAATCAATC AGACCTGGGA TGATATTACC AAAAAGTATG CAAACGGTCA GGTTCTGTAT
TCCCAATTCA GCTGGCTTGG CCCTAACAAT TTTAACAATG CTGACAACAT ATCAAAGGGT
GTAGGCTTTG CCCTTGTTCC TATAGCTGAT GAGAAAATCT GGTCAAATGG ATTTACTCCA
AACGGCAGCA CCTACCTTGT AGCTCTTGGT AAATCCTGTA AGAACCCCGA ACGTGCCATG
GATTTCTTAA ATTGGTACTA TTCATCTGAA GGTGAAATGA TAATAAAGAA CGGCCCTGAA
GGTCTTGCAT GGAGAATAGA AAATGGCAAA CCAACATTAA CTGATTTTGG TAGAAAATGT
ATGCCATCCA TTGCAGTTGA AGTATCTAGT GAGTTTGGTG GCAGTGATTG GCAGACAGGG
GAGTGTAAAG TCGGCTTTGA CGGCTTAGCA GCAAACTCTA TAAATCCAAA TACCAAAGAG
CCATACTATT ACAAGCTTTG GTCGTCGACC CTGTCATCCA ACACAAGTAC CCTTCAGAAA
AACTGGAGTG CAGCGATGGA CGATGCTCTG TCCACCAAAG ACTGGCTTAT GAAGAATAAC
CATGTTTCAG TTTCTCCCGG CACTGATTAT GTTGGACCTA CCCTTCCTTC CGATATTCAA
TCAACTCAGG ATGTTGTTAA AATGGATATT CGTGATTATT CCTGGAAAAT GGTTTATGCC
AAGAACGATA GTGAATTTAA TTCTCTTCTT AAGCAAATGA CTGAGAAGGT AAAAGGCGAA
GGTTATGACA CCGTTGTTGA ATGGAATAAA CAGCAGTTGG CTGAACTTTC TGCGGCCCGC
AAGCAGGCTG CAGAAAACAG CAAATAA
 
Protein sequence
MKKWLIKSLS AAIILSFTVS MAACTKPGTD SSSSSGSGSS TSGSNPYKDT VTLDVYTMTG 
NFQGEQIGWF GKVIKDKFNL KLNIISAQTD GNADQSFQTR SASGDLGDIV VYGAIDTKFT
DSLKAGLLMK LSDNDLLAKH GDNIVKNFSG AIKRISAKYG DYAIPNNVSN ESPITPSEDS
DLTFGVYTRY DYYQEIGSPK LNSFDDVYPM LKAMAKKHPT NSNGQKQYAF SLFKDWDGCM
MMFGKMIAEL YGYEEAPFGG FLLTNNMATE YQSIIDPDGY YVKALKVYNQ AYRDGLLDPD
SINQTWDDIT KKYANGQVLY SQFSWLGPNN FNNADNISKG VGFALVPIAD EKIWSNGFTP
NGSTYLVALG KSCKNPERAM DFLNWYYSSE GEMIIKNGPE GLAWRIENGK PTLTDFGRKC
MPSIAVEVSS EFGGSDWQTG ECKVGFDGLA ANSINPNTKE PYYYKLWSST LSSNTSTLQK
NWSAAMDDAL STKDWLMKNN HVSVSPGTDY VGPTLPSDIQ STQDVVKMDI RDYSWKMVYA
KNDSEFNSLL KQMTEKVKGE GYDTVVEWNK QQLAELSAAR KQAAENSK
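
To check that the protein sequence follows from the gene sequence, the nucleotides can be translated with the codon assignments of translation table 11. These assignments are the same as in the standard code; the two tables differ only in the start codons they permit. What follows is a minimal sketch, assuming the sequences are pasted in the space-grouped layout used above. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first 30 bases of the gene are used.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group the sequence display."""
    return "".join(seq.split()).upper()

def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
prefix = clean("ATGAAAAAGT GGCTTATCAA ATCCCTAAGT")
print(translate(prefix))  # MKKWLIKSLS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence through `clean` and `translate` should reproduce the complete protein, and `gc_percent` can be compared against the reported GC content, though neither is verified here.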