Gene Ccel_2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2109 
Symbol 
ID7310807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2466969 
End bp2469404 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content38% 
IMG OID643609043 
Productglycosyltransferase 36 
Protein accessionYP_002506434 
Protein GI220929525 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAATACG GTTTCTTTGA TGATACAAAT AGAGAATATG TCATTACAAC ACCTAAAACA 
CCTTATCCTT GGATTAATTA TCTGGGAACT CAGGAGTTCT TCTCTTTGAT ATCAAATACT
GCAGGCGGTT ATTGTTTTTA TAAGGACGCA CGTTTACGTA GAATAACCCG TTACAGATAT
AATAATGTTC CGATAGACAT GGGAGGAAGA TATTTTTATA TAAACGACAA TGGCGTTCTC
TGGTCACCGG GATGGTCTCC AGTTAAGGCA GATTTGGATA AATATGAGTG CAGGCATGGT
TTGGGGTATA CAAAAATAAC CGGAAGTAAG AACGGTATCA GTACAGAGGT TTTATATTTT
GTACCATTGA ACTTTAATGG TGAAATTCAT CGTGTAAGAG TTAAGAATAC AACATCTGAT
AATAAAAGTG TTAAGTTATT CTCATGTATT GAATTCTGCT TATGGAATGC CTATGACGAC
ATGACAAATT ATCAGAGAAA CCTAAGTACA GGTGAGGTTG AAGTTGAGAA CTCGGTAATA
TACCATAAAA CAGAATATAA GGAAAGAAGA AACCATTTTT CCTTTTACTC TGTCAATGCT
GACTTAACAG GCTTTGATAC GGATAGAGAT GAGTTTATTG GCTTATACAA TGGTTTTGAC
GCTCCACAGG TCCCTGTCAG CGGTGAACCA AAAAACACCA TTGCCCACGG ATGGAGCCCT
ATGGCATCAC ATTGCATAAC CGTTGATTTG AAAGCTGGTG AAGAAAAGGA ACTTGATTTC
ATTCTGGGTT ATGTTGAAAA CGATGTAAAT GAAAAATGGG AAAGTAAAGG CGTTATAAAT
AAAAAGAAAG CATATAAGAT GATAGAAGAA AAAGGTAATC CTGCCGGAGT TCAGACAGCT
TTTGATGAAC TGCAGAATTA TTGGAGCCAG TTGTTTACAC AGTATAAACT TGAGCACAAA
GATGAGAAAC TGTCCAGAAT GGTGAACATA TGGAATCAGT ATCAATGTAT GGTTACATTT
AATATGTCAA GAAGTGCTTC ATATTTTGAA ACAGGTATCG GAAGGGGTAT GGGTTTTAGA
GACTCAAATC AGGATATATT GGGATTTGTA CACCAGATTC CTGACAGAGC TAGAGAAAGG
ATCCTAGACA TAGCTGCAAC CCAGTTGGAA AACGGTGGTG CATATCATCA ATACCAACCG
CTTACAAAGA AGGGTAACAA CGAAATTGGG GGAAATTTTA ATGACGACCC GGTGTGGCTT
ATTGCTTCTG TAGCTGCATA TATTAAAGAG ACAGGAGATA TGGACATACT AAAGGAAAAT
GTTCCTTTTG ACAACGATGA CACAAAAGCT GCTTCTCTTT TTGAACACTT AAGAAGGTCT
TTCTATCATG TTGTGAACAA TCTTGGACCT CATGGACTGC CTCTTATAGG AAGGGCAGAC
TGGAATGACT GTCTCAATCT AAACTGTTTC TCCGAGACTC CAGACGAATC ATTCCAGACA
ACCACAAGTA AAGATGGGAA AGTGGCAGAG TCTGTTTTGA TAGCCGGAAT GTTTGTTTAC
TACGGTCCGG AGTACGTAAA ACTTTGTGAG CTTAATGGAC TTGAAGATGA AGCTGCTAAG
GCTCAAGTTG AAATTGATAA AATGATAAAA ACTGTTAAGG AGTATGGTTG GGACGGCGAA
TGGTTTATCC GTGCTTACGA TGATAACAGT GAGAAGATTG GAAGTAATGA AAATGAAGAA
GGTAAGATTT TCATTGAATC ACAGGGCTTC TGTTCTATGG CGGAGATAGG TCTGGAAGAC
GGTTATGTCG AGAAAGCCTT GGATTCTGCA AGAAAGTATT TGGATACTCC GTACGGACTT
GTACTTCAAA ATCCGGCATT TACAAAATAC TATGTCAACA TGGGTGAGAT ATCTACATAT
CCTGCAGGAT ATAAAGAAAA TGCAGGCATA TTCTGCCACA ATAATCCTTG GATTATTGCA
GGTGAAACAG TTTTAGGCAG AGGCGACAGA GCTTTTGAAT ATTATTCCAA AATTGCTCCT
GCATATACTG AGGAGATAAG TGATATTCAC AAAACAGAAC CTTATGTTTA TTCTCAGATG
ATTGCAGGTA AGGATGCTAA GAGACCGGGA GAAGCAAAGA ATTCTTGGCT TACTGGTACA
GCAGCATGGA ACTTTGTAGT AATCTCACAA AATATACTTG GTATAAAGCC TGATTATTCA
GGACTAAAGA TAGATCCGTG TATACCAGCA AGTTGGGATG GTTACAAGAT AACCAGAAAA
TTCAGAAATG CAGTTTTTGA AATTGTTATT AGCAATCCCG AACACGTTTC AAAAGGAGTT
AAGAAAGTAG TTGTAGATGG GAAGGAAATA GGTGGTAATA TAATTCCTAT ATTTAATGAT
GAAAAGACAC ATCAGGTCGA AGTAATAATG GGATAA
 
Protein sequence
MKYGFFDDTN REYVITTPKT PYPWINYLGT QEFFSLISNT AGGYCFYKDA RLRRITRYRY 
NNVPIDMGGR YFYINDNGVL WSPGWSPVKA DLDKYECRHG LGYTKITGSK NGISTEVLYF
VPLNFNGEIH RVRVKNTTSD NKSVKLFSCI EFCLWNAYDD MTNYQRNLST GEVEVENSVI
YHKTEYKERR NHFSFYSVNA DLTGFDTDRD EFIGLYNGFD APQVPVSGEP KNTIAHGWSP
MASHCITVDL KAGEEKELDF ILGYVENDVN EKWESKGVIN KKKAYKMIEE KGNPAGVQTA
FDELQNYWSQ LFTQYKLEHK DEKLSRMVNI WNQYQCMVTF NMSRSASYFE TGIGRGMGFR
DSNQDILGFV HQIPDRARER ILDIAATQLE NGGAYHQYQP LTKKGNNEIG GNFNDDPVWL
IASVAAYIKE TGDMDILKEN VPFDNDDTKA ASLFEHLRRS FYHVVNNLGP HGLPLIGRAD
WNDCLNLNCF SETPDESFQT TTSKDGKVAE SVLIAGMFVY YGPEYVKLCE LNGLEDEAAK
AQVEIDKMIK TVKEYGWDGE WFIRAYDDNS EKIGSNENEE GKIFIESQGF CSMAEIGLED
GYVEKALDSA RKYLDTPYGL VLQNPAFTKY YVNMGEISTY PAGYKENAGI FCHNNPWIIA
GETVLGRGDR AFEYYSKIAP AYTEEISDIH KTEPYVYSQM IAGKDAKRPG EAKNSWLTGT
AAWNFVVISQ NILGIKPDYS GLKIDPCIPA SWDGYKITRK FRNAVFEIVI SNPEHVSKGV
KKVVVDGKEI GGNIIPIFND EKTHQVEVIM G