Gene Ccel_1439 details


Gene Information

Locus tag: Ccel_1439
Symbol:
ID: 7310212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1747872
End bp: 1750226
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 42%
IMG OID: 643608365
Product: glycosyltransferase 36
Protein accession: YP_002505773
Protein GI: 220928864
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3459] Cellobiose phosphorylase
TIGRFAM ID:
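
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1747872
end_bp = 1750226

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the computed values agree with the listed Gene Length (2355 bp) and Protein Length (784 aa).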


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTCG GGTATTTTGA CCGAAAGAAC AGGGAATATG TAGTAACAAG GCCTGATACA 
CCGACGCCAT GGATTAACTA CATAGGCAGT GGAAATTATG GTGGTATAGT TTCCAATACA
GGGGGAGGTT ACAGTTTTCA TAAGGACCCT CAAAATCGCA GAGTTACACG CTACAGGTAT
AATAATATAC CTATGGACAG ACCCGGAAGG TATGTATACA TAAGAAACAA GGACACAGGG
GAGTACTGGA ATCCAGGCTA TCAGCCCGTA CAGAAGAAAC TGGACGGATA CAGCTGCCGT
CACGGACTTG GCTACAGTGT TTTGACCGGG GAATATAAAG GAGTTATAGG CGAGGTTACA
TATTTTGTAC CTGATGATAA GAACTTTGAA CTATGGTTTG TCAAAGTATC TAATACTCGC
AGCATACAGC AGAATCTTCA GATTTTTGCA TACTCGGAGT TTTGCTTCTG GGATGCAATA
ATGGATCAGC AGAATGTTGA CTGGGTACAG CAGATTAATC AGGGCAGATT TGATGATGGA
ATTATTACCT ACCATCCTCA TCACGTTAGT GACAATGCCG CTTTTTTTGC AACGGGTGAA
AAGGTAAGCA GCTTTGATAC CAATCTTGAA ACCTTTATTG GAAGATATAG ATCGGAAGGC
AATCCTATTG CCGTAGAACA GGGAGCTTGC AGTAATTCCA TATCCTACAG GACAAACGGT
GTCGGTGCCT TTTGTATAGA CTGTGACCTT GGCCCCAATG AAGAACGTGA GTTGGTTTTT
GTGCTTGGGT TTGCAGAGGA AAAATCAGAG ATAAGAAAAG ACATAAAGGA ATATCTTTTG
CCCGAAAATG CTAAAGCGGC GTTTAGCAGA CTACAGGCTT CATGGCTTGA CTTTACATCC
AAGCTCAGTG TTGAAACACC TGATGAGGAT ATGAATCTGT TTGTAAACAT ATGGAATCAG
TATCAGTGCA AAACTACCCT CAACTGGTCA AGATTTGTTT CACTGTATCA GCTGGGTCTT
GGGAGAGGTA TGGGTATCAG AGACAGTGCA CAGGATACAC TTGGCGTAAT GCATACGATA
CCTGCCGAGG CAAAGGAGGT TATTATAAAG CTTCTTAAAT GCCAGTATAC AGACGGAAGA
GCATATCATC TGTTTTTTCC GCTTACAGGA GAAGGAGGAC AGGGGGATGC TCCCGTCAAG
AAATTTGACT GGTATTCCGA CGACCATTTG TGGCTGATAC TTGCTGTAAA TGCTTATATA
AAGGAAACTG GAGATTTTGA GTTTTTGAAC ATGGAAGTTC CGTATAACGA TAAAATTACC
TCACAGACCG TAATGCGGCA CCTTGATATG GCATTGGAAT TTACAAACAA TAACCGAGGC
CCTCATAATA TTGCGTTGGC GGGACGTGCT GACTGGAATG ACACACTTAA CCTTGATACA
GGTAAGGGTG TTGCAGAAAG TGTATTTACG TCTATGCTAT ATTGCAGGGC ATTAATAGAA
ATGATTGAAA TACTGGATTA CCTTAAAAAT ACAGATATGA TAAAAAAGTA TTCCGACATG
TATGAGGATA TGAAGAACGC TATAAATGAT ACCTGTTGGG ATGGAGAATG GTACAAGAGG
GCTTTTGATG ATAACAGTCA GCCTCTTGGT TCAAAGGAAA ATAAGTTCGG TAAAATATTC
ATAAATTCCC AGTCATGGGC AGTTTTAAGC AAGGTAGCGG AAAACGGAAG AGCAAATGAG
TCAATGGAAT CCGTTGAGAA GTATCTCAAT TCGAAATATG GAGTTGTAAC TATGTATCCT
GCTTACACAG AGTATGACAC CACAAAAGGA GGAGTAACTA CATTTCCACC GGGAACAAAG
GAGAATGGAG GGATCTTCCT TCACACGAAT CCTTGGGTAA TGATTTCAGA GGTAATGCTC
GGTCACGGGG ACAAGGCCTT CATGTACTAT AATCAGATTT TGCCGGGCAA AAGGAATGAT
GATGCGGAGT TGTATGAGGT AGAGCCGTAC GTATACTGCC AGAACATTCT CGGCAAGGAG
CATCCTCAGT TTGGTATAGG CAGAAATTCC TGGCTTTCGG GAACAGCTGC ATGGAATATG
GTAGCATCAA GCCAGTATAT ACTGGGAATA AGGGCAAACT ATGATTCACT GACGGTAGAT
CCGTGTATCC CTTCAAGCTG GAAGGGTTTT AAAGCTACAA GAGTATTCAG AGGTGCCACC
TATTATATAG AAGTACAAAA TCCAAACAGA GTTTGTGCAG GGGTCGAAAA AATAATTGTT
GATGGCGTTG AGACGGAAAA GATACCTGTT TTTGAGGCAG GAACAGAGCA CAATGTAGTC
GTTGTAATGA AATAA
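
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation such as GC content. Below is a minimal sketch of that cleanup; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the sequence, so its result is not the whole-gene GC content listed in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above (illustration only).
first_blocks = "ATGAGATTCG GGTATTTTGA "
print(clean_sequence(first_blocks), gc_percent(first_blocks))
```

Passing the full sequence text to `gc_percent` would give the whole-gene figure for comparison with the GC content field.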
 
Protein sequence
MRFGYFDRKN REYVVTRPDT PTPWINYIGS GNYGGIVSNT GGGYSFHKDP QNRRVTRYRY 
NNIPMDRPGR YVYIRNKDTG EYWNPGYQPV QKKLDGYSCR HGLGYSVLTG EYKGVIGEVT
YFVPDDKNFE LWFVKVSNTR SIQQNLQIFA YSEFCFWDAI MDQQNVDWVQ QINQGRFDDG
IITYHPHHVS DNAAFFATGE KVSSFDTNLE TFIGRYRSEG NPIAVEQGAC SNSISYRTNG
VGAFCIDCDL GPNEERELVF VLGFAEEKSE IRKDIKEYLL PENAKAAFSR LQASWLDFTS
KLSVETPDED MNLFVNIWNQ YQCKTTLNWS RFVSLYQLGL GRGMGIRDSA QDTLGVMHTI
PAEAKEVIIK LLKCQYTDGR AYHLFFPLTG EGGQGDAPVK KFDWYSDDHL WLILAVNAYI
KETGDFEFLN MEVPYNDKIT SQTVMRHLDM ALEFTNNNRG PHNIALAGRA DWNDTLNLDT
GKGVAESVFT SMLYCRALIE MIEILDYLKN TDMIKKYSDM YEDMKNAIND TCWDGEWYKR
AFDDNSQPLG SKENKFGKIF INSQSWAVLS KVAENGRANE SMESVEKYLN SKYGVVTMYP
AYTEYDTTKG GVTTFPPGTK ENGGIFLHTN PWVMISEVML GHGDKAFMYY NQILPGKRND
DAELYEVEPY VYCQNILGKE HPQFGIGRNS WLSGTAAWNM VASSQYILGI RANYDSLTVD
PCIPSSWKGF KATRVFRGAT YYIEVQNPNR VCAGVEKIIV DGVETEKIPV FEAGTEHNVV
VVMK
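
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce it without external libraries. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code during elongation (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence above.
print(translate("ATGAGATTCG GGTATTTTGA CCGAAAGAAC"))
print(translate("GTTGTAATGA AATAA"))
```

The two fragments translate to the first block (MRFGYFDRKN) and the last block (VVMK) of the protein sequence shown above.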