Gene Ccel_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2354 
Symbol 
ID7311026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2770902 
End bp2773292 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content42% 
IMG OID643609280 
Productglycosyltransferase 36 
Protein accessionYP_002506668 
Protein GI220929759 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTACG GTTATTTTGA TGATTGTAAT AAGGAATATG TTATTACAAG ACCCGATACA 
CCTACTCCAT GGTCAAATTA TTTAGGATCA ATGGAATATG GGGCTTTAAT AACTAATAAC
GCAGCCGGCT ATAGCTTCGT GAAATCCGGC TCTGAGGGGC GAATTTTACG TTTTCGGTTC
AACTCAGTTA CACAGGATAT GCCAGGTAGG TTTATTTACA TAAAGGATCA GAATTCGGGA
GATTACTGGT CAGCCTCCTG GCAGCCGACA GGTAAGGAGT TGAGTGACTA TAAATCGGTT
TGTCGGCATG GTACAGCTTA TACTGTAATT TCTTCAGGCT ATGACAAAAT TTCTTCCGAG
ACGCTTTACT ACGTTCCGTT GGGTCAAAAC CACGAGGTCT GGCATTTTAA AATAAGGAAC
AATGATACAA AAAGACGGAA AATATCTGTT TTCAGCTATG CGGAATTTAC AAGTGATAAT
AGTTCCATGC TGGATATGGA GAATATCCAG TACACCCGGT TTCTAAGCCG TACGTATTTT
AAGGACAACT ACATACTACA GTCACTAAAA GAACTGACCG AGGAAAGGGT TTTTCGGTTC
TTTGCCGCAA GCGGCGGAGT GAAGGTTTCA GGATATGATG GTTCAAGAGA AAAATTTGTA
GGACCCTACG GGTCATACAG TAACCCTTTG GCACTGAAGA ACGGATATTG CAGTAATTCG
CTGAATTATA CAGGAAACTC CTGTGGCTCT CTTCAAATAG ATTTGGAATT ATTGCCCAGT
ACCGAGAAGG AAATCGTTTT TATTCTTGGG GAGGGTAATG AAGAGACGGC TGAAATGAAG
GTAAAGCATT ACAAGGGCGG GGGAGTGGTT GAAGAGGAGC TGGCACAGTT AAAGGCATAC
TGGCATGGTA AGCTAGAAGT ATTTCAGGTA AAAACGCCTG ATTCTGCCTT TAACAGCATG
ATGAATGTAT GGCACAGCTA TGAATGCTTT GTGAATACCT TCTGGTCCAG AACAGCTTCC
CTTATTTATT CAAGCCTGAG AAATGGCTTT GGTTACAGAG ATACAATGGC AGATATTCAG
AGTATTATGC ATCTTGACAG TAAGCTTGCA GGTGAAAGAT TGGTTACAAT GTTATCGGGG
CAGGTATCAA ACGGGGGAGC TCTTCCTCTT GTAAGGTTTG ATCATAAGCC CGGGGCCGAA
CCTGTTCCAG GTTCTTCAGA GTATCAAGAA AAGACAGGCT ATAAGGAATA TCGCTGTGAT
GATGCACTTT GGCTGTTTCA GGCTGTTCCC CAATACATAA GGGAAAGCGG TGAGCTTGAT
TTTCTCAACA AGATTATTCC CTACTCCGAC AAAGGCGAAG ATACTGTTTA CCTGCATCTA
AAGAAGGCTC TAAATTTTAG CCTTGAAAGA TTGGGGCGGC ATAACCTGGT ACTGGGCATT
GATACAGACT GGAACGATTG TCTGAGACTT GGAGAGAACG GAGAATCTGT TTTTGCCTCC
TTTCAGCTTT ATCTGGCAAT ATGTGAATTC AAAAAAATTG CACTGAGTAA TGGGAACTGT
GAGGATGTAG ATTGGGCGGA AACGAACAGA AAAAAACTAT ATGACAGTCT ACAGAAATAT
TGCTGGGAGG ACGGCCAGTT TATAAGAGGC TTTACAGGGG ACAATCAGGT AATTGGTTCG
CCTAAAAGCA AGGAAGCTGC TTTGTGGCTG AATCCACAAA CATGGTCAGT TATTAGTGGC
GTTGCAACAC ATGACCAGGC CAAAAAAGCA TTAGACAAAG TACACGATAT CCTCAAAACC
AAATACGGTG CAATGCTTTT CTATCCATCT ACAAAGACGA TTGGACCGCC TATATTCCTT
ATGAGCTTGT ATCCACCCGG AATAAAGGAA AATGCCAGTA TATTTTTAAT GGCGGAAGCT
TGGATTATCC AGGCAGAAGC TATGATGGGT CACGGAAACC GTGCATGGGA TTACTATAAC
AGCACTAATC CTGCAGCTCA AAATGACTCG GCTGATTTAC GCCATACAGA GCCGTATGTT
TACAGTCAGT TTATTGATGG ACTGGAAAGC CCGAACCACG GCAGATCTCA CGGGCACTGG
TTGACAGGTT CCGCATCATC TATAATGACT GCCGTAGTTG AAGAAATTTT GGGACTTAAA
GCCGACTACG ACGGCTTGAT TATTGATCCG TGCATTCCAT CGGAGTGGAA AGAATTTAGT
ATGGTCAGGC ATTTCAGAGG AAGGAAACTG AATATTATTG TACAGAATTC CGGTGGTGTA
GAAAAAGGTG TAAAAAAAAT CAGCATAAAC GATAAAACCA TATTGAACAG CTGCCTTATC
CCGCTAGATT GTATGGAGGC AGTAAATACT GTTCAGGTAA TTATGGGTTG A
 
Protein sequence
MNYGYFDDCN KEYVITRPDT PTPWSNYLGS MEYGALITNN AAGYSFVKSG SEGRILRFRF 
NSVTQDMPGR FIYIKDQNSG DYWSASWQPT GKELSDYKSV CRHGTAYTVI SSGYDKISSE
TLYYVPLGQN HEVWHFKIRN NDTKRRKISV FSYAEFTSDN SSMLDMENIQ YTRFLSRTYF
KDNYILQSLK ELTEERVFRF FAASGGVKVS GYDGSREKFV GPYGSYSNPL ALKNGYCSNS
LNYTGNSCGS LQIDLELLPS TEKEIVFILG EGNEETAEMK VKHYKGGGVV EEELAQLKAY
WHGKLEVFQV KTPDSAFNSM MNVWHSYECF VNTFWSRTAS LIYSSLRNGF GYRDTMADIQ
SIMHLDSKLA GERLVTMLSG QVSNGGALPL VRFDHKPGAE PVPGSSEYQE KTGYKEYRCD
DALWLFQAVP QYIRESGELD FLNKIIPYSD KGEDTVYLHL KKALNFSLER LGRHNLVLGI
DTDWNDCLRL GENGESVFAS FQLYLAICEF KKIALSNGNC EDVDWAETNR KKLYDSLQKY
CWEDGQFIRG FTGDNQVIGS PKSKEAALWL NPQTWSVISG VATHDQAKKA LDKVHDILKT
KYGAMLFYPS TKTIGPPIFL MSLYPPGIKE NASIFLMAEA WIIQAEAMMG HGNRAWDYYN
STNPAAQNDS ADLRHTEPYV YSQFIDGLES PNHGRSHGHW LTGSASSIMT AVVEEILGLK
ADYDGLIIDP CIPSEWKEFS MVRHFRGRKL NIIVQNSGGV EKGVKKISIN DKTILNSCLI
PLDCMEAVNT VQVIMG