Gene Ccel_3412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3412 
Symbol 
ID7311973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3964636 
End bp3967110 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content40% 
IMG OID643610317 
Productglycosyltransferase 36 
Protein accessionYP_002507680 
Protein GI220930771 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTG GTCATTTTAA TCCAGTTAAC AAGGAGTATG TTATTACTCG CCCGGATACT 
CCTGCCCCTT GGTGTAATTA CCTTGGGTCT GTAGACTACG GTGCAATTAT ATCCAACAAT
GCTACAGGCT ATAGTTTTGT AAAATCCGGT GCAGCCGGGA GAATAATTCG TTTCAGATTA
AATTCCATGT CCAACGATCA ACCCGGCAGA TATATTTATA TACGGGATAA TGCAGACGGG
GACTACTGGT CAGGCTCATG GCAGCCGGTT TGCAAATCAA TTGACAGCTA TAAGAGTGAG
TGCAGACATG GAACTGCCTA TACTATTATT TCTTCCTCCT ACAAGGATAT AGAAACCCGT
ACTCTTTATT ACGTTCCCCT TGATAAAAAT TATGAAGTAT GGAATATCAG AATAAAAAAC
AGTAGTAATA ATAAAAGACA CCTATCCATA TATGGTACGG CGGAATTCAC AAACCACGAC
CACTATGAAA ATGACACTGT CAATCTGCAG TATTCGCAAT TTATAAGCAA AACATATTTC
AAGGATAACC ACATACTTCA GGTAATAAAT GAAAACGGCA GCGAAGCATC ATCAGATGTT
GAAGGAACCT CCAATAAAAA GGGTGACCCC ATATACAGAT TTTTTGGTCT GGCAGGCCAG
TCCGTTTCTG CTTTCGACGG CGAAAGAGAT ATGTTTATAG GTAACTACAG AAATTATGGA
AATCCTGTTG CCGTAGAATT AGGAAAGTGT TCAAATACTG TTGCCTACAA CGGAAATTCC
TTTGGAGCTC TCCAGACAGA TATAGAATTG AATCCTGGTG AGGAAACTGA AATGACTTTC
CTTCTTGGAG CAGGAAATGA AGATTTCGCA AGAAATATTA TTTCTAAATA TGATTCGGTT
GAAAAAGCTA ATATATGTTC TTGTGAAGGT TTTTGCAACT TCAGCGAGGT TGTATCACAT
GAGCTTACGC AGCTGAAAAA CTTCTGGCAT TCACGATTGG ATAACCTTCA GGTGGAAACC
CCGGATGATA ATTTTAATAA CATGCTGAAT GTCTGGAATG CCTACCAGTG CTTTATAACA
TTTTTCTGGT CAAGGGCCGC CTCTTTCCAA TACTGCGGCC TGAGAAACGG GTTAGGGTAC
CGTGACACTG TACAGGATAT ACAGGGTATA ATCCACTTGG ATTATAAAGC AGCTAAAGAA
CGGCTTTGGC TTATGCTTTC AGGACAAGTT TTAAATGGCG GCGGTCTGCC TCTTGTGAAG
TTCGACCATA AGCCGGGACA GGAAGCTACT CCTGATGAAT CACAATATGC AAAAGAAACG
GGACAATCCT TCTACCGGGC GGATGATGCA CTGTGGCTCT TCCCGACTGT AATTACATAT
ATCAAGGAAA GCGGTGACTG GAACTTTATA GACGAAAAAG TTCCTTATGC AGACAAGGGT
GAAGCAACCG TTTATGCCCA TCTTAAACAG GCGATTCAGT TTAATCTGGA CAGACAAGGC
AGTCATGGCC TGCCTGTAGG GTTATTTGCA GACTGGAACG ATTGTTTGAG GCTTGGCTCT
AAAGGTGAAT CACTTTTTGT TACCTTCCAG CTGTATTATG CACTTAAAAT TTTTAAAGAA
TTTGCTACAA AAAAGGATGC TTTGGCTGAC ATTGAATGGG CACAAAACTG TCTTAATGAA
TTAAGTGGGA ATATTCAAAA GTTTGCATGG GAAGGGGATC AATTTGTTCG TGGGTTTACA
GAGGATGGAT ATACTATAGG ATCAAAAAGT AATTCCGAAG CAAGCCTGTG GCTTAACCCT
CAAGTCTGGT CCGTTATCAG CGGTGCTGCT GACGAAAAAA CAGCTAAAAC CGTTCTTGAC
AAGGTATATG ACAATCTCAA TACTAAATAT GGTGCAATGT TGTTTTACCC GGCTTTCAGA
GAATACGGAC TTCCTGTTGC AAGAATGTCC CTTTTTAATG CAGGAACCAA AGAAAATGCC
GGAATTTTCT CTCAGCCCCA AGGTTGGGTA ATACTAGCTG AAACAATCAT AGGAAATGGC
AACAGAGCCT ACGAATATTT TACTGAAATT AATCCTGCCG CCATGAATGA CCATGCTGAA
ATAAGAAAAC TGGAGCCGTA CATACATGGT CAGGCCACTG AAGGGATTGA TACCCTAAAC
CACGGACGTT CACATGTTCA TTGGCTGACA GGTACTGCCT CAACTGTTAT GGTTTCCATG
GTATACGGTA TTCTGGGATT ACAACCTGAA TATAACGGTA TAAAAATAAA TCCATGCATC
CCTTCAGGCT GGAAGAATTT CAAAATGAAC AAAGTATTCA GAAATACTGT TCTCAACATA
ACCGTCGATA ACAGTCAGGG CGTTGAAAAG GGAGTACATT ATATTACCGT AAATGGAAAA
CGTATTGATG GCTGTTATAT CAGTGCGGAT GAACTTAAAG ATACTAATGA AATATTAGTA
GTAATGGGTA AGTAA
 
Protein sequence
MNFGHFNPVN KEYVITRPDT PAPWCNYLGS VDYGAIISNN ATGYSFVKSG AAGRIIRFRL 
NSMSNDQPGR YIYIRDNADG DYWSGSWQPV CKSIDSYKSE CRHGTAYTII SSSYKDIETR
TLYYVPLDKN YEVWNIRIKN SSNNKRHLSI YGTAEFTNHD HYENDTVNLQ YSQFISKTYF
KDNHILQVIN ENGSEASSDV EGTSNKKGDP IYRFFGLAGQ SVSAFDGERD MFIGNYRNYG
NPVAVELGKC SNTVAYNGNS FGALQTDIEL NPGEETEMTF LLGAGNEDFA RNIISKYDSV
EKANICSCEG FCNFSEVVSH ELTQLKNFWH SRLDNLQVET PDDNFNNMLN VWNAYQCFIT
FFWSRAASFQ YCGLRNGLGY RDTVQDIQGI IHLDYKAAKE RLWLMLSGQV LNGGGLPLVK
FDHKPGQEAT PDESQYAKET GQSFYRADDA LWLFPTVITY IKESGDWNFI DEKVPYADKG
EATVYAHLKQ AIQFNLDRQG SHGLPVGLFA DWNDCLRLGS KGESLFVTFQ LYYALKIFKE
FATKKDALAD IEWAQNCLNE LSGNIQKFAW EGDQFVRGFT EDGYTIGSKS NSEASLWLNP
QVWSVISGAA DEKTAKTVLD KVYDNLNTKY GAMLFYPAFR EYGLPVARMS LFNAGTKENA
GIFSQPQGWV ILAETIIGNG NRAYEYFTEI NPAAMNDHAE IRKLEPYIHG QATEGIDTLN
HGRSHVHWLT GTASTVMVSM VYGILGLQPE YNGIKINPCI PSGWKNFKMN KVFRNTVLNI
TVDNSQGVEK GVHYITVNGK RIDGCYISAD ELKDTNEILV VMGK