Gene Ccel_3406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3406 
Symbol 
ID7311969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3955948 
End bp3958806 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content40% 
IMG OID643610311 
ProductIg-like, group 2 
Protein accessionYP_002507674 
Protein GI220930765 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.68054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT ACATAAGGAA AAGTTTTGCA CTTTTGGTTA TATTAGCCAC ACTGCTGACC 
TCTTTCAGTG GTGTGTCTGC AGCACAGCAG CTTATAAGTC AGACTGTTGA AAAGCAGACG
ATTACTTCAG GTGTAACTTT GGAGAGCTAC GACCGCTTTA CTACAAGTGG GTGGATTAAA
TCCTATGTTC TCAGAGTTGA TCTGTCTAAT AAAAATGTGA AGGTTGACAC TCTTGTAAAC
AAAAAATCAG TGGTCGGTTA TTCAACTGTA TTGAACCTGG CTAAAAATAG CGGGGCTATT
GCTGCAGTAA ACGGTAGCTT TTTTGATTTC GGACCTAGTG GAAGCGGAAA AGGATATACA
TACGGTCCTG TAGTTTCTTC AGGTGAAATT GATCTAGCAG CTACCAGAGA CAGTAAGGAT
ACTGCAACCT TCTCCCTAAA TGATGTAAAC GAAGCTCTTT TTACATACTG GAACACCAAG
GTTGAGCTTG TAACTCCAAA GGGTGAAAGA AAAGTAGCTG CATCTTATAA CAGATATAAT
GGTAAATTTA ACGGAATGTC CATAGTAGAC TCAAAATGGG GTGCTAAAAC TCCGGGTGCT
ACATCCAACT ATCCATACTG GATAGAAATG GTAGTTGAAG ATGGTATTGT AAAAGAGTTC
AACGAGAACA AACCCAGTAT GGATATGCCA AAAAACGGTT TTGTTGTTTT AGGTGCGGGA
AGTCATATCC AGTATTTAAA AGACAATTTC AATGTTGGTG ATCCAGTAGA ATATAATATC
ACCATGAATG TTGACACCAA TAATATGAAG ATGGCCCTTA CAGGCGGAGC AATGCTTGTA
AAGGATGATA AAGTATTAAC TTCTTTCTCG CACAACCCTG TTTCGCCAAG TACGAGGGCA
TCAAGGACAG CAATCGGTAC ATCAAAAGAC GGAAAAACCC TTATTGTGGC TGCTGTTGAT
GGTAGGTCAA GTGCAAGTAT AGGTATGACA CAATCAGAAC TTGCATCATA CATGCATGAA
CTTGGATGTG CCAATGCACT TAATCTGGAT GGAGGCGGCT CAACTACACT GGTAGCAAGA
AAGCAAGGTA CAACAGGTTT AAGCGTTCAG AATCGTCCCT CAGACGGTTC ACAAAGAGGA
GTAGGAGCTT CTCTGGGAAT ATTCTCGGTA GGACCTCAAG GTCCTGTAGA TTCTCTTTAT
ATAACCTCTT ATGAGGATTA TGTATTTGTC AATACCTCCA GAGCCTTTAC CGCAAGGGGC
ATGGATAAGT ACATGAATCC TGTTAATATT AACCCCAAAG ATATAAAATG GTCTGTGTCT
GGTGTCAAAG GTACTTTCAA GGGAAATGTA CTGTATCCCA CCACAGCAGG TGAAGCCGTA
GTTACGGCAA AAATAGGCGA CAATGTAGTT GGAACTTGTC CTATTACCGT ATTGGAATCA
CCTTCCAAGC TCGAAATGAA CTATGACACA CTTAATGTTA ATCCCGGAAG TTCGATTACA
TTTTCCGTCA AAGGATGGGA TAAGAACGGC TACACAGCAA GTATTCCCCC TGCAAATATT
AAGTGGAGTA CAGGCGGAAA TGTAGGTAAT GTATCCTCAT CCAACGTATT TACAGCAAAT
AAAAGCGGTG GAACAGGTTA TGTAGCCGCC ACTGTAGGTT CAGCAGTTGT ATCATGTCCT
GTGTCTATCC GTAAGTCTGG GTTAACAAAA GTTATACAGG ACTTTAATTC AACCGGAATA
AAGCTGATCA CCTCACCAAA TTCGGCTAAA GCATCTTATA GTATGGCGTC AAACGTTTAT
AAGTCAGCAA AAAACTCCGC AAAAATCACA TATGATTTTT CTAAGGATGT GAATGTAAAC
AGAACAGCAT ACTTAACTTT GCCAAATGGA GGCTATACAT TGGAATCCTC TACTTCCAAG
CTTGGTATGT GGGTTTACAG TTCCGCCAAA AAACCAATAT GGATAGGGGC TACCGTACAC
GATTCTAAAG GAAACTACTC TAGTGAGTAT TTTGCAAAGG GGATCACCTG GACAGGTTGG
AAGTACCTCG AGGTATCACT GGATAATTTG AATACACCTA AAAAAATAAC AAACGTGTTT
GCTGTCCAGC CTAAAAGCGG AAAGTCCTCG GGAACGGTCT ATTTTGATGA CCTTACAATG
GTTTACACAG GCTATCCCGC TGTTGATATG ACAAAGGTTC CCAAGACAAC CGTACCTAAG
GATGATAATT ACAAGGAAAG TACTGTATCT GGCACGGACT CAATGACGTT TTCAGTGTTT
GGCCAATCAA CTGCATATTC ATCATCCAAC AAGACACAGA TTGACATGCT AAATACTCTT
GCTAGCCGTA TAAACAGCTC TGTTGATGTC TCTGTACTGG TTGGCTCAAA TGACGGGCTT
ACAAGAAGCA GTATCAAGAT TCCACAGTTG GCTACCACAG CAAGTTATAA ATCTTTAGAT
ATTAATGGAA ACAGACTTAT TCAACTGAAT ACTTCCAAAG GTGGTATTCG TGCGACAAAT
ACAAACGAGT GGTTATGGCT TAATAATCAA CTTAAAACCT TTGATGGTAA AAACCTATTT
GTTTTTCTAA TGGAAGATCC TATAAAATTC AATGATACCA AAGAAGGGCA GCTTTTAAAG
GACACTCTTT CAAATTATCA AAAGGAAACT GGAAAAAATG TATGGGTATT CTATAAGGGA
AATTCAAATT CAAGTTATAT GGACAAAGGT GTAAAATATG TGGTTACCGC AGGATTTGAT
TCTTATGGCT TCAGCGACAG CAACAAAAGT GCTGCAAAGT ATGCGGTTGT AAAAGTTAAG
GGAACCTCCA TAACATACCA GTTTAAATCA TTTAATTAG
 
Protein sequence
MKKYIRKSFA LLVILATLLT SFSGVSAAQQ LISQTVEKQT ITSGVTLESY DRFTTSGWIK 
SYVLRVDLSN KNVKVDTLVN KKSVVGYSTV LNLAKNSGAI AAVNGSFFDF GPSGSGKGYT
YGPVVSSGEI DLAATRDSKD TATFSLNDVN EALFTYWNTK VELVTPKGER KVAASYNRYN
GKFNGMSIVD SKWGAKTPGA TSNYPYWIEM VVEDGIVKEF NENKPSMDMP KNGFVVLGAG
SHIQYLKDNF NVGDPVEYNI TMNVDTNNMK MALTGGAMLV KDDKVLTSFS HNPVSPSTRA
SRTAIGTSKD GKTLIVAAVD GRSSASIGMT QSELASYMHE LGCANALNLD GGGSTTLVAR
KQGTTGLSVQ NRPSDGSQRG VGASLGIFSV GPQGPVDSLY ITSYEDYVFV NTSRAFTARG
MDKYMNPVNI NPKDIKWSVS GVKGTFKGNV LYPTTAGEAV VTAKIGDNVV GTCPITVLES
PSKLEMNYDT LNVNPGSSIT FSVKGWDKNG YTASIPPANI KWSTGGNVGN VSSSNVFTAN
KSGGTGYVAA TVGSAVVSCP VSIRKSGLTK VIQDFNSTGI KLITSPNSAK ASYSMASNVY
KSAKNSAKIT YDFSKDVNVN RTAYLTLPNG GYTLESSTSK LGMWVYSSAK KPIWIGATVH
DSKGNYSSEY FAKGITWTGW KYLEVSLDNL NTPKKITNVF AVQPKSGKSS GTVYFDDLTM
VYTGYPAVDM TKVPKTTVPK DDNYKESTVS GTDSMTFSVF GQSTAYSSSN KTQIDMLNTL
ASRINSSVDV SVLVGSNDGL TRSSIKIPQL ATTASYKSLD INGNRLIQLN TSKGGIRATN
TNEWLWLNNQ LKTFDGKNLF VFLMEDPIKF NDTKEGQLLK DTLSNYQKET GKNVWVFYKG
NSNSSYMDKG VKYVVTAGFD SYGFSDSNKS AAKYAVVKVK GTSITYQFKS FN