Gene Information |
Locus tag | Ccel_3406 |
Symbol | |
ID | 7311969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3955948 |
End bp | 3958806 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643610311 |
Product | Ig-like, group 2 |
Protein accession | YP_002507674 |
Protein GI | 220930765 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
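The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no residue (assumed present).
    return codons - 1 if includes_stop else codons


length = gene_length_bp(3955948, 3958806)
print(length)                     # 2859
print(protein_length_aa(length))  # 952
```

Under these assumptions, 2859 bp corresponds to 953 codons, of which one is the stop codon, which matches the listed protein length of 952 aa.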
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.68054 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT ACATAAGGAA AAGTTTTGCA CTTTTGGTTA TATTAGCCAC ACTGCTGACC TCTTTCAGTG GTGTGTCTGC AGCACAGCAG CTTATAAGTC AGACTGTTGA AAAGCAGACG ATTACTTCAG GTGTAACTTT GGAGAGCTAC GACCGCTTTA CTACAAGTGG GTGGATTAAA TCCTATGTTC TCAGAGTTGA TCTGTCTAAT AAAAATGTGA AGGTTGACAC TCTTGTAAAC AAAAAATCAG TGGTCGGTTA TTCAACTGTA TTGAACCTGG CTAAAAATAG CGGGGCTATT GCTGCAGTAA ACGGTAGCTT TTTTGATTTC GGACCTAGTG GAAGCGGAAA AGGATATACA TACGGTCCTG TAGTTTCTTC AGGTGAAATT GATCTAGCAG CTACCAGAGA CAGTAAGGAT ACTGCAACCT TCTCCCTAAA TGATGTAAAC GAAGCTCTTT TTACATACTG GAACACCAAG GTTGAGCTTG TAACTCCAAA GGGTGAAAGA AAAGTAGCTG CATCTTATAA CAGATATAAT GGTAAATTTA ACGGAATGTC CATAGTAGAC TCAAAATGGG GTGCTAAAAC TCCGGGTGCT ACATCCAACT ATCCATACTG GATAGAAATG GTAGTTGAAG ATGGTATTGT AAAAGAGTTC AACGAGAACA AACCCAGTAT GGATATGCCA AAAAACGGTT TTGTTGTTTT AGGTGCGGGA AGTCATATCC AGTATTTAAA AGACAATTTC AATGTTGGTG ATCCAGTAGA ATATAATATC ACCATGAATG TTGACACCAA TAATATGAAG ATGGCCCTTA CAGGCGGAGC AATGCTTGTA AAGGATGATA AAGTATTAAC TTCTTTCTCG CACAACCCTG TTTCGCCAAG TACGAGGGCA TCAAGGACAG CAATCGGTAC ATCAAAAGAC GGAAAAACCC TTATTGTGGC TGCTGTTGAT GGTAGGTCAA GTGCAAGTAT AGGTATGACA CAATCAGAAC TTGCATCATA CATGCATGAA CTTGGATGTG CCAATGCACT TAATCTGGAT GGAGGCGGCT CAACTACACT GGTAGCAAGA AAGCAAGGTA CAACAGGTTT AAGCGTTCAG AATCGTCCCT CAGACGGTTC ACAAAGAGGA GTAGGAGCTT CTCTGGGAAT ATTCTCGGTA GGACCTCAAG GTCCTGTAGA TTCTCTTTAT ATAACCTCTT ATGAGGATTA TGTATTTGTC AATACCTCCA GAGCCTTTAC CGCAAGGGGC ATGGATAAGT ACATGAATCC TGTTAATATT AACCCCAAAG ATATAAAATG GTCTGTGTCT GGTGTCAAAG GTACTTTCAA GGGAAATGTA CTGTATCCCA CCACAGCAGG TGAAGCCGTA GTTACGGCAA AAATAGGCGA CAATGTAGTT GGAACTTGTC CTATTACCGT ATTGGAATCA CCTTCCAAGC TCGAAATGAA CTATGACACA CTTAATGTTA ATCCCGGAAG TTCGATTACA TTTTCCGTCA AAGGATGGGA TAAGAACGGC TACACAGCAA GTATTCCCCC TGCAAATATT AAGTGGAGTA CAGGCGGAAA TGTAGGTAAT GTATCCTCAT CCAACGTATT TACAGCAAAT AAAAGCGGTG GAACAGGTTA TGTAGCCGCC ACTGTAGGTT CAGCAGTTGT ATCATGTCCT GTGTCTATCC GTAAGTCTGG GTTAACAAAA GTTATACAGG ACTTTAATTC AACCGGAATA AAGCTGATCA CCTCACCAAA TTCGGCTAAA GCATCTTATA GTATGGCGTC AAACGTTTAT 
AAGTCAGCAA AAAACTCCGC AAAAATCACA TATGATTTTT CTAAGGATGT GAATGTAAAC AGAACAGCAT ACTTAACTTT GCCAAATGGA GGCTATACAT TGGAATCCTC TACTTCCAAG CTTGGTATGT GGGTTTACAG TTCCGCCAAA AAACCAATAT GGATAGGGGC TACCGTACAC GATTCTAAAG GAAACTACTC TAGTGAGTAT TTTGCAAAGG GGATCACCTG GACAGGTTGG AAGTACCTCG AGGTATCACT GGATAATTTG AATACACCTA AAAAAATAAC AAACGTGTTT GCTGTCCAGC CTAAAAGCGG AAAGTCCTCG GGAACGGTCT ATTTTGATGA CCTTACAATG GTTTACACAG GCTATCCCGC TGTTGATATG ACAAAGGTTC CCAAGACAAC CGTACCTAAG GATGATAATT ACAAGGAAAG TACTGTATCT GGCACGGACT CAATGACGTT TTCAGTGTTT GGCCAATCAA CTGCATATTC ATCATCCAAC AAGACACAGA TTGACATGCT AAATACTCTT GCTAGCCGTA TAAACAGCTC TGTTGATGTC TCTGTACTGG TTGGCTCAAA TGACGGGCTT ACAAGAAGCA GTATCAAGAT TCCACAGTTG GCTACCACAG CAAGTTATAA ATCTTTAGAT ATTAATGGAA ACAGACTTAT TCAACTGAAT ACTTCCAAAG GTGGTATTCG TGCGACAAAT ACAAACGAGT GGTTATGGCT TAATAATCAA CTTAAAACCT TTGATGGTAA AAACCTATTT GTTTTTCTAA TGGAAGATCC TATAAAATTC AATGATACCA AAGAAGGGCA GCTTTTAAAG GACACTCTTT CAAATTATCA AAAGGAAACT GGAAAAAATG TATGGGTATT CTATAAGGGA AATTCAAATT CAAGTTATAT GGACAAAGGT GTAAAATATG TGGTTACCGC AGGATTTGAT TCTTATGGCT TCAGCGACAG CAACAAAAGT GCTGCAAAGT ATGCGGTTGT AAAAGTTAAG GGAACCTCCA TAACATACCA GTTTAAATCA TTTAATTAG
|
Protein sequence | MKKYIRKSFA LLVILATLLT SFSGVSAAQQ LISQTVEKQT ITSGVTLESY DRFTTSGWIK SYVLRVDLSN KNVKVDTLVN KKSVVGYSTV LNLAKNSGAI AAVNGSFFDF GPSGSGKGYT YGPVVSSGEI DLAATRDSKD TATFSLNDVN EALFTYWNTK VELVTPKGER KVAASYNRYN GKFNGMSIVD SKWGAKTPGA TSNYPYWIEM VVEDGIVKEF NENKPSMDMP KNGFVVLGAG SHIQYLKDNF NVGDPVEYNI TMNVDTNNMK MALTGGAMLV KDDKVLTSFS HNPVSPSTRA SRTAIGTSKD GKTLIVAAVD GRSSASIGMT QSELASYMHE LGCANALNLD GGGSTTLVAR KQGTTGLSVQ NRPSDGSQRG VGASLGIFSV GPQGPVDSLY ITSYEDYVFV NTSRAFTARG MDKYMNPVNI NPKDIKWSVS GVKGTFKGNV LYPTTAGEAV VTAKIGDNVV GTCPITVLES PSKLEMNYDT LNVNPGSSIT FSVKGWDKNG YTASIPPANI KWSTGGNVGN VSSSNVFTAN KSGGTGYVAA TVGSAVVSCP VSIRKSGLTK VIQDFNSTGI KLITSPNSAK ASYSMASNVY KSAKNSAKIT YDFSKDVNVN RTAYLTLPNG GYTLESSTSK LGMWVYSSAK KPIWIGATVH DSKGNYSSEY FAKGITWTGW KYLEVSLDNL NTPKKITNVF AVQPKSGKSS GTVYFDDLTM VYTGYPAVDM TKVPKTTVPK DDNYKESTVS GTDSMTFSVF GQSTAYSSSN KTQIDMLNTL ASRINSSVDV SVLVGSNDGL TRSSIKIPQL ATTASYKSLD INGNRLIQLN TSKGGIRATN TNEWLWLNNQ LKTFDGKNLF VFLMEDPIKF NDTKEGQLLK DTLSNYQKET GKNVWVFYKG NSNSSYMDKG VKYVVTAGFD SYGFSDSNKS AAKYAVVKVK GTSITYQFKS FN
|
| |
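The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it strips the 10-character block spacing, then translates with the standard codon assignments, which translation table 11 shares for elongation codons. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final block of the gene sequence listed above.
print(translate("ATGAAAAAAT ACATAAGGAA AAGTTTTGCA"))  # MKKYIRKSFA
print(translate("TTTAATTAG"))                         # FN
```

The two fragments reproduce the first ten residues and the last two residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein sequence and the reported GC content to be checked in the same way.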