Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2221 |
Symbol | |
ID | 7310908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2593951 |
End bp | 2596944 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643609153 |
Product | protein of unknown function DUF187 |
Protein accession | YP_002506543 |
Protein GI | 220929634 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG0860] N-acetylmuramoyl-L-alanine amidase [COG1649] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.155376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA AAATCGGGGT AGTATGTATA TTGCTGGTTT TTCTGATGAT TTTGCCAATT GCTGGTTACA AGCTGTTTTC AGACAGGAAC TATGACGGAA ATGTCTCAAA TGCACAGACA GTATCAAAAA TTGAAGATTT GAGGGGAGTA TGGATTGCAT CTGTGTCAAA TATAGATTTT CCCTCAAAGC CGGGTATCAG TGCGGAAAAG CAGAAGAAAG AGCTGGATGA TATCATCAGC AATGCTCAAT ACATGGGTCT TAATGCTATA TTTTTCCAGA TAAGACCGAC GGGAGATGCT CTTTATAAAT CTACAATTTT TCCATGGTCT GCATACCTGA CAGGAAAGCA GGGAAAAGAG AATGATAATG GTTTTGACCC TTTAGCGTAC ATAATTGAAC AGGCTCATAA AAAAGGAATA CAGATACATG CGTGGATAAA TCCTCTTAGA TTATCAATGG GTACAACCAG TAATCCTACC GGAAATATTA ATGTACTTTC GGACAACCAT CCCGCTAGAA AGATTCCTGA AGCAGTTGTA GCAGCCCCTA CAGGTCAGCT GTATCTTGAT CCGGGAAATC CCGCCGCAAT TAAGCTGATA ACTGATGGTG TAGCCGAAAT TGTCAAAAAT TATGATGTTG ATGGAATACA TTTTGATGAT TATTTCTATC CTTCAAAATC GGAAGGAAAA GGTGTTGATT TTAACGACTC GGCCTCTTAT GCAAAGTACA AGGGAAGTTT TAAAAACAAG GATGACTGGA GACGCAATAA CATCAATACA CTTGTAAAAA GCACCTACAA CACTGTTAAA AATATTAAGC CTTCAGTTCA GTTTGGAATA AGTCCTTTTG CAATATGGTC AAACAAGGAT AGAAATAAGG AAGGTTCAGA TACACAAGGC GGAATATCCA CATATTATGA CCACTATGCA GATTCTAAAA AGTGGGTAAA AGAAGCATAC ATAGATTACA TAGCACCACA GATTTATTGG AATATAGGGT TCAAGGTAGC AGATTATTCT GTACTGGTAA ACTGGTGGAA GAACGTGTGT AGAGGAACAA AAGTAAAGCT TTATGTAGGA CATGCCGCAT ACAAAATAAA TGATACAACA CAGTCAAATG ACTGGCTTGA CCCGCTTCAA ATTCCAAAGC AGATTGCATA TAACAGAAAA AGCAATTCTG TAGACGGAAG CATATTTTAC GGATACTCCA AGCTGAAGAA CAATACACTT GGTATAAAGG ACAAGCTCAA AGGAATATTT GTATCAGGCA GAGATCCCGG AAGTACCGTT CCAGAGGACA GGCAGCTTTA TATAGCTTCC CCTTCTAACG GTTATAAAAC TTCATCATCC AGGATAAGTA TTATGGGGGG CGGTGATCCT GACCAGCCAA TATACCTAAA CGGTAAGAAA ATTGAGACTT CCTCCAATGG TTACTTCACC GTATATATGG ACTTGAAAGT TGGAGAGAAC AAATTTGTAT TCAAACATAA AGGTAAGGAG ACAGTGTTAA AGATAACACG CAATACAAAA ACAACCTCGA GTCCTTACAA GATGACAAAG GCTGAGTTCA GAAATGGTTA TTTTTCACCT ACCCAGAGTA TGACTATGCA GACGGGTAAA AAGATTACAT TTTCATGTCA AGCTCCTGCA GGAGCAAAGG TTTGGGTCGA AATCGGAGGC TATAAGGCAG AGTTGAAACA GACAGCCACT GTTGATGCAA ACAAAGGAAC ACTAACACCT GCAAAGTATT CAGGAACCTT TACAATGCCC TCTGTTTCAG GAAAAGAACG TACTAAAAGC CTCGGAAAGC CTGTATTTGT AATGGAATAT AATGGTAAAA GAATTACTTC AGAACAAAGT AATATAATAA GTGTACAATC ATCCAAATAC TATAAATATG CTGTTGTAAA TACCAGTGAT GCTGAGGCGG TAGCACGTTC CGGGCCATCA ACGGATTATT CAAGAATAAC ACCTTTGATT AACGGTGCGG CAGATTATAT TGTAGGTCAG CAAAATGGTT TTTATCTTCT GAAGAGCGGA GTATGGACAG CCACAAGCAA TGTAAAAGTT ATAAACGACA AAGCAATTGC AACTAACAAG GTTTCGTCAG TTACATTAAA GTCAAACGGC AGCTATACGG ATATAAGCTT TAAGATGCCT GTTAATACGG TATTTGGTGT TAAGTCAGCT TCAAATACGC TTAAATTGAC CCTGTACAAT ACATCGGGTA TGAGCGTTAA TAAATCGATA CCATCTGATG CACCATTTTC TTCAATAGGG TACAAGGCTG TTTCCGGCGG AGCTCAATAT ACATTTCAAT TAAAATCTGA GGGCAACTAT TTTGGTTACT ATGCAGAATA TAAAAACGGT TCACTTGTAT TCTCCATGAA AAATGCACCC AAGATTTCCA AGAGCGGTTC AAAGCCGCTT AAAGGCTTGA AGGTAGTACT TGATGCAGGT CATGGAGGTT CAGAATCGGG AGCTATTGGG CCTATGGGCA GATACGGACT CTATGAAAAA CAGGTAAACC TGGGAATAAC GTTAAATGCC CGGAAATATC TGCAATCACT TGGTGCAACG GTTGTTATGA CAAGGACTTC GGATAAGACT GTCAGCCTGA ATGACAGGGC AAACCTCATA CGCAAGGAAA AGCCGGATAT TGCAGTATCA ATTCATAACA ATTCAATGGA TGTAACGGCT GACTATACAA AGCATACAGG GTTACTGGTA CTGTACTCAA AAGACAGCTC TAAGGTTGTA GCCGGATATA TCAAAGATCA GTTGGTTGCA GACCTGAAGA GAAGGGATGA TGGCTACAGA TGGCAGAGTC TTTCAGTATG TACTGTCACT CAATCACCTG CAATACTTAT TGAGGGAGGA TTTATGTCAA ATCCTGCCGA GTATGAGTGG CTTGCAGATT ATGACAATCA GGTTAAGATA GGTAATTCCG TTGGTAAAGC CATTGAAAAC TGGGCATATG CCAATGCAAG ATAA
|
Protein sequence | MNKKIGVVCI LLVFLMILPI AGYKLFSDRN YDGNVSNAQT VSKIEDLRGV WIASVSNIDF PSKPGISAEK QKKELDDIIS NAQYMGLNAI FFQIRPTGDA LYKSTIFPWS AYLTGKQGKE NDNGFDPLAY IIEQAHKKGI QIHAWINPLR LSMGTTSNPT GNINVLSDNH PARKIPEAVV AAPTGQLYLD PGNPAAIKLI TDGVAEIVKN YDVDGIHFDD YFYPSKSEGK GVDFNDSASY AKYKGSFKNK DDWRRNNINT LVKSTYNTVK NIKPSVQFGI SPFAIWSNKD RNKEGSDTQG GISTYYDHYA DSKKWVKEAY IDYIAPQIYW NIGFKVADYS VLVNWWKNVC RGTKVKLYVG HAAYKINDTT QSNDWLDPLQ IPKQIAYNRK SNSVDGSIFY GYSKLKNNTL GIKDKLKGIF VSGRDPGSTV PEDRQLYIAS PSNGYKTSSS RISIMGGGDP DQPIYLNGKK IETSSNGYFT VYMDLKVGEN KFVFKHKGKE TVLKITRNTK TTSSPYKMTK AEFRNGYFSP TQSMTMQTGK KITFSCQAPA GAKVWVEIGG YKAELKQTAT VDANKGTLTP AKYSGTFTMP SVSGKERTKS LGKPVFVMEY NGKRITSEQS NIISVQSSKY YKYAVVNTSD AEAVARSGPS TDYSRITPLI NGAADYIVGQ QNGFYLLKSG VWTATSNVKV INDKAIATNK VSSVTLKSNG SYTDISFKMP VNTVFGVKSA SNTLKLTLYN TSGMSVNKSI PSDAPFSSIG YKAVSGGAQY TFQLKSEGNY FGYYAEYKNG SLVFSMKNAP KISKSGSKPL KGLKVVLDAG HGGSESGAIG PMGRYGLYEK QVNLGITLNA RKYLQSLGAT VVMTRTSDKT VSLNDRANLI RKEKPDIAVS IHNNSMDVTA DYTKHTGLLV LYSKDSSKVV AGYIKDQLVA DLKRRDDGYR WQSLSVCTVT QSPAILIEGG FMSNPAEYEW LADYDNQVKI GNSVGKAIEN WAYANAR
|
| |