Gene Ccel_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2221 
Symbol 
ID7310908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2593951 
End bp2596944 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content39% 
IMG OID643609153 
Productprotein of unknown function DUF187 
Protein accessionYP_002506543 
Protein GI220929634 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase
[COG1649] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.155376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AAATCGGGGT AGTATGTATA TTGCTGGTTT TTCTGATGAT TTTGCCAATT 
GCTGGTTACA AGCTGTTTTC AGACAGGAAC TATGACGGAA ATGTCTCAAA TGCACAGACA
GTATCAAAAA TTGAAGATTT GAGGGGAGTA TGGATTGCAT CTGTGTCAAA TATAGATTTT
CCCTCAAAGC CGGGTATCAG TGCGGAAAAG CAGAAGAAAG AGCTGGATGA TATCATCAGC
AATGCTCAAT ACATGGGTCT TAATGCTATA TTTTTCCAGA TAAGACCGAC GGGAGATGCT
CTTTATAAAT CTACAATTTT TCCATGGTCT GCATACCTGA CAGGAAAGCA GGGAAAAGAG
AATGATAATG GTTTTGACCC TTTAGCGTAC ATAATTGAAC AGGCTCATAA AAAAGGAATA
CAGATACATG CGTGGATAAA TCCTCTTAGA TTATCAATGG GTACAACCAG TAATCCTACC
GGAAATATTA ATGTACTTTC GGACAACCAT CCCGCTAGAA AGATTCCTGA AGCAGTTGTA
GCAGCCCCTA CAGGTCAGCT GTATCTTGAT CCGGGAAATC CCGCCGCAAT TAAGCTGATA
ACTGATGGTG TAGCCGAAAT TGTCAAAAAT TATGATGTTG ATGGAATACA TTTTGATGAT
TATTTCTATC CTTCAAAATC GGAAGGAAAA GGTGTTGATT TTAACGACTC GGCCTCTTAT
GCAAAGTACA AGGGAAGTTT TAAAAACAAG GATGACTGGA GACGCAATAA CATCAATACA
CTTGTAAAAA GCACCTACAA CACTGTTAAA AATATTAAGC CTTCAGTTCA GTTTGGAATA
AGTCCTTTTG CAATATGGTC AAACAAGGAT AGAAATAAGG AAGGTTCAGA TACACAAGGC
GGAATATCCA CATATTATGA CCACTATGCA GATTCTAAAA AGTGGGTAAA AGAAGCATAC
ATAGATTACA TAGCACCACA GATTTATTGG AATATAGGGT TCAAGGTAGC AGATTATTCT
GTACTGGTAA ACTGGTGGAA GAACGTGTGT AGAGGAACAA AAGTAAAGCT TTATGTAGGA
CATGCCGCAT ACAAAATAAA TGATACAACA CAGTCAAATG ACTGGCTTGA CCCGCTTCAA
ATTCCAAAGC AGATTGCATA TAACAGAAAA AGCAATTCTG TAGACGGAAG CATATTTTAC
GGATACTCCA AGCTGAAGAA CAATACACTT GGTATAAAGG ACAAGCTCAA AGGAATATTT
GTATCAGGCA GAGATCCCGG AAGTACCGTT CCAGAGGACA GGCAGCTTTA TATAGCTTCC
CCTTCTAACG GTTATAAAAC TTCATCATCC AGGATAAGTA TTATGGGGGG CGGTGATCCT
GACCAGCCAA TATACCTAAA CGGTAAGAAA ATTGAGACTT CCTCCAATGG TTACTTCACC
GTATATATGG ACTTGAAAGT TGGAGAGAAC AAATTTGTAT TCAAACATAA AGGTAAGGAG
ACAGTGTTAA AGATAACACG CAATACAAAA ACAACCTCGA GTCCTTACAA GATGACAAAG
GCTGAGTTCA GAAATGGTTA TTTTTCACCT ACCCAGAGTA TGACTATGCA GACGGGTAAA
AAGATTACAT TTTCATGTCA AGCTCCTGCA GGAGCAAAGG TTTGGGTCGA AATCGGAGGC
TATAAGGCAG AGTTGAAACA GACAGCCACT GTTGATGCAA ACAAAGGAAC ACTAACACCT
GCAAAGTATT CAGGAACCTT TACAATGCCC TCTGTTTCAG GAAAAGAACG TACTAAAAGC
CTCGGAAAGC CTGTATTTGT AATGGAATAT AATGGTAAAA GAATTACTTC AGAACAAAGT
AATATAATAA GTGTACAATC ATCCAAATAC TATAAATATG CTGTTGTAAA TACCAGTGAT
GCTGAGGCGG TAGCACGTTC CGGGCCATCA ACGGATTATT CAAGAATAAC ACCTTTGATT
AACGGTGCGG CAGATTATAT TGTAGGTCAG CAAAATGGTT TTTATCTTCT GAAGAGCGGA
GTATGGACAG CCACAAGCAA TGTAAAAGTT ATAAACGACA AAGCAATTGC AACTAACAAG
GTTTCGTCAG TTACATTAAA GTCAAACGGC AGCTATACGG ATATAAGCTT TAAGATGCCT
GTTAATACGG TATTTGGTGT TAAGTCAGCT TCAAATACGC TTAAATTGAC CCTGTACAAT
ACATCGGGTA TGAGCGTTAA TAAATCGATA CCATCTGATG CACCATTTTC TTCAATAGGG
TACAAGGCTG TTTCCGGCGG AGCTCAATAT ACATTTCAAT TAAAATCTGA GGGCAACTAT
TTTGGTTACT ATGCAGAATA TAAAAACGGT TCACTTGTAT TCTCCATGAA AAATGCACCC
AAGATTTCCA AGAGCGGTTC AAAGCCGCTT AAAGGCTTGA AGGTAGTACT TGATGCAGGT
CATGGAGGTT CAGAATCGGG AGCTATTGGG CCTATGGGCA GATACGGACT CTATGAAAAA
CAGGTAAACC TGGGAATAAC GTTAAATGCC CGGAAATATC TGCAATCACT TGGTGCAACG
GTTGTTATGA CAAGGACTTC GGATAAGACT GTCAGCCTGA ATGACAGGGC AAACCTCATA
CGCAAGGAAA AGCCGGATAT TGCAGTATCA ATTCATAACA ATTCAATGGA TGTAACGGCT
GACTATACAA AGCATACAGG GTTACTGGTA CTGTACTCAA AAGACAGCTC TAAGGTTGTA
GCCGGATATA TCAAAGATCA GTTGGTTGCA GACCTGAAGA GAAGGGATGA TGGCTACAGA
TGGCAGAGTC TTTCAGTATG TACTGTCACT CAATCACCTG CAATACTTAT TGAGGGAGGA
TTTATGTCAA ATCCTGCCGA GTATGAGTGG CTTGCAGATT ATGACAATCA GGTTAAGATA
GGTAATTCCG TTGGTAAAGC CATTGAAAAC TGGGCATATG CCAATGCAAG ATAA
 
Protein sequence
MNKKIGVVCI LLVFLMILPI AGYKLFSDRN YDGNVSNAQT VSKIEDLRGV WIASVSNIDF 
PSKPGISAEK QKKELDDIIS NAQYMGLNAI FFQIRPTGDA LYKSTIFPWS AYLTGKQGKE
NDNGFDPLAY IIEQAHKKGI QIHAWINPLR LSMGTTSNPT GNINVLSDNH PARKIPEAVV
AAPTGQLYLD PGNPAAIKLI TDGVAEIVKN YDVDGIHFDD YFYPSKSEGK GVDFNDSASY
AKYKGSFKNK DDWRRNNINT LVKSTYNTVK NIKPSVQFGI SPFAIWSNKD RNKEGSDTQG
GISTYYDHYA DSKKWVKEAY IDYIAPQIYW NIGFKVADYS VLVNWWKNVC RGTKVKLYVG
HAAYKINDTT QSNDWLDPLQ IPKQIAYNRK SNSVDGSIFY GYSKLKNNTL GIKDKLKGIF
VSGRDPGSTV PEDRQLYIAS PSNGYKTSSS RISIMGGGDP DQPIYLNGKK IETSSNGYFT
VYMDLKVGEN KFVFKHKGKE TVLKITRNTK TTSSPYKMTK AEFRNGYFSP TQSMTMQTGK
KITFSCQAPA GAKVWVEIGG YKAELKQTAT VDANKGTLTP AKYSGTFTMP SVSGKERTKS
LGKPVFVMEY NGKRITSEQS NIISVQSSKY YKYAVVNTSD AEAVARSGPS TDYSRITPLI
NGAADYIVGQ QNGFYLLKSG VWTATSNVKV INDKAIATNK VSSVTLKSNG SYTDISFKMP
VNTVFGVKSA SNTLKLTLYN TSGMSVNKSI PSDAPFSSIG YKAVSGGAQY TFQLKSEGNY
FGYYAEYKNG SLVFSMKNAP KISKSGSKPL KGLKVVLDAG HGGSESGAIG PMGRYGLYEK
QVNLGITLNA RKYLQSLGAT VVMTRTSDKT VSLNDRANLI RKEKPDIAVS IHNNSMDVTA
DYTKHTGLLV LYSKDSSKVV AGYIKDQLVA DLKRRDDGYR WQSLSVCTVT QSPAILIEGG
FMSNPAEYEW LADYDNQVKI GNSVGKAIEN WAYANAR