Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2213 |
Symbol | |
ID | 7310901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2585235 |
End bp | 2586275 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643609145 |
Product | hypothetical protein |
Protein accession | YP_002506535 |
Protein GI | 220929626 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTACG ATACTATATC TTTAAACGAA AAGCAAGTCA AGACCATACA AAAAAAGTCT GCCAAAAAGA AGAAAAAGAA AAAAGGCAGA CTCAGGAGTC TTTTGGGTTT TTTAATATTT GAGTTTTTCT TTATGTCAAT AACAACACCG CTGCTTATAT TCTACGGGCC CTTTGAGAAC GTAAAAAGAA CCGCTACAGG CATGGTCTGG AACTCAATGA CCAAGCAGGT TATAGCCAAG ACCTTCTTGT CTGATAAGGC TATTGCTAAA ATTCTGGGAG ACGGGTATGC AATCTCCAAT ATAAATACTG AAGATATAAA AATGTTGGAT TTCAGGGTAA AGCATAATAA CAATCTGGAA TATTTTGATG TTGAGAGCAG AAATTTCAAA GGCAAAATGA TTATAGTGGA TGATCCTACG CGTATTAAAG TAGGGTATTC CAGCAAAATG CCGCGGTCTG GGGAAACTAC CAGCAGTATT GCAAGGCGAA ACGGGGCAGT TGCTGCCATT AACGGAGGAG GCTTCATTGA CAAAGGGTGG GCAGGAACTG GAGGAGTAGC AATTGGTTTT GTAATAAGCA ACGGCAAATA CATTAGCGGA AAGCTGACTA ACAACTATAC AAAAAGGGAT ACTATTGCAT TTACAAAAGA TGGTATGTTA ATTGTAGGTA AACATTCCCA AGCAGAACTA GCTAAATATA ATATTAAAGA GGGAATAAGC TTCGGCCCGC CTTTAATTGT TAACGGCAAG CCTACTATCA ACAAGGGTGA CGGAGGCTGG GGCATATCCC CAAGAACTGC AATAGGTCAA AAAGAAGATG GCTCAGTAAT GCTTCTTGTT ATTGATGGAA GAAGCCTAAA GTCCTTTGGA GCAACTTTAA AAGAGGTTCA GGATATTATG CTGGAGCACG GAGCAGTCAA TGCTGCAAAC CTTGATGGAG GTTCATCGGC TACCATGTAC TATGACGGAA AAGTTGTAAA TACTCCGTCT GATGCGTTAG GAGAAAGAAC AGTAGCTACG GCATTTGTTG TAATGCCTTG A
|
Protein sequence | MNYDTISLNE KQVKTIQKKS AKKKKKKKGR LRSLLGFLIF EFFFMSITTP LLIFYGPFEN VKRTATGMVW NSMTKQVIAK TFLSDKAIAK ILGDGYAISN INTEDIKMLD FRVKHNNNLE YFDVESRNFK GKMIIVDDPT RIKVGYSSKM PRSGETTSSI ARRNGAVAAI NGGGFIDKGW AGTGGVAIGF VISNGKYISG KLTNNYTKRD TIAFTKDGML IVGKHSQAEL AKYNIKEGIS FGPPLIVNGK PTINKGDGGW GISPRTAIGQ KEDGSVMLLV IDGRSLKSFG ATLKEVQDIM LEHGAVNAAN LDGGSSATMY YDGKVVNTPS DALGERTVAT AFVVMP
|
| |