Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2123 |
Symbol | |
ID | 7310821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2489013 |
End bp | 2490290 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643609056 |
Product | glycosyl hydrolase 53 domain protein |
Protein accession | YP_002506447 |
Protein GI | 220929538 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.052262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGTT CGAAAAATAA AAAATTAATT GTTATTTCTC TTTTGATATT GATACTAGTG TTTGAAAACC TGATTCTGCA CATAAATAAT TACACTGTTT CTGCTGCCGC TCCACAGCTT TTATATGGTG ACGTTGACGT GAGCGGAGAA GTGAATTCTT TGGATTATGC ACTAATAAAA AGCTATTTGC TGGGAAATAT AACTGATTTC CCGGATGTCA ACGGTAAAAA GGCTGCTGAT GTAAACGGTG ACGGCAGCAT AGACTCTTTG GACATTTCAT TGATAAAAAG TTTTATTTTG GGGATTATAG AAAGGTTTCC AATAGAAACT CCTGCAAATA CGTTTGCAAA AGGTGCGGAT ATAAGCTGGT TGCCGCAGAT GGAGGCAAGC GGATATAAAT TTTATAACAA TAAAGGCCTT CAGCAGGACT GCTTACAGAT CTTAAAGGAT TATGGAGTTA ACTCAGTCAG AATAAGAACG TGGGTAAATC CGTCAACTGA TAAATGGAAT GGCCACTGCA GTACCAATGA AACAATAGCT TTAGCAAAGA GGGCTAAGAA CCTGGGGTTC AGGATTATGA TTGACTTTCA TTACAGTGAT TCTTGGGCAG ATCCCGGAAA GCAGACAAAA CCCGCAGCCT GGTTAAATCT AGATTTCAAT GGGTTAATGA AAATGACATA TGACTATACA TATGATGTAA TGACTAAACT AAGGAATAAT GGAATATTAC CAGAGTGGGT GCAGGTTGGG AATGAAACCA ACAATGGCAT GCTATGGGAG GATGGAAAAG CGTCAAACAA TATGAAAAAT TTCGCATGGC TTGTGAATTG CGGTTATGAT GCTGTAAAAG CTGTAAACCC CAAAACCAAG GTAATTGTAC ACATATCAAA TGGATTTAAC AATACATTGT TTAGGTGGAT GTTTGACGGA CTTAACTCCA ATGGGGCAAA ATACGATGTT ATAGGAATGT CATTATATCC TGACAAAGAC AATTATCCTG CTCTTTTAAA CCAATGCCTG AATAATATGA ATGATATGGT ATCAAGGTAT AACAAAGAAA TAATGATTTG TGAAATAGGA ATGCAATATA ACTATGCTTC AGAGAGTAAA GCTTTTATTA TAGATATGGT AAATAAGACT AAATCGTTAC CAAACAATAA AGGTCTGGGC GTATTCTATT GGGAACCGGA GTCATATCCA GGAATGAACG GTTACAATAA AGGCTGTTGG AATTCTGATG GAAAGCCTAC AATCGCATTG GATGGATTTT TAAATTAG
|
Protein sequence | MKGSKNKKLI VISLLILILV FENLILHINN YTVSAAAPQL LYGDVDVSGE VNSLDYALIK SYLLGNITDF PDVNGKKAAD VNGDGSIDSL DISLIKSFIL GIIERFPIET PANTFAKGAD ISWLPQMEAS GYKFYNNKGL QQDCLQILKD YGVNSVRIRT WVNPSTDKWN GHCSTNETIA LAKRAKNLGF RIMIDFHYSD SWADPGKQTK PAAWLNLDFN GLMKMTYDYT YDVMTKLRNN GILPEWVQVG NETNNGMLWE DGKASNNMKN FAWLVNCGYD AVKAVNPKTK VIVHISNGFN NTLFRWMFDG LNSNGAKYDV IGMSLYPDKD NYPALLNQCL NNMNDMVSRY NKEIMICEIG MQYNYASESK AFIIDMVNKT KSLPNNKGLG VFYWEPESYP GMNGYNKGCW NSDGKPTIAL DGFLN
|
| |