Gene Information |
Locus tag | Ccel_1036 |
Symbol | |
ID | 7309858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1288460 |
End bp | 1289965 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643607963 |
Product | alpha-L-arabinofuranosidase domain protein |
Protein accession | YP_002505378 |
Protein GI | 220928469 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
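The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 1288460
END_BP = 1289965
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

Under those assumptions, the reported gene length (1506 bp) and protein length (501 aa) are consistent with the listed start and end positions.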
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0176098 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGG CGAAAATCGT ATTAGACAAA GATTTTATAA TTTCAAAAAT AGACGAAAAA GTATTTGGTT CATTTGTTGA ACCTTTAGGT AGATGCATAT ATGGCGGTAT TTATGAGCCT GGACATCCTG CTGCCGATGA AAAAGGCTTT AGAAGAGATG TTCTGGAATT AACCAAGCCA TTGAATGTTA CATTAAATCG TTTTCCGGGT GGGAATTACG TATCAACCTT TCGTTGGGAA GACGGAATCG GCCCTAAAGA AAAGAGGCCA CGTCGTGCTG AGGTTGCTTG GCAAAGTATA GAAACAAATC AATTCGGGAT TAATGAATTC GCTGATTGGT CAAAATTAAA CGGATCTGAT GTAATGATGA CAGTCAATCT TGCAACAAGA GGCGTTTTAG AAGCAATGGA TTGCGTGGAG TATTGTAACT TTAAGGAAGG AACTTATTGG TCTGATCTGC GTATTTCTCA TGGTTACAAA GAGCCTCATG GATATCGTTA CTGGTGCTTG ACCAATGAAA TCGATGGTGT TTGGCAGGTT GGCCAGAAAA CCGGAACAGA TTACGGTAGA ATAGCAAGAG AAGCGTCAAA GGGAATGAAA CTTCTTGATG AGAATATTAA AACGGTATTA GCCGGTTCTT CTTCACCGTC GCAGGATAGT TTCCCAAGCT TTGATGCAGC CGCTCTTGAA GAATCTTACG AGTTTATAGA TTACTTATCA ATACATCAGT ATATAGGAAA TGCTAAGAAT GATACACCAA ACTACCTTGC AAAGCCTTTG ATTACTGACA AATATCTTAA GACTGCAATC GCTACCATTG ACTATATTAA GGCTAAGACC AAGAGCAAAA ATAAAGTAAA TATTTCATTT GATGAATTTA ACACATGGCA TTCAATTGCT GAGGAAGCAC GTTTTAATAA TAAGTGGCGG ATTGCTCCTC CTCTATTAGA AGATGAATAT ACATTAGAAG ATGCATTAGC TCTTGGCGGT ATGCTGCTTG CAGTACTAAA AAATGCTGAC CGTGTTGAAA TTGCTTGTAT CTCAGAATTA GTGAATTGTA TTTCTCATAT ACGTACCAGA AATGGCGGGG GTGCGTGGGT ACTGCCACCT TATTACACCT TCCTGTTATT CTCAAAATAC GGTAGGGGAA CATCATTAGT TACTTCAATA AGTTCTCCGA AATATGACTC TACTGATTTT ACTGATGTCC CTTATCTTGA TGCAGCAGCA ACAATGGATG ACAATGGTGA CGTTACTATA TTTGCAATTA ATAGGAGCAC AGAGGAAACT CTGCCCCTTG AAACCGAGTT GAGAGGATTT GAAAACTATA GGGTGGAAAC TCATATTGTT CTTACGAGTG CGAACCCAAA AGATACTAAT ACAGAAGAGT GTCCAAACTA TGTTACTCCA AAGAATAATG GTGATGCACA AATAGACGGA AATAAAGTTT TAGCGAATTT GCCGAGACTT TCCTGGAATG TTATTCGACT CCAAAAAGTT AAATAA
|
Protein sequence | MNKAKIVLDK DFIISKIDEK VFGSFVEPLG RCIYGGIYEP GHPAADEKGF RRDVLELTKP LNVTLNRFPG GNYVSTFRWE DGIGPKEKRP RRAEVAWQSI ETNQFGINEF ADWSKLNGSD VMMTVNLATR GVLEAMDCVE YCNFKEGTYW SDLRISHGYK EPHGYRYWCL TNEIDGVWQV GQKTGTDYGR IAREASKGMK LLDENIKTVL AGSSSPSQDS FPSFDAAALE ESYEFIDYLS IHQYIGNAKN DTPNYLAKPL ITDKYLKTAI ATIDYIKAKT KSKNKVNISF DEFNTWHSIA EEARFNNKWR IAPPLLEDEY TLEDALALGG MLLAVLKNAD RVEIACISEL VNCISHIRTR NGGGAWVLPP YYTFLLFSKY GRGTSLVTSI SSPKYDSTDF TDVPYLDAAA TMDDNGDVTI FAINRSTEET LPLETELRGF ENYRVETHIV LTSANPKDTN TEECPNYVTP KNNGDAQIDG NKVLANLPRL SWNVIRLQKV K