Gene Ccel_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1036 
Symbol 
ID7309858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1288460 
End bp1289965 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content38% 
IMG OID643607963 
Productalpha-L-arabinofuranosidase domain protein 
Protein accessionYP_002505378 
Protein GI220928469 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0176098 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGG CGAAAATCGT ATTAGACAAA GATTTTATAA TTTCAAAAAT AGACGAAAAA 
GTATTTGGTT CATTTGTTGA ACCTTTAGGT AGATGCATAT ATGGCGGTAT TTATGAGCCT
GGACATCCTG CTGCCGATGA AAAAGGCTTT AGAAGAGATG TTCTGGAATT AACCAAGCCA
TTGAATGTTA CATTAAATCG TTTTCCGGGT GGGAATTACG TATCAACCTT TCGTTGGGAA
GACGGAATCG GCCCTAAAGA AAAGAGGCCA CGTCGTGCTG AGGTTGCTTG GCAAAGTATA
GAAACAAATC AATTCGGGAT TAATGAATTC GCTGATTGGT CAAAATTAAA CGGATCTGAT
GTAATGATGA CAGTCAATCT TGCAACAAGA GGCGTTTTAG AAGCAATGGA TTGCGTGGAG
TATTGTAACT TTAAGGAAGG AACTTATTGG TCTGATCTGC GTATTTCTCA TGGTTACAAA
GAGCCTCATG GATATCGTTA CTGGTGCTTG ACCAATGAAA TCGATGGTGT TTGGCAGGTT
GGCCAGAAAA CCGGAACAGA TTACGGTAGA ATAGCAAGAG AAGCGTCAAA GGGAATGAAA
CTTCTTGATG AGAATATTAA AACGGTATTA GCCGGTTCTT CTTCACCGTC GCAGGATAGT
TTCCCAAGCT TTGATGCAGC CGCTCTTGAA GAATCTTACG AGTTTATAGA TTACTTATCA
ATACATCAGT ATATAGGAAA TGCTAAGAAT GATACACCAA ACTACCTTGC AAAGCCTTTG
ATTACTGACA AATATCTTAA GACTGCAATC GCTACCATTG ACTATATTAA GGCTAAGACC
AAGAGCAAAA ATAAAGTAAA TATTTCATTT GATGAATTTA ACACATGGCA TTCAATTGCT
GAGGAAGCAC GTTTTAATAA TAAGTGGCGG ATTGCTCCTC CTCTATTAGA AGATGAATAT
ACATTAGAAG ATGCATTAGC TCTTGGCGGT ATGCTGCTTG CAGTACTAAA AAATGCTGAC
CGTGTTGAAA TTGCTTGTAT CTCAGAATTA GTGAATTGTA TTTCTCATAT ACGTACCAGA
AATGGCGGGG GTGCGTGGGT ACTGCCACCT TATTACACCT TCCTGTTATT CTCAAAATAC
GGTAGGGGAA CATCATTAGT TACTTCAATA AGTTCTCCGA AATATGACTC TACTGATTTT
ACTGATGTCC CTTATCTTGA TGCAGCAGCA ACAATGGATG ACAATGGTGA CGTTACTATA
TTTGCAATTA ATAGGAGCAC AGAGGAAACT CTGCCCCTTG AAACCGAGTT GAGAGGATTT
GAAAACTATA GGGTGGAAAC TCATATTGTT CTTACGAGTG CGAACCCAAA AGATACTAAT
ACAGAAGAGT GTCCAAACTA TGTTACTCCA AAGAATAATG GTGATGCACA AATAGACGGA
AATAAAGTTT TAGCGAATTT GCCGAGACTT TCCTGGAATG TTATTCGACT CCAAAAAGTT
AAATAA
 
Protein sequence
MNKAKIVLDK DFIISKIDEK VFGSFVEPLG RCIYGGIYEP GHPAADEKGF RRDVLELTKP 
LNVTLNRFPG GNYVSTFRWE DGIGPKEKRP RRAEVAWQSI ETNQFGINEF ADWSKLNGSD
VMMTVNLATR GVLEAMDCVE YCNFKEGTYW SDLRISHGYK EPHGYRYWCL TNEIDGVWQV
GQKTGTDYGR IAREASKGMK LLDENIKTVL AGSSSPSQDS FPSFDAAALE ESYEFIDYLS
IHQYIGNAKN DTPNYLAKPL ITDKYLKTAI ATIDYIKAKT KSKNKVNISF DEFNTWHSIA
EEARFNNKWR IAPPLLEDEY TLEDALALGG MLLAVLKNAD RVEIACISEL VNCISHIRTR
NGGGAWVLPP YYTFLLFSKY GRGTSLVTSI SSPKYDSTDF TDVPYLDAAA TMDDNGDVTI
FAINRSTEET LPLETELRGF ENYRVETHIV LTSANPKDTN TEECPNYVTP KNNGDAQIDG
NKVLANLPRL SWNVIRLQKV K