Gene Acel_0128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0128 
Symbol 
ID4484616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp129121 
End bp130548 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content66% 
IMG OID639728890 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_871889 
Protein GI117927338 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.174382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTCG TTCAGCTCGT CAATAGCTGC CTGTTGCCCG GCTTTGCCGG TGGTGACCAG 
TTACCGGACT GGGTACGCCG GGCGCTCGAC CAGGGACTCG CCGGCGTGGC GATTTACGGG
CACAACCTCG TGGACGACGG GTCGGTCGCC CGCATTGCGC AGGCCGTTCA CAACACCGCG
CCGGATGCAC TGGTTGCCCT CGACGAAGAA GGCGGCGACG TCACCCGACT CGAATATCGG
ACGGGTAGCT CCTACCCGGG CAATTTGGCG CTCGGCGTGG TGGACGACCT CGAACTGACG
GCACGTGTCG CGGCGGCGAT CGCAGCGGAT CTCGTTGCCG CAGGAGTGAA TTACAACCTG
GCGCCGGCCG TCGACGTGAA TAGCGATCCG CGAAATCCAG TCATCGGCGT CCGGTCGTTC
GGCGCTGATC CCGACAAGGT GGCGGCGCAC GGTGCGACGT TCATCACCGC CATGCAGTCT
CGGGGAATTG CAACCGCAGC GAAGCATTTT CCGGGACACG GCGCCACCGT TGCCGATTCG
CACCACACGC TGCCAGTGAT CGACGTCGAT GAACCGACCT TTCGACGGCG GGATTTGCCG
CCGTTCGTCG CGGCGATCCA AGCCGGCGCA GCGAGCATCA TGACGTCACA CGTTGTCTTT
ACCGCGCTCG ACGCTGATCT CCCGGCCACT CTCAGTCCAC GCCTGCTGCG CGGCCTGCTT
CGCAGTGAAC TCGGCTATTC CGGCGTCGTC GTGACCGATG CGCTCGACAT GCGAGCCGTC
GCCGATACGT GGGGAATAGC CGGTGCGGCG GTTCGTGCGC TTGCTGCGGG TGCGGACCTG
CTTCTTGTCG GCGCTGTGGA CGGCGAGCGC TACTGCGCCG AGATTCACGC TGCGGTAACC
GACGCGATTG CGGCTGGTGA TCTGACCGTG GAGACGTTGG AGGCAGCCGC CGCGCGCATC
CGAGCGCTGC GGGAATTCGC AGCTGTTCGT CGCGGAGATT CTCGTCGTGC GGATGGCCGT
GGCGGGCGTG ACAGCGGCCT TCTCGCTGCA CGACGCGCCC TCCAGGTGCG CGGCGACGTG
CACATTGCGG AACCAGCTGT CGTCGTCGAG TTACGGGCTG CCGCCAATCC TGCCGTGGGT
GAGGCGTATT GGAGTCTTGC TGACGCCCTC GACAGATTTG GCCTCCTTGC GGAACGCATC
GCTGTTCATG ACGGAAGTCC GCATGCGGAC GAGATAGCGG CCCGCGCCCA GGGACGTCCG
CGGTGGTCGT CGCGGTCCGC GACGCCTATC GGAGTGCGTG GCAGCGCGAC TGGGTGCGCG
CTTTTTTCGG CGGGCGTCCG GACGCTGTAC TGGTGGCCGT CGGAATGCCG AATGATGCGG
AACTCTCTAA CGGGCGTGTC CTGCTTACCT TCGGCGCAGG CCTGGTGA
 
Protein sequence
MDVVQLVNSC LLPGFAGGDQ LPDWVRRALD QGLAGVAIYG HNLVDDGSVA RIAQAVHNTA 
PDALVALDEE GGDVTRLEYR TGSSYPGNLA LGVVDDLELT ARVAAAIAAD LVAAGVNYNL
APAVDVNSDP RNPVIGVRSF GADPDKVAAH GATFITAMQS RGIATAAKHF PGHGATVADS
HHTLPVIDVD EPTFRRRDLP PFVAAIQAGA ASIMTSHVVF TALDADLPAT LSPRLLRGLL
RSELGYSGVV VTDALDMRAV ADTWGIAGAA VRALAAGADL LLVGAVDGER YCAEIHAAVT
DAIAAGDLTV ETLEAAAARI RALREFAAVR RGDSRRADGR GGRDSGLLAA RRALQVRGDV
HIAEPAVVVE LRAAANPAVG EAYWSLADAL DRFGLLAERI AVHDGSPHAD EIAARAQGRP
RWSSRSATPI GVRGSATGCA LFSAGVRTLY WWPSECRMMR NSLTGVSCLP SAQAW