Gene Acel_0618 details

Gene Information

Locus tag: Acel_0618
Symbol:
ID: 4486400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 664806
End bp: 668702
Gene Length: 3897 bp
Protein Length: 1298 aa
Translation table: 11
GC content: 63%
IMG OID: 639729385
Product: cellulose-binding family II protein
Protein accession: YP_872377
Protein GI: 117927826
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
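
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Coordinates copied from the Gene Information table.
start_bp = 664806
end_bp = 668702


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, not counting the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 3897
print(protein_length(length_bp))  # 1298
```

Both results agree with the Gene Length and Protein Length fields of the record.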


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0939482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.00792989
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATCGTT CGGAGAACAT CCGTCTGACT ATGAGATCAC GACGATTGGT ATCACTGCTC 
GCCGCCACTG CGTCGTTCGC CGTGGCCGCC GCTCTGGGAG TTCTGCCCAT CGCGATAACG
GCTTCTCCTG CGCACGCGGC GACGACTCAG CCGTACACCT GGAGCAACGT GGCGATCGGG
GGCGGCGGCT TTGTCGACGG GATCGTCTTC AATGAAGGTG CACCGGGAAT TCTGTACGTG
CGGACGGACA TCGGGGGGAT GTATCGATGG GATGCCGCCA ACGGGCGGTG GATCCCTCTT
CTGGATTGGG TGGGATGGAA CAATTGGGGG TACAACGGCG TCGTCAGCAT TGCGGCAGAC
CCGATCAATA CTAACAAGGT ATGGGCCGCC GTCGGAATGT ACACCAACAG CTGGGACCCA
AACGACGGAG CGATTCTCCG CTCGTCTGAT CAGGGCGCAA CGTGGCAAAT AACGCCCCTG
CCGTTCAAGC TTGGCGGCAA CATGCCCGGG CGTGGAATGG GCGAGCGGCT TGCGGTGGAT
CCAAACAATG ACAACATTCT GTATTTCGGC GCCCCGAGCG GCAAAGGGCT CTGGAGAAGC
ACAGATTCCG GCGCGACCTG GTCCCAGATG ACGAACTTTC CGGACGTAGG CACGTACATT
GCAAATCCCA CTGACACGAC CGGCTATCAG AGCGATATTC AAGGCGTCGT CTGGGTCGCT
TTCGACAAGT CTTCGTCATC GCTCGGGCAA GCGAGTAAGA CCATTTTTGT GGGCGTGGCG
GATCCCAATA ATCCGGTCTT CTGGAGCAGA GACGGCGGCG CGACGTGGCA GGCGGTGCCG
GGTGCGCCGA CCGGCTTCAT CCCGCACAAG GGCGTCTTTG ACCCGGTCAA CCACGTGCTC
TATATTGCCA CCAGCAATAC GGGTGGTCCG TATGACGGGA GCTCCGGCGA CGTCTGGAAA
TTCTCGGTGA CCTCCGGGAC ATGGACGCGA ATCAGCCCGG TACCTTCGAC GGACACGGCC
AACGACTACT TTGGTTACAG CGGCCTCACT ATCGACCGCC AGCACCCGAA CACGATAATG
GTGGCAACCC AGATATCGTG GTGGCCGGAC ACCATAATCT TTCGGAGCAC CGACGGCGGT
GCGACGTGGA CGCGGATCTG GGATTGGACG AGTTATCCCA ATCGAAGCTT GCGATATGTG
CTTGACATTT CGGCGGAGCC TTGGCTGACC TTCGGCGTAC AGCCGAATCC TCCCGTACCG
AGTCCGAAGC TCGGCTGGAT GGATGAAGCG ATGGCAATCG ATCCGTTCAA CTCTGATCGG
ATGCTCTACG GAACAGGCGC GACGTTGTAC GCAACAAATG ATCTCACGAA GTGGGACTCC
GGCGGCCAGA TTCATATCGC GCCGATGGTC AAAGGATTGG AGGAGACGGC GGTAAACGAT
CTCATCAGCC CGCCGTCTGG CGCCCCGCTC ATCAGCGCTC TCGGAGACCT CGGCGGCTTC
ACCCACGCCG ACGTTACTGC CGTGCCATCG ACGATCTTCA CGTCACCGGT GTTCACGACC
GGCACCAGCG TCGACTATGC GGAATTGAAT CCGTCGATCA TCGTTCGCGC TGGAAGTTTC
GATCCATCGA GCCAACCGAA CGACAGGCAC GTCGCGTTCT CGACAGACGG CGGCAAGAAC
TGGTTCCAAG GCAGCGAACC TGGCGGGGTG ACGACGGGCG GCACCGTCGC CGCATCGGCC
GACGGCTCTC GTTTCGTCTG GGCTCCCGGC GATCCCGGTC AGCCTGTGGT GTACGCAGTC
GGATTTGGCA ACTCCTGGGC TGCTTCGCAA GGTGTTCCCG CCAATGCCCA GATCCGCTCA
GACCGGGTGA ATCCAAAGAC TTTCTATGCC CTATCCAATG GAACCTTCTA TCGAAGCACG
GACGGCGGCG TGACATTCCA ACCGGTCGCG GCCGGTCTTC CGAGCAGCGG TGCCGTCGGT
GTCATGTTCC ACGCGGTGCC TGGAAAAGAA GGCGATCTGT GGCTCGCTGC ATCGAGCGGG
CTTTACCACT CAACCAATGG CGGCAGCAGT TGGTCTGCAA TCACCGGCGT ATCCTCCGCG
GTGAACGTGG GATTTGGTAA GTCTGCGCCC GGGTCGTCAT ACCCAGCCGT CTTTGTCGTC
GGCACGATCG GAGGCGTTAC GGGGGCGTAC CGCTCCGACG ACGGTGGGAC GACCTGGGTA
CGGATCAATG ATGACCAGCA CCAATACGGA AATTGGGGAC AAGCAATCAC CGGTGACCCG
CGAATTTACG GGCGGGTGTA CATAGGCACG AACGGCCGTG GAATTGTCTA CGGGGACATT
GCTGGTGCGC CGTCCGGATC GCCGTCTCCG TCGGTGAGTC CGTCGGCTTC GCCGAGCCTG
AGCCCGAGCC CGAGCCCGAG CAGCTCGCCA TCGCCGTCGC CGTCACCGAG CTCGAGTCCA
TCCTCGTCGC CGTCTCCGTC GCCGTCACCA TCGCCGAGTC CGTCTCGGTC TCCGTCACCA
TCGGCGTCGC CGAGCCCGTC TTCGTCACCG AGCCCGTCTT CGTCACCGTC TTCGTCGCCG
AGCCCAACGC CGTCGTCGTC GCCGGTGTCG GGTGGGGTGA AGGTGCAGTA TAAGAATAAT
GATTCGGCGC CGGGTGATAA TCAGATCAAG CCGGGTTTGC AGGTGGTGAA TACCGGGTCG
TCGTCGGTGG ATTTGTCGAC GGTGACGGTG CGGTACTGGT TCACCCGGGA TGGTGGCTCG
TCGACACTGG TGTACAACTG TGACTGGGCG GCGATCGGGT GTGGGAATAT CCGCGCCTCG
TTCGGCTCGG TGAACCCGGC GACGCCGACG GCGGACACCT ACCTGCAGTT GTCGTTCACT
GGTGGAACGT TGGCCGCTGG TGGGTCGACG GGTGAGATTC AAAACCGGGT GAATAAGAGT
GACTGGTCGA ATTTCACCGA GACGAATGAC TACTCGTATG GGACGAACAC CGTCTTCCAG
GATTGGTCGA AGGTGACGGT GTACGTCAAC GGCCGGCTGG TGTGGGGGAC TGAACCGTCC
GGCACCAGCC CCAGCCCCAC ACCCAGCCCC AGCCCCACAC CATCCCCGAG CCCGAGCCCG
AGCCCGGGTG GGGATGTGAC GCCGCCGAGT GTGCCGACCG GCGTGGTGGT GACGGGGGTG
AGTGGGTCGT CGGTGTCGTT GGCGTGGAAT GCGTCGACGG ATAACGTGGG GGTGGCGCAT
TACAACGTGT ACCGCAACGG GGTGTTGGTG GGCCAGCCGA CGGTGACCTC GTTCACCGAC
ACGGGTTTGG CCGCGGGAAC CGCCTACACC TACACGGTGG CCGCGGTGGA CGCTGCGGGC
AACACCTCCG CCCCATCCAC CCCCGTCACC GCCACCACCA CGAGTCCCAG CCCCAGCCCC
AGCCCGACCC CCAGCCCGAC CCCCAGCCCG ACCCCCAGCC CCAGCCCCAG CCCCAGCCTC
TCCCCGTCCC CGTCGCCGAG CCCGAGCCCG AGCCCGAGCC CGAGCCTCAG CCCGAGCCCG
AGCACGTCCC CAAGCCCAAG CCCAAGCCCC ACGCCGTCCC CGTCGTCGTC GGGTGTGGGG
TGCCGGGCGA CGTATGTGGT GAATAGTGAT TGGGGTTCTG GGTTTACGGC GACGGTGACG
GTGACGAATA CCGGGAGCCG GGCGACGAGC GGGTGGACGG TGGCGTGGTC GTTTGGTGGG
AATCAGACGG TCACGAACTA CTGGAACACC CTGTTGACCC AATCAGGTGC ATCGGTGACG
GCGACGAACC TGAGCTACAA CAACGTGATC CAACCCGGTC AGTCGACCAC CTTCGGGTTC
AACGCCACCT ACGCCGGAAC CAACACCCCA CCCACCCCCA CCTGCACCAC CAACTAA
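
The gene sequence above is laid out in 10-base blocks separated by spaces, so it has to be joined before any analysis. The sketch below shows one way to strip that layout and compute a GC fraction, the quantity reported in the GC content field. The helper names are hypothetical, and the demo input is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used only as a demo.
demo = "ATGGATCGTT CGGAGAACAT"
seq = clean_sequence(demo)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.45
```

Running the same two functions on the complete sequence block would give the gene-wide value to compare against the GC content field.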
 
Protein sequence
MDRSENIRLT MRSRRLVSLL AATASFAVAA ALGVLPIAIT ASPAHAATTQ PYTWSNVAIG 
GGGFVDGIVF NEGAPGILYV RTDIGGMYRW DAANGRWIPL LDWVGWNNWG YNGVVSIAAD
PINTNKVWAA VGMYTNSWDP NDGAILRSSD QGATWQITPL PFKLGGNMPG RGMGERLAVD
PNNDNILYFG APSGKGLWRS TDSGATWSQM TNFPDVGTYI ANPTDTTGYQ SDIQGVVWVA
FDKSSSSLGQ ASKTIFVGVA DPNNPVFWSR DGGATWQAVP GAPTGFIPHK GVFDPVNHVL
YIATSNTGGP YDGSSGDVWK FSVTSGTWTR ISPVPSTDTA NDYFGYSGLT IDRQHPNTIM
VATQISWWPD TIIFRSTDGG ATWTRIWDWT SYPNRSLRYV LDISAEPWLT FGVQPNPPVP
SPKLGWMDEA MAIDPFNSDR MLYGTGATLY ATNDLTKWDS GGQIHIAPMV KGLEETAVND
LISPPSGAPL ISALGDLGGF THADVTAVPS TIFTSPVFTT GTSVDYAELN PSIIVRAGSF
DPSSQPNDRH VAFSTDGGKN WFQGSEPGGV TTGGTVAASA DGSRFVWAPG DPGQPVVYAV
GFGNSWAASQ GVPANAQIRS DRVNPKTFYA LSNGTFYRST DGGVTFQPVA AGLPSSGAVG
VMFHAVPGKE GDLWLAASSG LYHSTNGGSS WSAITGVSSA VNVGFGKSAP GSSYPAVFVV
GTIGGVTGAY RSDDGGTTWV RINDDQHQYG NWGQAITGDP RIYGRVYIGT NGRGIVYGDI
AGAPSGSPSP SVSPSASPSL SPSPSPSSSP SPSPSPSSSP SSSPSPSPSP SPSPSRSPSP
SASPSPSSSP SPSSSPSSSP SPTPSSSPVS GGVKVQYKNN DSAPGDNQIK PGLQVVNTGS
SSVDLSTVTV RYWFTRDGGS STLVYNCDWA AIGCGNIRAS FGSVNPATPT ADTYLQLSFT
GGTLAAGGST GEIQNRVNKS DWSNFTETND YSYGTNTVFQ DWSKVTVYVN GRLVWGTEPS
GTSPSPTPSP SPTPSPSPSP SPGGDVTPPS VPTGVVVTGV SGSSVSLAWN ASTDNVGVAH
YNVYRNGVLV GQPTVTSFTD TGLAAGTAYT YTVAAVDAAG NTSAPSTPVT ATTTSPSPSP
SPTPSPTPSP TPSPSPSPSL SPSPSPSPSP SPSPSLSPSP STSPSPSPSP TPSPSSSGVG
CRATYVVNSD WGSGFTATVT VTNTGSRATS GWTVAWSFGG NQTVTNYWNT LLTQSGASVT
ATNLSYNNVI QPGQSTTFGF NATYAGTNTP PTPTCTTN