Gene Acel_0615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0615 
Symbol 
ID4486397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp655010 
End bp658639 
Gene Length3630 bp 
Protein Length1209 aa 
Translation table11 
GC content64% 
IMG OID639729382 
Productglycoside hydrolase family protein 
Protein accessionYP_872374 
Protein GI117927823 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.435681 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00420105 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCGCCA TCTCAAAACG GCTGCGAGCC GGCGTCCTCG CCGGGGCGGT GAGCATCGCA 
GCCTCCATCG TGCCGCTGGC GATGCAGCAT CCTGCCATCG CCGCGACGCA CGTCGACAAT
CCCTATGCGG GAGCGACCTT CTTCGTCAAC CCGTACTGGG CGCAAGAAGT ACAGAGCGAA
GCGGCGAACC AGACCAATGC CACTCTCGCA GCGAAAATGC GCGTCGTTTC CACATATTCG
ACGGCCGTCT GGATGGACCG CATCGCTGCG ATCAACGGCG TCAACGGCGG ACCCGGCTTG
ACGACATATC TGGACGCCGC CCTCTCCCAG CAGCAGGGAA CCACCCCTGA AGTCATTGAG
ATTGTCATCT ACGATCTGCC GGGACGCGAC TGCGCGGCGC TCGCCTCCAA CGGCGAACTG
CCCGCTACGG CAGCAGGTTT GCAGACCTAT GAAACGCAGT ACATCGATCC GATTGCGAGT
ATCCTGAGCA ATCCGAAGTA CTCCAGCCTG CGGATCGTGA CGATCATTGA GCCGGACTCG
CTGCCAAACG CGGTCACCAA TATGAGCATT CAAGCGTGTG CAACGGCGGT GCCGTATTAC
GAGCAAGGCA TCGAGTACGC GCTCACGAAA TTGCACGCCA TTCCGAACGT GTACATCTAC
ATGGACGCCG CCCACTCCGG CTGGCTTGGG TGGCCCAATA ATGCCAGCGG ATACGTACAG
GAAGTCCAGA AGGTCCTCAA CGCGAGCATC GGGGTCAACG GCATCGACGG CTTCGTCACC
AACACGGCGA ATTACACGCC GTTGAAGGAG CCGTTCATGA CCGCCACCCA GCAGGTCGGC
GGACAGCCGG TGGAGTCGGC GAATTTCTAC CAGTGGAATC CTGACATCGA CGAAGCCGAC
TACGCGGCTG ACTTGTACTC GCGGTTCGTC GCCGCTGGCT TCCCAAGCAG CATCGGCATG
CTCATCGACA CCTCACGCAA CGGTTGGGGT GGTCCGAACG AACCAACAGG CCCGAGCACC
GCGACCGATG TCAACACCTT CGTCAACCAG TCGAAGATTG ACCTTCGGCA GCACCGCGGC
CTGTGGTGCA ACCAGAACGG TGCGGGCCTC GGCCAGCCGC CGCAGGCAAG CCCGACGGAC
TTCCCGAACG CGCACCTCGA CGCGTATGTC TGGATCAAGC CGCCGGGTGA GTCGGACGGC
ACAAGCGCTG CGAGCGATCC GACAACTGGC AAGAAGTCGG ACCCCATGTG CGACCCGACG
TACACGACGT CGTACGGGGT ACTGACCAAC GCGTTACCGA ACTCCCCGAT CGCCGGCCAG
TGGTTCCCGG CGCAGTTTGA CCAGCTTGTC GCGAACGCAC GGCCAGCGGT GCCGACGTCG
ACCAGCTCGA GCCCGCCGCC TCCGCCGCCG AGTCCGTCGG CTTCGCCGAG TCCGAGCCCG
AGTCCGAGCC CGAGCAGCTC GCCATCGCCG TCGCCGTCTC CGAGCTCGAG CCCGTCTCCG
TCGCCGAGCC CGAGTCCGAG CCCGAGTAGC TCGCCGTCGC CGTCTCCGAG CTCGAGCCCG
TCTCCGTCGC CGAGCCCGAG TCCGAGCCCG AGTAGCTCGC CGTCGCCGTC TCCGAGCTCG
AGCCCGTCTC CGTCGCCGAG CCCGAGTCCG AGCCCGAGTA GCTCGCCGTC GCCGTCTCCG
ACGTCGTCGC CGGTGTCGGG TGGGCTGAAG GTGCAGTACA AGAACAATGA TTCGGCGCCG
AGTGACAACC AGATCAAACC GGGTCTCCAG TTGGTGAATA CCGGGTCGTC GTCGGTGGAT
TTGTCGACGG TGACGGTGCG GTACTGGTTC ACCCGGGATG GTGGGTCGTC GACACTGGTG
TACAACTGTG ACTGGGCGGC GATGGGGTGT GGGAATATCC GCGCCTCGTT CGGCTCGGTG
AACCCGGCGA CGCCGACGGC GGACACCTAC CTGCAGTTGT CGTTCACTGG TGGAACGTTG
GCCGCTGGTG GGTCGACGGG TGAGATTCAA AACCGGGTGA ATAAGAGTGA CTGGTCGAAT
TTCACCGAGA CCAATGACTA CTCGTATGGG ACGAACACCA CCTTCCAGGA CTGGACGAAG
GTGACGGTGT ACGTCAACGG CGTGTTGGTG TGGGGGACTG AACCGTCCGG CACCAGCCCC
AGCCCCACAC CATCCCCGAG CCCGAGCCCG AGCCCGAGCC CGGGTGGGGA TGTGACGCCG
CCGAGTGTGC CGACCGGCTT GGTGGTGACG GGGGTGAGTG GGTCGTCGGT GTCGTTGGCG
TGGAATGCGT CGACGGATAA CGTGGGGGTG GCGCATTACA ACGTGTACCG CAACGGGGTG
TTGGTGGGCC AGCCGACGGT GACCTCGTTC ACCGACACGG GTTTGGCCGC GGGAACCGCG
TACACCTACA CGGTGGCCGC GGTGGACGCT GCGGGTAACA CCTCCGCCCC ATCCACCCCC
GTCACCGCCA CCACCACGAG TCCCAGCCCC AGCCCCACGC CGACGGGGAC CACGGTCACC
GACTGCACGC CCGGTCCTAA CCAGAATGGT GTGACCAGCG TGCAGGGCGA CGAATACCGG
GTGCAGACCA ATGAGTGGAA TTCGTCGGCC CAGCAGTGCC TCACCATCAA TACCGCGACC
GGTGCCTGGA CGGTGAGCAC TGCGAACTTC AGCGGTGGGA CCGGCGGTGC GCCCGCGACG
TATCCGTCGA TCTACAAGGG CTGCCACTGG GGCAACTGCA CCACGAAGAA CGTCGGGATG
CCGATTCAGA TCAGTCAGAT TGGTTCGGCT GTGACGTCGT GGAGTACGAC GCAGGTGTCG
TCGGGCGCGT ATGACGTGGC CTACGACATT TGGACGAACA GTACCCCAAC GACAACCGGT
CAGCCAAACG GTACCGAAAT CATGATTTGG CTGAATTCGC GTGGTGGGGT GCAGCCGTTC
GGGTCGCAGA CAGCGACGGG TGTGACGGTC GCTGGTCACA CGTGGAATGT CTGGCAGGGT
CAGCAGACCT CGTGGAAGAT TATTTCCTAC GTCCTGACCC CCGGTGCGAC GTCGATCAGT
AATCTGGATT TGAAGGCGAT TTTCGCGGAC GCCGCGGCAC GCGGGTCGCT CAACACCTCC
GATTACCTGC TCGACGTTGA GGCCGGGTTT GAGATCTGGC AAGGTGGTCA GGGCCTGGGC
AGCAACTCGT TCAGCGTCTC CGTGACGAGC GGCACGTCCA GCCCGACACC GAGCCCGAGC
CCGACGCCGA CACCGAGCCC GACGCCGACA CCGTCTCCGA GCCCGACCCC GTCGCCGAGT
CCGACCAGCT CGCCGTCGTC GTCGGGTGTG GCGTGCCGGG CGACGTATGT GGTGAATAGT
GATTGGGGTT CTGGGTTTAC GGCGACGGTG ACGGTGACGA ATACCGGGAG CCGGGCGACG
AACGGGTGGA CGGTGGCGTG GTCGTTTGGT GGGAATCAGA CGGTCACGAA CTACTGGAAC
ACTGCGTTGA CCCAATCAGG TGCATCGGTG ACGGCGACGA ACCTGAGTTA CAACAACGTG
ATCCAACCGG GTCAGTCGAC CACCTTCGGA TTCAACGGAA GTTACTCAGG AACAAACGCC
GCGCCGACGC TCAGCTGCAC AGCCAGCTGA
 
Protein sequence
MPAISKRLRA GVLAGAVSIA ASIVPLAMQH PAIAATHVDN PYAGATFFVN PYWAQEVQSE 
AANQTNATLA AKMRVVSTYS TAVWMDRIAA INGVNGGPGL TTYLDAALSQ QQGTTPEVIE
IVIYDLPGRD CAALASNGEL PATAAGLQTY ETQYIDPIAS ILSNPKYSSL RIVTIIEPDS
LPNAVTNMSI QACATAVPYY EQGIEYALTK LHAIPNVYIY MDAAHSGWLG WPNNASGYVQ
EVQKVLNASI GVNGIDGFVT NTANYTPLKE PFMTATQQVG GQPVESANFY QWNPDIDEAD
YAADLYSRFV AAGFPSSIGM LIDTSRNGWG GPNEPTGPST ATDVNTFVNQ SKIDLRQHRG
LWCNQNGAGL GQPPQASPTD FPNAHLDAYV WIKPPGESDG TSAASDPTTG KKSDPMCDPT
YTTSYGVLTN ALPNSPIAGQ WFPAQFDQLV ANARPAVPTS TSSSPPPPPP SPSASPSPSP
SPSPSSSPSP SPSPSSSPSP SPSPSPSPSS SPSPSPSSSP SPSPSPSPSP SSSPSPSPSS
SPSPSPSPSP SPSSSPSPSP TSSPVSGGLK VQYKNNDSAP SDNQIKPGLQ LVNTGSSSVD
LSTVTVRYWF TRDGGSSTLV YNCDWAAMGC GNIRASFGSV NPATPTADTY LQLSFTGGTL
AAGGSTGEIQ NRVNKSDWSN FTETNDYSYG TNTTFQDWTK VTVYVNGVLV WGTEPSGTSP
SPTPSPSPSP SPSPGGDVTP PSVPTGLVVT GVSGSSVSLA WNASTDNVGV AHYNVYRNGV
LVGQPTVTSF TDTGLAAGTA YTYTVAAVDA AGNTSAPSTP VTATTTSPSP SPTPTGTTVT
DCTPGPNQNG VTSVQGDEYR VQTNEWNSSA QQCLTINTAT GAWTVSTANF SGGTGGAPAT
YPSIYKGCHW GNCTTKNVGM PIQISQIGSA VTSWSTTQVS SGAYDVAYDI WTNSTPTTTG
QPNGTEIMIW LNSRGGVQPF GSQTATGVTV AGHTWNVWQG QQTSWKIISY VLTPGATSIS
NLDLKAIFAD AAARGSLNTS DYLLDVEAGF EIWQGGQGLG SNSFSVSVTS GTSSPTPSPS
PTPTPSPTPT PSPSPTPSPS PTSSPSSSGV ACRATYVVNS DWGSGFTATV TVTNTGSRAT
NGWTVAWSFG GNQTVTNYWN TALTQSGASV TATNLSYNNV IQPGQSTTFG FNGSYSGTNA
APTLSCTAS