Gene Acel_0617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0617 
Symbol 
ID4486399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp661169 
End bp664534 
Gene Length3366 bp 
Protein Length1121 aa 
Translation table11 
GC content64% 
IMG OID639729384 
Productglycoside hydrolase family protein 
Protein accessionYP_872376 
Protein GI117927825 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0413787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00805779 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAGGAT TACGACGGCG ACTCCGCGCC GGTATCGTCT CGGCGGCGGC GTTGGGGTCG 
CTGGTTAGCG GGCTCGTTGC CGTCGCACCA GTCGCGCACG CGGCGGTGAC TCTCAAAGCG
CAGTATAAGA ACAATGATTC GGCGCCGAGT GACAACCAGA TCAAACCGGG TCTCCAGTTG
GTGAATACCG GGTCGTCGTC GGTGGATTTG TCGACGGTGA CGGTGCGGTA CTGGTTCACC
CGGGATGGTG GGTCGTCGAC ACTGGTGTAC AACTGTGACT GGGCGGCGAT GGGGTGTGGG
AATATCCGCG CCTCGTTCGG CTCGGTGAAC CCGGCGACGC CGACGGCGGA CACCTACCTG
CAGTTGTCGT TCACTGGTGG AACGTTGGCC GCTGGTGGGT CGACGGGTGA GATTCAAAAC
CGGGTGAATA AGAGTGACTG GTCGAACTTT GATGAGACCA ATGACTACTC GTATGGGACG
AACACCACCT TCCAGGACTG GACGAAGGTG ACGGTGTACG TCAACGGCGT GTTGGTCTGG
GGGACCGAAC CGTCCGGAGC GACGGCGTCT CCATCCGCGT CGGCGACGCC CAGCCCGTCC
AGTTCACCGA CCACGAGTCC GAGTTCGTCC CCGTCGCCGA GCAGCAGCCC GACGCCGACA
CCGAGCAGCT CGTCGCCGCC CCCGTCGTCC AACGACCCGT ACATCCAGCG GTTCCTCACG
ATGTACAACA AGATTCACGA CCCAGCGAAC GGCTACTTCA GCCCGCAGGG AATTCCCTAC
CACTCGGTAG AAACGCTCAT CGTTGAGGCA CCGGACTACG GGCACGAGAC AACTTCGGAG
GCGTACAGCT TCTGGCTCTG GCTCGAAGCG ACGTACGGCG CAGTGACCGG CAACTGGACG
CCGTTCAACA ACGCCTGGAC GACGATGGAA ACGTACATGA TCCCGCAGCA CGCGGACCAG
CCGAACAACG CGTCGTACAA CCCCAACAGC CCGGCGTCGT ACGCTCCGGA AGAGCCGCTG
CCCAGCATGT ACCCGGTTGC CATCGACAGC AGCGTGCCGG TTGGGCACGA CCCGCTCGCC
GCCGAATTGC AGTCGACGTA CGGCACTCCG GACATTTACG GCATGCACTG GCTGGCCGAC
GTTGACAACA TCTACGGATA CGGCGACAGC CCCGGCGGTG GTTGCGAACT CGGTCCTTCC
GCTAAGGGCG TCTCCTACAT CAACACATTC CAGCGCGGCT CGCAGGAGTC CGTCTGGGAG
ACGGTCACCC AGCCGACGTG CGACAACGGC AAGTACGGTG GGGCGCACGG CTACGTCGAC
CTGTTCATCC AGGGTTCGAC GCCGCCGCAG TGGAAGTACA CCGATGCCCC GGACGCCGAC
GCCCGTGCCG TCCAGGCTGC GTACTGGGCC TACACCTGGG CATCGGCGCA GGGCAAGGCA
AGCGCGATTG CCCCGACGAT CGCCAAGGCG GCCAAACTCG GCGACTACCT GCGGTACTCG
CTCTTTGACA AGTACTTCAA GCAGGTCGGC AACTGCTACC CGGCCAGCTC CTGCCCTGGA
GCAACCGGAC GCCAGAGCGA GACCTACCTG ATCGGCTGGT ACTACGCCTG GGGCGGCTCA
AGCCAAGGCT GGGCCTGGCG CATTGGTGAC GGCGCCGCGC ACTTCGGCTA CCAGAATCCG
CTTGCCGCGT GGGCGATGTC GAACGTGACA CCGCTCATTC CGCTCTCGCC CACGGCAAAG
AGCGACTGGG CGGCGAGCTT GCAGCGCCAG CTGGAGTTCT ACCAGTGGTT GCAATCCGCG
GAAGGAGCCA TTGCGGGCGG CGCCACCAAC AGCTGGAACG GCAATTACGG GACCCCGCCG
GCCGGAGACT CGACCTTCTA CGGCATGGCG TACGACTGGG AGCCGGTCTA CCACGACCCG
CCGAGCAACA ACTGGTTCGG CTTCCAGGCG TGGTCCATGG AACGGGTTGC CGAGTACTAC
TACGTCACCG GCGACCCGAA GGCCAAGGCG CTGCTCGACA AGTGGGTCGC ATGGGTGAAG
CCGAATGTCA CCACCGGTGC CTCATGGTCG ATTCCGTCGA ATTTGTCCTG GAGCGGCCAA
CCGGATACCT GGAATCCGAG CAACCCAGGA ACGAATGCCA ACCTGCACGT GACCATCACG
TCGTCCGGGC AGGACGTCGG TGTTGCCGCG GCGCTCGCGA AGACACTCGA GTACTACGCG
GCAAAATCCG GCGATACGGC CTCGCGCGAC CTCGCGAAGG GATTGCTCGA CTCCATCTGG
AACAACGACC AGGACAGCCT CGGTGTGAGC ACACCGGAGA CGCGGACCGA CTACTCTCGG
TTCACTCAGG TGTACGACCC GACGACTGGT GACGGCCTCT ACATCCCGTC GGGTTGGACG
GGGACCATGC CCAACGGTGA CCAAATCAAG CCGGGTGCGA CCTTCCTGAG CATCCGGTCC
TGGTACACCA AGGATCCGCA GTGGTCGAAG GTGCAGGCGT ACCTCAACGG CGGGCCTGCT
CCGACGTTCA ACTACCACCG GTTCTGGGCG GAGTCCGACT TCGCGATGGC GAACGCCGAT
TTTGGCATGC TCTTCCCATC CGGGTCGCCC AGCCCGACCC CGAGCCCGAC TCCGACGTCG
TCCCCGAGCC CGACTCCGAG CAGCTCGCCG ACGCCGTCGC CCAGCCCGTC ACCGACCGGC
GACACCACGC CGCCGAGCGT GCCGACGGGT CTTCAGGTCA CCGGGACAAC GACGTCGTCC
GTGTCGCTCA GCTGGACCGC GTCCACCGAC AACGTCGGCG TCGCGCACTA CAACGTGTAC
CGAAACGGCA CGCTGGTGGG TCAGCCGACA GCGACGTCGT TCACGGACAC CGGCCTGGCT
GCTGGCACGT CGTACACGTA CACAGTGGCG GCCGTTGATG CGGCCGGTAA CACGTCGGCG
CAGAGCTCGC CGGTGACAGC GACGACGGCA TCGCCGTCGC CGAGCCCGTC GCCGAGCCCG
ACTCCGACGT CGTCCCCGAG CCCAACGCCG TCGCCGACAC CGTCACCGAC GTCCACCAGC
GGCGCATCGT GCACTGCTAC CTACGTTGTC AATAGCGACT GGGGTAGCGG CTTCACGACA
ACCGTGACCG TGACGAACAC CGGCACCAGG GCCACCAGTG GCTGGACGGT CACGTGGAGC
TTTGCCGGTA ATCAGACGGT CACCAACTAC TGGAACACCG CGCTGACGCA ATCCGGAAAG
TCGGTGACCG CAAAGAACCT GAGTTACAAC AACGTCATCC AACCTGGTCA GTCGACGACC
TTTGGATTCA ACGGAAGTTA CTCAGGAACA AACACCGCGC CGACGCTCAG CTGCACGGCA
AGCTGA
 
Protein sequence
MPGLRRRLRA GIVSAAALGS LVSGLVAVAP VAHAAVTLKA QYKNNDSAPS DNQIKPGLQL 
VNTGSSSVDL STVTVRYWFT RDGGSSTLVY NCDWAAMGCG NIRASFGSVN PATPTADTYL
QLSFTGGTLA AGGSTGEIQN RVNKSDWSNF DETNDYSYGT NTTFQDWTKV TVYVNGVLVW
GTEPSGATAS PSASATPSPS SSPTTSPSSS PSPSSSPTPT PSSSSPPPSS NDPYIQRFLT
MYNKIHDPAN GYFSPQGIPY HSVETLIVEA PDYGHETTSE AYSFWLWLEA TYGAVTGNWT
PFNNAWTTME TYMIPQHADQ PNNASYNPNS PASYAPEEPL PSMYPVAIDS SVPVGHDPLA
AELQSTYGTP DIYGMHWLAD VDNIYGYGDS PGGGCELGPS AKGVSYINTF QRGSQESVWE
TVTQPTCDNG KYGGAHGYVD LFIQGSTPPQ WKYTDAPDAD ARAVQAAYWA YTWASAQGKA
SAIAPTIAKA AKLGDYLRYS LFDKYFKQVG NCYPASSCPG ATGRQSETYL IGWYYAWGGS
SQGWAWRIGD GAAHFGYQNP LAAWAMSNVT PLIPLSPTAK SDWAASLQRQ LEFYQWLQSA
EGAIAGGATN SWNGNYGTPP AGDSTFYGMA YDWEPVYHDP PSNNWFGFQA WSMERVAEYY
YVTGDPKAKA LLDKWVAWVK PNVTTGASWS IPSNLSWSGQ PDTWNPSNPG TNANLHVTIT
SSGQDVGVAA ALAKTLEYYA AKSGDTASRD LAKGLLDSIW NNDQDSLGVS TPETRTDYSR
FTQVYDPTTG DGLYIPSGWT GTMPNGDQIK PGATFLSIRS WYTKDPQWSK VQAYLNGGPA
PTFNYHRFWA ESDFAMANAD FGMLFPSGSP SPTPSPTPTS SPSPTPSSSP TPSPSPSPTG
DTTPPSVPTG LQVTGTTTSS VSLSWTASTD NVGVAHYNVY RNGTLVGQPT ATSFTDTGLA
AGTSYTYTVA AVDAAGNTSA QSSPVTATTA SPSPSPSPSP TPTSSPSPTP SPTPSPTSTS
GASCTATYVV NSDWGSGFTT TVTVTNTGTR ATSGWTVTWS FAGNQTVTNY WNTALTQSGK
SVTAKNLSYN NVIQPGQSTT FGFNGSYSGT NTAPTLSCTA S