Gene Information |
Locus tag | Acel_0617 |
Symbol | |
ID | 4486399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 661169 |
End bp | 664534 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639729384 |
Product | glycoside hydrolase family protein |
Protein accession | YP_872376 |
Protein GI | 117927825 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
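The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 661169
END_BP = 664534
GENE_LENGTH_BP = 3366
PROTEIN_LENGTH_AA = 1121


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both relations hold for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 664534 - 661169 + 1 = 3366 bp, and 3366 / 3 - 1 = 1121 aa, matching the listed Gene Length and Protein Length.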
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0413787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00805779 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAGGAT TACGACGGCG ACTCCGCGCC GGTATCGTCT CGGCGGCGGC GTTGGGGTCG CTGGTTAGCG GGCTCGTTGC CGTCGCACCA GTCGCGCACG CGGCGGTGAC TCTCAAAGCG CAGTATAAGA ACAATGATTC GGCGCCGAGT GACAACCAGA TCAAACCGGG TCTCCAGTTG GTGAATACCG GGTCGTCGTC GGTGGATTTG TCGACGGTGA CGGTGCGGTA CTGGTTCACC CGGGATGGTG GGTCGTCGAC ACTGGTGTAC AACTGTGACT GGGCGGCGAT GGGGTGTGGG AATATCCGCG CCTCGTTCGG CTCGGTGAAC CCGGCGACGC CGACGGCGGA CACCTACCTG CAGTTGTCGT TCACTGGTGG AACGTTGGCC GCTGGTGGGT CGACGGGTGA GATTCAAAAC CGGGTGAATA AGAGTGACTG GTCGAACTTT GATGAGACCA ATGACTACTC GTATGGGACG AACACCACCT TCCAGGACTG GACGAAGGTG ACGGTGTACG TCAACGGCGT GTTGGTCTGG GGGACCGAAC CGTCCGGAGC GACGGCGTCT CCATCCGCGT CGGCGACGCC CAGCCCGTCC AGTTCACCGA CCACGAGTCC GAGTTCGTCC CCGTCGCCGA GCAGCAGCCC GACGCCGACA CCGAGCAGCT CGTCGCCGCC CCCGTCGTCC AACGACCCGT ACATCCAGCG GTTCCTCACG ATGTACAACA AGATTCACGA CCCAGCGAAC GGCTACTTCA GCCCGCAGGG AATTCCCTAC CACTCGGTAG AAACGCTCAT CGTTGAGGCA CCGGACTACG GGCACGAGAC AACTTCGGAG GCGTACAGCT TCTGGCTCTG GCTCGAAGCG ACGTACGGCG CAGTGACCGG CAACTGGACG CCGTTCAACA ACGCCTGGAC GACGATGGAA ACGTACATGA TCCCGCAGCA CGCGGACCAG CCGAACAACG CGTCGTACAA CCCCAACAGC CCGGCGTCGT ACGCTCCGGA AGAGCCGCTG CCCAGCATGT ACCCGGTTGC CATCGACAGC AGCGTGCCGG TTGGGCACGA CCCGCTCGCC GCCGAATTGC AGTCGACGTA CGGCACTCCG GACATTTACG GCATGCACTG GCTGGCCGAC GTTGACAACA TCTACGGATA CGGCGACAGC CCCGGCGGTG GTTGCGAACT CGGTCCTTCC GCTAAGGGCG TCTCCTACAT CAACACATTC CAGCGCGGCT CGCAGGAGTC CGTCTGGGAG ACGGTCACCC AGCCGACGTG CGACAACGGC AAGTACGGTG GGGCGCACGG CTACGTCGAC CTGTTCATCC AGGGTTCGAC GCCGCCGCAG TGGAAGTACA CCGATGCCCC GGACGCCGAC GCCCGTGCCG TCCAGGCTGC GTACTGGGCC TACACCTGGG CATCGGCGCA GGGCAAGGCA AGCGCGATTG CCCCGACGAT CGCCAAGGCG GCCAAACTCG GCGACTACCT GCGGTACTCG CTCTTTGACA AGTACTTCAA GCAGGTCGGC AACTGCTACC CGGCCAGCTC CTGCCCTGGA GCAACCGGAC GCCAGAGCGA GACCTACCTG ATCGGCTGGT ACTACGCCTG GGGCGGCTCA AGCCAAGGCT GGGCCTGGCG CATTGGTGAC GGCGCCGCGC ACTTCGGCTA CCAGAATCCG CTTGCCGCGT GGGCGATGTC GAACGTGACA CCGCTCATTC CGCTCTCGCC CACGGCAAAG AGCGACTGGG CGGCGAGCTT GCAGCGCCAG CTGGAGTTCT ACCAGTGGTT GCAATCCGCG 
GAAGGAGCCA TTGCGGGCGG CGCCACCAAC AGCTGGAACG GCAATTACGG GACCCCGCCG GCCGGAGACT CGACCTTCTA CGGCATGGCG TACGACTGGG AGCCGGTCTA CCACGACCCG CCGAGCAACA ACTGGTTCGG CTTCCAGGCG TGGTCCATGG AACGGGTTGC CGAGTACTAC TACGTCACCG GCGACCCGAA GGCCAAGGCG CTGCTCGACA AGTGGGTCGC ATGGGTGAAG CCGAATGTCA CCACCGGTGC CTCATGGTCG ATTCCGTCGA ATTTGTCCTG GAGCGGCCAA CCGGATACCT GGAATCCGAG CAACCCAGGA ACGAATGCCA ACCTGCACGT GACCATCACG TCGTCCGGGC AGGACGTCGG TGTTGCCGCG GCGCTCGCGA AGACACTCGA GTACTACGCG GCAAAATCCG GCGATACGGC CTCGCGCGAC CTCGCGAAGG GATTGCTCGA CTCCATCTGG AACAACGACC AGGACAGCCT CGGTGTGAGC ACACCGGAGA CGCGGACCGA CTACTCTCGG TTCACTCAGG TGTACGACCC GACGACTGGT GACGGCCTCT ACATCCCGTC GGGTTGGACG GGGACCATGC CCAACGGTGA CCAAATCAAG CCGGGTGCGA CCTTCCTGAG CATCCGGTCC TGGTACACCA AGGATCCGCA GTGGTCGAAG GTGCAGGCGT ACCTCAACGG CGGGCCTGCT CCGACGTTCA ACTACCACCG GTTCTGGGCG GAGTCCGACT TCGCGATGGC GAACGCCGAT TTTGGCATGC TCTTCCCATC CGGGTCGCCC AGCCCGACCC CGAGCCCGAC TCCGACGTCG TCCCCGAGCC CGACTCCGAG CAGCTCGCCG ACGCCGTCGC CCAGCCCGTC ACCGACCGGC GACACCACGC CGCCGAGCGT GCCGACGGGT CTTCAGGTCA CCGGGACAAC GACGTCGTCC GTGTCGCTCA GCTGGACCGC GTCCACCGAC AACGTCGGCG TCGCGCACTA CAACGTGTAC CGAAACGGCA CGCTGGTGGG TCAGCCGACA GCGACGTCGT TCACGGACAC CGGCCTGGCT GCTGGCACGT CGTACACGTA CACAGTGGCG GCCGTTGATG CGGCCGGTAA CACGTCGGCG CAGAGCTCGC CGGTGACAGC GACGACGGCA TCGCCGTCGC CGAGCCCGTC GCCGAGCCCG ACTCCGACGT CGTCCCCGAG CCCAACGCCG TCGCCGACAC CGTCACCGAC GTCCACCAGC GGCGCATCGT GCACTGCTAC CTACGTTGTC AATAGCGACT GGGGTAGCGG CTTCACGACA ACCGTGACCG TGACGAACAC CGGCACCAGG GCCACCAGTG GCTGGACGGT CACGTGGAGC TTTGCCGGTA ATCAGACGGT CACCAACTAC TGGAACACCG CGCTGACGCA ATCCGGAAAG TCGGTGACCG CAAAGAACCT GAGTTACAAC AACGTCATCC AACCTGGTCA GTCGACGACC TTTGGATTCA ACGGAAGTTA CTCAGGAACA AACACCGCGC CGACGCTCAG CTGCACGGCA AGCTGA
Protein sequence | MPGLRRRLRA GIVSAAALGS LVSGLVAVAP VAHAAVTLKA QYKNNDSAPS DNQIKPGLQL VNTGSSSVDL STVTVRYWFT RDGGSSTLVY NCDWAAMGCG NIRASFGSVN PATPTADTYL QLSFTGGTLA AGGSTGEIQN RVNKSDWSNF DETNDYSYGT NTTFQDWTKV TVYVNGVLVW GTEPSGATAS PSASATPSPS SSPTTSPSSS PSPSSSPTPT PSSSSPPPSS NDPYIQRFLT MYNKIHDPAN GYFSPQGIPY HSVETLIVEA PDYGHETTSE AYSFWLWLEA TYGAVTGNWT PFNNAWTTME TYMIPQHADQ PNNASYNPNS PASYAPEEPL PSMYPVAIDS SVPVGHDPLA AELQSTYGTP DIYGMHWLAD VDNIYGYGDS PGGGCELGPS AKGVSYINTF QRGSQESVWE TVTQPTCDNG KYGGAHGYVD LFIQGSTPPQ WKYTDAPDAD ARAVQAAYWA YTWASAQGKA SAIAPTIAKA AKLGDYLRYS LFDKYFKQVG NCYPASSCPG ATGRQSETYL IGWYYAWGGS SQGWAWRIGD GAAHFGYQNP LAAWAMSNVT PLIPLSPTAK SDWAASLQRQ LEFYQWLQSA EGAIAGGATN SWNGNYGTPP AGDSTFYGMA YDWEPVYHDP PSNNWFGFQA WSMERVAEYY YVTGDPKAKA LLDKWVAWVK PNVTTGASWS IPSNLSWSGQ PDTWNPSNPG TNANLHVTIT SSGQDVGVAA ALAKTLEYYA AKSGDTASRD LAKGLLDSIW NNDQDSLGVS TPETRTDYSR FTQVYDPTTG DGLYIPSGWT GTMPNGDQIK PGATFLSIRS WYTKDPQWSK VQAYLNGGPA PTFNYHRFWA ESDFAMANAD FGMLFPSGSP SPTPSPTPTS SPSPTPSSSP TPSPSPSPTG DTTPPSVPTG LQVTGTTTSS VSLSWTASTD NVGVAHYNVY RNGTLVGQPT ATSFTDTGLA AGTSYTYTVA AVDAAGNTSA QSSPVTATTA SPSPSPSPSP TPTSSPSPTP SPTPSPTSTS GASCTATYVV NSDWGSGFTT TVTVTNTGTR ATSGWTVTWS FAGNQTVTNY WNTALTQSGK SVTAKNLSYN NVIQPGQSTT FGFNGSYSGT NTAPTLSCTA S
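The sequences above are printed in space-separated blocks of ten residues. A minimal sketch of how such text might be normalised before analysis, together with a simple GC-fraction calculation, is shown below. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not a check of the 64% GC content listed for the whole gene; how the database computes that figure is not stated in the record.

```python
def normalise(seq_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, copied from the record.
prefix = normalise("ATGCCAGGAT TACGACGGCG")

print(len(prefix))            # 20
print(prefix.startswith("ATG"))  # True: the gene begins with an ATG start codon
print(gc_fraction(prefix))    # 0.6 for this 20-base prefix only
```

To analyse the full gene, the same `normalise` call can be applied to the complete Gene sequence field.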
| |