Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0619 |
Symbol | |
ID | 4486401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 669117 |
End bp | 670328 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639729386 |
Product | cellulose-binding family II protein |
Protein accession | YP_872378 |
Protein GI | 117927827 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00682246 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTAGTGC TCAGGGCGCC GCGATGGCGA GTGGTCATCA CGGCAGCCGC AGTGGGGATC GCGGCCGCCG GAATACTTGT GACAAGTCAC TCCGCCTATG GTGCAACGAC ATCAACGTGT TCACCTACCG CCGTGGTGAG TGTGGCCGGT GACGAATACC GGGTGCAGGC CAATGAGTGG AATTCGTCGG CCCAACAATG CCTCACCATT GACACGTCAA CGGGCGCCTG GTCAGTCAGC ACGGCGAATT TCAATCTTGC GACCAACGGA GCGCCGGCGA CGTATCCGTC GATTTACAAG GGTTGCCACT GGGGCAACTG CACGACAGCC AATGTCGGGA TGCCGATTCA GGTCAGCAAG ATTGGTTCGG CTGTGACGTC GTGGAGTACG ACGCAGGTGT CGTCGGGCGC GTATGACGTG GCCTACGACA TTTGGACGAA CAGCACCCCG ACGACCTCTG GTCAGCCGAA TGGCACAGAG GTGATGATTT GGTTGAATTC GCGGGGTGGG GTGCAGCCGT TCGGGTCGCA GACGGCGACG GGTGTGACGG TCGCTGGTCA CACGTGGAAC GTCTGGCAGG GCCAGCAGAC GTCCTGGAAG ATTATTTCCT ACGTCCTGAC CCCTGGTGCG ACGTCGATCA GCAATCTGGA TTTGAAGGCG ATTCTCGCGG ACGCTGCTGC GCGCGGCTCG CTCAACACCT CCGATTACCT CATCGATGTT GAGGCCGGGT TTGAGATCTG GCAAGGTGGT CAGGGCCTGG GTAGTAACTC GTTCAGCGTC TCCGTGACGA GCGGCACGTC CAGCCCGACA CCGACACCGT CTCCAAGCCC ATCCCCGAGC CCCGCGCCCA GCCCGTCCCC GAGCCCGAGC CCAACGCCCA CGTCCAGCCC GACATCGTCG TCTGGTGGTG TTGGGTGCAA GGCTGCCTAT GCGGTTAGTA ATGATTGGGG TTCTGGGTTT ACGGCGACGG TGACGGTGAC AAATACCGGG AGCCGGGCGA CGAGCGGGTG GACGGTGGCG TGGTCGTTTG GTGGGAATCA GACGGTCACG AACTACTGGA ACACTGCGTT GACCCAATCA GGTAAGTCGG TGACGGCGAC GAACCTGAGC TACAACAACG TGATCCAACC GGGTCAGTCG ACCACCTTCG GGTTCAACGC CAACTACACC GGCAGTAACA CCCCACCCAC ACTCACCTGC ACCGCCAGCT GA
|
Protein sequence | MLVLRAPRWR VVITAAAVGI AAAGILVTSH SAYGATTSTC SPTAVVSVAG DEYRVQANEW NSSAQQCLTI DTSTGAWSVS TANFNLATNG APATYPSIYK GCHWGNCTTA NVGMPIQVSK IGSAVTSWST TQVSSGAYDV AYDIWTNSTP TTSGQPNGTE VMIWLNSRGG VQPFGSQTAT GVTVAGHTWN VWQGQQTSWK IISYVLTPGA TSISNLDLKA ILADAAARGS LNTSDYLIDV EAGFEIWQGG QGLGSNSFSV SVTSGTSSPT PTPSPSPSPS PAPSPSPSPS PTPTSSPTSS SGGVGCKAAY AVSNDWGSGF TATVTVTNTG SRATSGWTVA WSFGGNQTVT NYWNTALTQS GKSVTATNLS YNNVIQPGQS TTFGFNANYT GSNTPPTLTC TAS
|
| |