Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0135 |
Symbol | |
ID | 4485572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 139082 |
End bp | 140491 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639728897 |
Product | cellulase |
Protein accession | YP_871896 |
Protein GI | 117927345 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACGT ACCCAATCCG GTCAGTGTCC GGCGGTGTCG CGCTCGCCGC CTGCGCCGTC CTCACGATGA CCACTGCGGC GGCTGCAACA CCGATCCATG ATGCCTCATC GCCGCACACC ATTCCGCCGC ACGCACGGCT CTATACCCCT CCGCCAGACA AAGGTGCGAT CAAGCAAATC ACCGACCTGC TGAAGGCGCG CGACGTCCGC GACGCCCGCC TGATTGCGGA AATGATTTCC ACTCCGCAGG CGGTCTGGTT CACCGGCGGC ACGCCCGATC AGGTACGCCG CGACGTCCAT CGGGTCGTCA CCAAGGCGGC GGCGCATCAC GCCATTCCCG TCCTCGTTGC GTACAACATT CCATTCCGCG ACTGCTCGCA ATATTCCGCC GGCGGCGCGG TGGACACGGC CGCGTACGAA GCATGGATCG ACGGATTCGC TGCTGGAATC GGCGACAAAA GAGCCATCGT GCTCCTCGAG CCGGACAGCC TGGGCATCAT TCCGTACAAC ACGGATATCA ACGGAAACGC CGAGTGGTGC AAACCGGATC TCAGCGGTAC GGGATTGACG CCCGACGAGG CGAACCAAGC ACGCTACGAC CAGCTGAACT ACGCAGTCGA CGCACTCGAG GCGCACCGCA ATGTGAGCGT CTACCTCGAC GGCACGCACA GCGGATGGCT CGGGGTCGGA GATATTGCGC AGCGGCTCGT CCGAGCCGGT GTGCAACGGG CACAGGGCTT TTTCGTCAAC GTGTCCAATT ACCAGACGAC CGAGCGGCAA ATCAAATACG GCACCTGGAT TTCCGAGTGC ATCGCCTTTG CGAACGATCC GGAGGAAGGC GGCTGGCGAC TCGGACACTA CAGCTGGTGC GCCAGCCAGT ACTACCCGGC GAATCCGAAC GACTTCAGCA CGTGGGTTCA GACCGACCAG TGGTATGCGA GCAATTTAGG AACGGCGGTT CCGACGACGC ACTTCGTCAT CGACACCAGC CGTAACGGGC GCGGACCGAA CGACATGACG GCGTACGCCG CCGCGCCGTA CAACCAACCG GCCAGCGTCA TTTCGGCGCT CCAAGGCGGT AGCTGGTGCA ATCCGCCGGG CCGGGGACTT GGGTTGCGGC CCACGGTGAA TACCGGCGTA CCGCTGCTCG ATGCCTACCT CTGGGTGAAG ATTCCCGGCG AATCGGATGG GCAGTGCGAT GCTGCCGGCG GCGCCCGGGC CTGGGACTAC TCGGCGTACA CCGAACCGGG TTGGCCGACC GATCCCAGCC AGCAGGCGCT CTTCGACCCG CTCTGGGGCT TGTACGACCC GCCCGCCGGG CAGTGGTTCC CGCAGCAGGC CCTTCAGCTT GCGCAGCTCG CTGTCCCGCC GTTGCAGCCG CAGTGGCCCG TCCCGCCGGT GCATCACTGA
|
Protein sequence | MGTYPIRSVS GGVALAACAV LTMTTAAAAT PIHDASSPHT IPPHARLYTP PPDKGAIKQI TDLLKARDVR DARLIAEMIS TPQAVWFTGG TPDQVRRDVH RVVTKAAAHH AIPVLVAYNI PFRDCSQYSA GGAVDTAAYE AWIDGFAAGI GDKRAIVLLE PDSLGIIPYN TDINGNAEWC KPDLSGTGLT PDEANQARYD QLNYAVDALE AHRNVSVYLD GTHSGWLGVG DIAQRLVRAG VQRAQGFFVN VSNYQTTERQ IKYGTWISEC IAFANDPEEG GWRLGHYSWC ASQYYPANPN DFSTWVQTDQ WYASNLGTAV PTTHFVIDTS RNGRGPNDMT AYAAAPYNQP ASVISALQGG SWCNPPGRGL GLRPTVNTGV PLLDAYLWVK IPGESDGQCD AAGGARAWDY SAYTEPGWPT DPSQQALFDP LWGLYDPPAG QWFPQQALQL AQLAVPPLQP QWPVPPVHH
|
| |