Gene Acel_0135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0135 
Symbol 
ID4485572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp139082 
End bp140491 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content65% 
IMG OID639728897 
Productcellulase 
Protein accessionYP_871896 
Protein GI117927345 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACGT ACCCAATCCG GTCAGTGTCC GGCGGTGTCG CGCTCGCCGC CTGCGCCGTC 
CTCACGATGA CCACTGCGGC GGCTGCAACA CCGATCCATG ATGCCTCATC GCCGCACACC
ATTCCGCCGC ACGCACGGCT CTATACCCCT CCGCCAGACA AAGGTGCGAT CAAGCAAATC
ACCGACCTGC TGAAGGCGCG CGACGTCCGC GACGCCCGCC TGATTGCGGA AATGATTTCC
ACTCCGCAGG CGGTCTGGTT CACCGGCGGC ACGCCCGATC AGGTACGCCG CGACGTCCAT
CGGGTCGTCA CCAAGGCGGC GGCGCATCAC GCCATTCCCG TCCTCGTTGC GTACAACATT
CCATTCCGCG ACTGCTCGCA ATATTCCGCC GGCGGCGCGG TGGACACGGC CGCGTACGAA
GCATGGATCG ACGGATTCGC TGCTGGAATC GGCGACAAAA GAGCCATCGT GCTCCTCGAG
CCGGACAGCC TGGGCATCAT TCCGTACAAC ACGGATATCA ACGGAAACGC CGAGTGGTGC
AAACCGGATC TCAGCGGTAC GGGATTGACG CCCGACGAGG CGAACCAAGC ACGCTACGAC
CAGCTGAACT ACGCAGTCGA CGCACTCGAG GCGCACCGCA ATGTGAGCGT CTACCTCGAC
GGCACGCACA GCGGATGGCT CGGGGTCGGA GATATTGCGC AGCGGCTCGT CCGAGCCGGT
GTGCAACGGG CACAGGGCTT TTTCGTCAAC GTGTCCAATT ACCAGACGAC CGAGCGGCAA
ATCAAATACG GCACCTGGAT TTCCGAGTGC ATCGCCTTTG CGAACGATCC GGAGGAAGGC
GGCTGGCGAC TCGGACACTA CAGCTGGTGC GCCAGCCAGT ACTACCCGGC GAATCCGAAC
GACTTCAGCA CGTGGGTTCA GACCGACCAG TGGTATGCGA GCAATTTAGG AACGGCGGTT
CCGACGACGC ACTTCGTCAT CGACACCAGC CGTAACGGGC GCGGACCGAA CGACATGACG
GCGTACGCCG CCGCGCCGTA CAACCAACCG GCCAGCGTCA TTTCGGCGCT CCAAGGCGGT
AGCTGGTGCA ATCCGCCGGG CCGGGGACTT GGGTTGCGGC CCACGGTGAA TACCGGCGTA
CCGCTGCTCG ATGCCTACCT CTGGGTGAAG ATTCCCGGCG AATCGGATGG GCAGTGCGAT
GCTGCCGGCG GCGCCCGGGC CTGGGACTAC TCGGCGTACA CCGAACCGGG TTGGCCGACC
GATCCCAGCC AGCAGGCGCT CTTCGACCCG CTCTGGGGCT TGTACGACCC GCCCGCCGGG
CAGTGGTTCC CGCAGCAGGC CCTTCAGCTT GCGCAGCTCG CTGTCCCGCC GTTGCAGCCG
CAGTGGCCCG TCCCGCCGGT GCATCACTGA
 
Protein sequence
MGTYPIRSVS GGVALAACAV LTMTTAAAAT PIHDASSPHT IPPHARLYTP PPDKGAIKQI 
TDLLKARDVR DARLIAEMIS TPQAVWFTGG TPDQVRRDVH RVVTKAAAHH AIPVLVAYNI
PFRDCSQYSA GGAVDTAAYE AWIDGFAAGI GDKRAIVLLE PDSLGIIPYN TDINGNAEWC
KPDLSGTGLT PDEANQARYD QLNYAVDALE AHRNVSVYLD GTHSGWLGVG DIAQRLVRAG
VQRAQGFFVN VSNYQTTERQ IKYGTWISEC IAFANDPEEG GWRLGHYSWC ASQYYPANPN
DFSTWVQTDQ WYASNLGTAV PTTHFVIDTS RNGRGPNDMT AYAAAPYNQP ASVISALQGG
SWCNPPGRGL GLRPTVNTGV PLLDAYLWVK IPGESDGQCD AAGGARAWDY SAYTEPGWPT
DPSQQALFDP LWGLYDPPAG QWFPQQALQL AQLAVPPLQP QWPVPPVHH