Gene Acel_0603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0603 
Symbol 
ID4485525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp639808 
End bp641226 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content64% 
IMG OID639729370 
Productglycoside hydrolase family protein 
Protein accessionYP_872362 
Protein GI117927811 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.238448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0453952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTCGA GATTCCGTCA CACCCGCTGG ATTACCCGTA CGCAGCGCCA CACCCGCTGG 
AGTACCCGAA CGCACCGCCG CACCGGGCGG CTTGCCCCGG CGGCGGCCGT CCTCGTCGCC
GCGGTCGTCG TCCCATCAAG CGCGGCCGCC CACGAACCGC GCAGCGACGG CCTCGTCCGC
GCCGCGTACT ACACACAGTG GTCGGTCTAC AGCGGTTTCA CCGTAGCCGG CGTGGTCGCA
AACGGTGACG CCGGCCGCCT CAACCAAATC AACTACGCAT TCATCAATGT CGCGCCGAAC
CCTGCCGTGT CCGGCTCACC CATTGAATGC CTCAGCGGCG ACCCGTGGGC CGATTACCAA
ATGCCGTTCG GCGGCGCCAC CGGACGCCCA AGTGTCGACG GCACCGCCGA CGACTGGTCC
GGACTTCAGG GAAATTTCAA GCAGCTGAGG GAATTGAAGA CGCGTTACCC GAATCTTCGT
ATCATCGCCT CCCTCGGCGG ATACTCGTGG TCGGGATACT TCTCCGACGC CGCCCTGACA
GCTGAGTCCC GCGCACATCT TGTCGCCTCA TGCATCGACC TGCTCATCAA CGGCAACCTT
CCCGGTCTGC CGCCCGGTGC CGCTAAGGGC ATCTTCGACG GCATTGACGT GGACTGGGAA
TATCCCGGTG CTGCCGGTGC CACGCTCACC AACGGAAACG GTAATCCCAC CGCACGGCCT
GAGGACACCC GGAACTTCAC CTTGCTGCTT GCCGAATTTC GCCGCCAACT GGACAGCGCA
GCCCGCGCGA ATCACCACCG CCGGTACCTG CTCACCGCCG CACTGTCAGC GAACCCGACG
AAGATTGCCC TCCTCGAGGT GCCGCAGATT TCGCGGCTTC TCGACCAGAT GGATGTCATG
GATTACGACT TCCACGGGCC ATGGGAGGCA CATGGCCCGA CGGACTTTCA ATCCGAGCTG
TACCCGTCCG CGGCTGAAGT GGCCGTGATC GGCTCGGCGC AGCAATTCAG TGTCGACCAG
TCGATCGACG CCTTTCTCCG CGCCGGCGCC GATCGGCACA AGCTCCTCGT CGGTGTGCCG
TTCTACGGAC ATGGCTGGGT CGGCGTCCCC GACGGTGGCA CACACGGCTT GTACCAGACG
GCGACCGGTC CGTCGTGGCT GAATGGCGGC TCTCCGACCT GGGCGCAACT GGAGGCTCTC
GGCTACGCGC CCTACCGCGA TCCGATAACC GGTGGTTATT GGCTCTACGA CCAGGCGAGT
GAGACCCTCT ATGTCGTCGA CGACCCGGTA GAAATCGGTC AGAAAATGCA CTACATTCTC
CGCCGCGACC TCGGAGGCAC CGCTGCGTGG TCACTGGACG GCGACGACAG CGCGGGTAGT
TTGGGAGCAG CCCTCGCACT CGGTCTCATC GATCACTGA
 
Protein sequence
MLSRFRHTRW ITRTQRHTRW STRTHRRTGR LAPAAAVLVA AVVVPSSAAA HEPRSDGLVR 
AAYYTQWSVY SGFTVAGVVA NGDAGRLNQI NYAFINVAPN PAVSGSPIEC LSGDPWADYQ
MPFGGATGRP SVDGTADDWS GLQGNFKQLR ELKTRYPNLR IIASLGGYSW SGYFSDAALT
AESRAHLVAS CIDLLINGNL PGLPPGAAKG IFDGIDVDWE YPGAAGATLT NGNGNPTARP
EDTRNFTLLL AEFRRQLDSA ARANHHRRYL LTAALSANPT KIALLEVPQI SRLLDQMDVM
DYDFHGPWEA HGPTDFQSEL YPSAAEVAVI GSAQQFSVDQ SIDAFLRAGA DRHKLLVGVP
FYGHGWVGVP DGGTHGLYQT ATGPSWLNGG SPTWAQLEAL GYAPYRDPIT GGYWLYDQAS
ETLYVVDDPV EIGQKMHYIL RRDLGGTAAW SLDGDDSAGS LGAALALGLI DH