Gene Acel_0619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0619 
Symbol 
ID4486401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp669117 
End bp670328 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content61% 
IMG OID639729386 
Productcellulose-binding family II protein 
Protein accessionYP_872378 
Protein GI117927827 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00682246 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTAGTGC TCAGGGCGCC GCGATGGCGA GTGGTCATCA CGGCAGCCGC AGTGGGGATC 
GCGGCCGCCG GAATACTTGT GACAAGTCAC TCCGCCTATG GTGCAACGAC ATCAACGTGT
TCACCTACCG CCGTGGTGAG TGTGGCCGGT GACGAATACC GGGTGCAGGC CAATGAGTGG
AATTCGTCGG CCCAACAATG CCTCACCATT GACACGTCAA CGGGCGCCTG GTCAGTCAGC
ACGGCGAATT TCAATCTTGC GACCAACGGA GCGCCGGCGA CGTATCCGTC GATTTACAAG
GGTTGCCACT GGGGCAACTG CACGACAGCC AATGTCGGGA TGCCGATTCA GGTCAGCAAG
ATTGGTTCGG CTGTGACGTC GTGGAGTACG ACGCAGGTGT CGTCGGGCGC GTATGACGTG
GCCTACGACA TTTGGACGAA CAGCACCCCG ACGACCTCTG GTCAGCCGAA TGGCACAGAG
GTGATGATTT GGTTGAATTC GCGGGGTGGG GTGCAGCCGT TCGGGTCGCA GACGGCGACG
GGTGTGACGG TCGCTGGTCA CACGTGGAAC GTCTGGCAGG GCCAGCAGAC GTCCTGGAAG
ATTATTTCCT ACGTCCTGAC CCCTGGTGCG ACGTCGATCA GCAATCTGGA TTTGAAGGCG
ATTCTCGCGG ACGCTGCTGC GCGCGGCTCG CTCAACACCT CCGATTACCT CATCGATGTT
GAGGCCGGGT TTGAGATCTG GCAAGGTGGT CAGGGCCTGG GTAGTAACTC GTTCAGCGTC
TCCGTGACGA GCGGCACGTC CAGCCCGACA CCGACACCGT CTCCAAGCCC ATCCCCGAGC
CCCGCGCCCA GCCCGTCCCC GAGCCCGAGC CCAACGCCCA CGTCCAGCCC GACATCGTCG
TCTGGTGGTG TTGGGTGCAA GGCTGCCTAT GCGGTTAGTA ATGATTGGGG TTCTGGGTTT
ACGGCGACGG TGACGGTGAC AAATACCGGG AGCCGGGCGA CGAGCGGGTG GACGGTGGCG
TGGTCGTTTG GTGGGAATCA GACGGTCACG AACTACTGGA ACACTGCGTT GACCCAATCA
GGTAAGTCGG TGACGGCGAC GAACCTGAGC TACAACAACG TGATCCAACC GGGTCAGTCG
ACCACCTTCG GGTTCAACGC CAACTACACC GGCAGTAACA CCCCACCCAC ACTCACCTGC
ACCGCCAGCT GA
 
Protein sequence
MLVLRAPRWR VVITAAAVGI AAAGILVTSH SAYGATTSTC SPTAVVSVAG DEYRVQANEW 
NSSAQQCLTI DTSTGAWSVS TANFNLATNG APATYPSIYK GCHWGNCTTA NVGMPIQVSK
IGSAVTSWST TQVSSGAYDV AYDIWTNSTP TTSGQPNGTE VMIWLNSRGG VQPFGSQTAT
GVTVAGHTWN VWQGQQTSWK IISYVLTPGA TSISNLDLKA ILADAAARGS LNTSDYLIDV
EAGFEIWQGG QGLGSNSFSV SVTSGTSSPT PTPSPSPSPS PAPSPSPSPS PTPTSSPTSS
SGGVGCKAAY AVSNDWGSGF TATVTVTNTG SRATSGWTVA WSFGGNQTVT NYWNTALTQS
GKSVTATNLS YNNVIQPGQS TTFGFNANYT GSNTPPTLTC TAS