Gene Acel_0616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0616 
Symbol 
ID4486398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp658797 
End bp661088 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content61% 
IMG OID639729383 
Productcellulose-binding family II protein 
Protein accessionYP_872375 
Protein GI117927824 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3934] Endo-beta-mannanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.683683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00244969 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGTCTAG TGCGTCGCCC TGCGCGAGCA TTTGTTGCGA CCGCGGCCGG CACTGCCGTT 
GCTGCCGCGG CGACGCTCGG CTCAATCACC ATGCCGTCAG CCACGGCAGC GCCGGCGGGA
TTCGTCACCG CATCCGGCGG TCAGTTCGTT CTGAACGGCC TTCCCTATCG TTACGGGGGA
ACGAACAACT ATTACCTCAG CTATCAGTCG CACGCCGACG TCGATGACGT GTTGGCCAAG
GCTCAAGCGA TGAATCTTTC TGTCATCCGG ACCTGGGGTT TCATCGACAT CGGCTCTCTT
GACGGCTCCG TGCCCACAAT CGATGGCAAC AAGAACGGCT TCTACTTTCA GTACTGGGAC
CCGTCGACCG GCGCTCCGGC GTACAACGAC GGGCCGACCG GCTTGCAAGG CCTTGACTAC
GCGATCGCGA GCGCGGCCGC GCACGGCCTT CGGGTGATTG TCGTCCTCAC CAACGACTGG
AAAGAATTTG GGGGAATGGA TCAATACGAC AAGTGGTACG GCCTTCCTTA CCACGACAAC
TTCTACACCG ACCCCCGGAC CCAGCAGGCG TACAAGAATT GGGTCAATCA TCTACTGAAC
CGGGTCAACA GCATTACCGG CGTGACGTAC AAGAACGATC CAACGATCTT TGCTTGGGAA
CTTGCCAATG AGCCGCGCTG CGTAGGAAGC GGCACATTAC CAACCTCGGG CACGTGCACT
CAGGCGACCA TTGTCAACTG GGTCGATCAA ATGTCGGCGT ACGTCAAAAG CATAGACCCT
AACCATATGG TCTCGGTCGG CGACGAAGGG TTCTACATTG GGTCAACGCA GGGAAGCGGC
TGGCCATACA ACGACCCGTC CGACGGCGTC GACAACAATG CTCTTCTCCG TGTCAAGAAC
ATTGACTTTG GCACGTATCA CCTGTACCCG AATTACTGGG GCCAGAACGC GGACTGGGGA
ACGCAATGGA TCAAGGATCA TATTGCGAAT GCCGCAGCGA TCGGCAAGCC GACCATTCTC
GAAGAATTCG GCTGGCAGGA CACCGGGACC CGCGATTCCG TCTATCAGAC GTGGACCCAG
ACTGTGCGTA CGAACGGTGG AGCAGGCTGG AACTTCTGGA TGCTCGCTGG GAATGTCAAC
GGCCAGCCAT ATCCGAACTA TGACGGCTTC AACGTCTACT ACCCAAGTTC AACAGCGACC
GTCCTCGCCA GCGAGGCGCT CGCAATCAGT ACCGGCACAT CGCCTCCGCC GTCGCCGAGC
TCGAGTCCAT CCTCGTCGCC GTCTCCGTCG CCGTCTCCGT CGGCGTCTCC GTCGGCGTCT
CCGTCGGCGT CTTCGTCGCC GAGCCCGTCT CCGTCGTCGT CGCCGGTGTC GGGTGGGGTG
AAGGTGCAGT ACAAGAACAA TGATTCGGCG CCGGGTGATA ACCAGATCAA ACCGGGTCTC
CAGTTGGTGA ATACGGGGTC GTCGTCGGTG GATTTGTCGA CGGTGACGGT GCGGTACTGG
TTCACCCGGG ATGGTGGGTC GTCGACACTG GTGTACAACT GTGACTGGGC GGCGATGGGG
TGTGGGAATA TCCGCGCCTC GTTCGGCTCG GTGAACCCGG CGACGCCGAC GGCGGACACC
TACCTGCAGT TGTCGTTCAC TGGTGGAACG TTGGCCGCTG GTGGGTCGAC GGGTGAGATT
CAAAACCGGG TGAATAAGAG TGACTGGTCG AACTTTGATG AGACCAATGA CTACTCGTAT
GGGACGAACA CCGCCTTCCA GGATTGGACG AAGGTGACGG TGTATGTCAA TGGCCGGCTG
GTGTGGGGGA CTGAACCGTC CGGCACCAGC CCCAGCCCCA CACCCAGCCC CAGCCCAACC
CCGTCCCCGA GCCCGAGCCC GACCCCAAGC CCCAGCTCCT CCCCATCCCC GTCCCCGAGC
CCCAGCCCCA GCCCTACGCC GTCCCCGTCG CCGAGCCCGT CGCCGTCGCC GAGTGTGTCG
TCGTCGGGTG TGGGGTGCCG GGCGACGTAT GTGGTGAATA GTGATTGGGG TTCTGGGTTT
ACGGCGACGG TGACGGTGAC GAATACCGGG AGCCGGGCGA CGAGCGGGTG GACGGTGGCG
TGGTCGTTTG GTGGGAATCA GACGGTCACG AACTACTGGA ACACTGCGTT GACCCAATCA
GGTGCATCGG TGACGGCGAC GAACCTGAGC TACAACAACG TGATCCAACC GGGTCAGTCG
ACCACCTTCG GATTCAACGG AAGTTACTCA GGAACAAACA CCGCACCTAC ACTCACCTGC
ACGGCTAGTT GA
 
Protein sequence
MGLVRRPARA FVATAAGTAV AAAATLGSIT MPSATAAPAG FVTASGGQFV LNGLPYRYGG 
TNNYYLSYQS HADVDDVLAK AQAMNLSVIR TWGFIDIGSL DGSVPTIDGN KNGFYFQYWD
PSTGAPAYND GPTGLQGLDY AIASAAAHGL RVIVVLTNDW KEFGGMDQYD KWYGLPYHDN
FYTDPRTQQA YKNWVNHLLN RVNSITGVTY KNDPTIFAWE LANEPRCVGS GTLPTSGTCT
QATIVNWVDQ MSAYVKSIDP NHMVSVGDEG FYIGSTQGSG WPYNDPSDGV DNNALLRVKN
IDFGTYHLYP NYWGQNADWG TQWIKDHIAN AAAIGKPTIL EEFGWQDTGT RDSVYQTWTQ
TVRTNGGAGW NFWMLAGNVN GQPYPNYDGF NVYYPSSTAT VLASEALAIS TGTSPPPSPS
SSPSSSPSPS PSPSASPSAS PSASSSPSPS PSSSPVSGGV KVQYKNNDSA PGDNQIKPGL
QLVNTGSSSV DLSTVTVRYW FTRDGGSSTL VYNCDWAAMG CGNIRASFGS VNPATPTADT
YLQLSFTGGT LAAGGSTGEI QNRVNKSDWS NFDETNDYSY GTNTAFQDWT KVTVYVNGRL
VWGTEPSGTS PSPTPSPSPT PSPSPSPTPS PSSSPSPSPS PSPSPTPSPS PSPSPSPSVS
SSGVGCRATY VVNSDWGSGF TATVTVTNTG SRATSGWTVA WSFGGNQTVT NYWNTALTQS
GASVTATNLS YNNVIQPGQS TTFGFNGSYS GTNTAPTLTC TAS