Gene Acel_1701 details


Gene Information

Locus tag: Acel_1701
Symbol:
ID: 4484701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1912657
End bp: 1916070
Gene Length: 3414 bp
Protein Length: 1137 aa
Translation table: 11
GC content: 63%
IMG OID: 639730491
Product: glycoside hydrolase family protein
Protein accession: YP_873459
Protein GI: 117928908
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
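
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final stop codon encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1912657, 1916070)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (3414 bp) and Protein Length (1137 aa) fields.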


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00102428
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCTTGA CGCGCGCGTG GAGTCGTGTT CGCCGATTGG GCGCATTGAT CGGAGTGGGC 
GGCGTTGCCG CGGCCACGCT TGTCGCCGTT GCGTCGGGTC CGGTGACCGC GGCGCCGGCC
GCGACCGGAG AGTTCAATTA CGGGCAGGCG CTGCAGGACG CCATCTACTT TTACGACGAG
CAGCGGGCGG GTCACATTGT CGGCAGTGAC GACCGTGCGA GCTGGAAGGG GAATTCGGCG
CTCAATGACG GCGCGGACGT TGGATTGGAT TTGAGTGGAG GCTTCTTCGA CGCCGGTGAT
TACGTGAAGT TCGGCTTCCC GATGGCCTTT ACCTTGACAA TGCTGTCGTG GAGCGTCGAC
GATTATCGGT CGGCCTACCA GTCGTCCGGC CAGTTGCCGT ACATTCTCAA CAACATCAAG
TGGGGTACGG ACTACCTCAT CAACGCCAAC CCATCACCAA ATGTCTTATA TGGCCAGGTG
GGTGACGCGA GCCTCGACCA CGCATGGTGG GGTCCCGCGG AAGTGATGCC GATGGCGCGT
CCGGCGTACA AAATTGATCC GAGCTGCCCT GGGTCGGACC TTGCCGGCGA GACCGCGGCC
GCTATGGCCG CGGCGTCGAT CGCCTTCGCG CCGACGAATC CCAGCTACGC GCAGACACTG
CTGACGCACG CCAAGAACCT GTACACCTTT GCGGACACGT ATCGCGGCAA ATACAGCGAC
TGCATTACCG CGGCGCAGGG TTACTACAAC TCCTGGAGCG GTTACTGGGA TGAGCTCGTC
TGGGGTGCGC TCTGGCTGTA TCGAGCCACG GGCGATCAGT CATACCTGAA CAAGGCCATC
AGCTACTACC CGAACCTCGG TTACCAAGAC CAGGCCAAGA CAGTCCATTC GTACAAGTGG
ACCATCGCCT GGGATGACAA GTCGTACGGC GACTACGTCC TGCTCGCGCG CTTCACCGGC
AATCAGCAGT ACATCGCGGA CGCCGAACGC TGGCTGGACT GGTGGACGAC CGGCTACAAC
AACAACGGCA CCATCGAGCG CATCACCTAC TCGCCCGGCG GTGAGGCGTG GCTGGACACC
TGGGGTTCGC TGCGGTACGC CGCCAATACC GCCTTCGTCG CCTTAGTGTT CAGCGACTGG
CTGGCGTCAC AAGGATTGGA CCCGGCACGC GTCAAGGCGT ACCACGACTT TGCCGTCCAG
CAAATCAACT ACATCCTGGG TGACAATCCC CGCGGCGGCA GTTACATCGT CGGATTCGGG
AAGAATTCGC CGTTCAATAT TCACAGCCGG GACGCCCATG CGTCATGGGC TAATGACATC
AATACGCCGG CCAACGAACG GCATCTCTTC ATCGGCGCGA TGGTTGGCGG TCCCGGCGCG
GCGGACGATC AATACACCGA TACCCGGAGC AATTACCAGG AGAACGAACC GGCTGACGAT
TACAACGCCG GCCTGACTGG CGCCCTGGCA CGGCTTTATC AAGAATACGG CGGACAGCCT
GCTGCGAACT TCCCGCCGAA AGAGACTCCC GATGGGCCGG AAATTTATAT GCAAGCGTCG
GTCAACTCGG CGGGGACGAA TTACACCGAG ATCAAGGCGT ACATTGTCAA CCAGTCGGCA
TGGCCGGCAC GCGCCCTGGA CCACGGCTCA TTCCGGTACT ACTTCACCTT GGATGGTTCA
ACAACACCGG CTCAAATAAG CCTCAGCTCG GCGTACAACC AGTGCAGTGC GCCGCAGGGA
CCGACGCAGT ACTCCGGCAA CATTTACTAC GTGACGATCA GTTGCGACGG GGTGCACATC
GCGCCGATCG GTCAATCCGA CTACCGGAAG GAAATCCAAT TCCGCATCAG CAGTTCGGGA
AGCTGGGATC CGACGAATGA CTGGTCCTAC CAGGGTGTTG CCACCACTCC GGGAGCGACA
CCGGTAACGG TCAACAATAT CGTGCTCTAT GACGGCACCA CCGCGATTTG GGGTTCCGCG
CCGTCCGGAT CGCCGTCGCC GTCGCCGAGT CCGTCGGCTT CGCCGAGCCC GAGCCCGAGC
AGCTCGCCGT CGCCGTCGCC GAGCCCGAGC CCGCGCCCGA GCCCGAGCCC ATCCTCGTCG
CCGTCTCCGT CGCCGTCACC ATCGCCGAGT CCGTCTCGGT CTCCGTCACC ATCGGCGTCG
CCGAGCCCGT CTTCGTCACC GAGCCCGTCT TCGTCACCGT CTTCGTCACC GATCCCGTCG
CCGTCTTCGT CGCCGGTGTC GGGTGGGGTG AAGGTGCAGT ATAAGAACAA TGATTCGGCG
CCGGGTGATA ACCAGATCAA ACCGGGTTTG CAGGTGGTGA ATACGGGGTC GTCGTCGGTG
GATTTGTCGA CGGTGACGGT GCGGTACTGG TTCACCCGGG ATGGTGGGTC GTCGACACTG
GTGTACAACT GTGACTGGGC GGCGATCGGG TGTGGGAATA TCCGCGCCTC GTTCGGCTCG
GTGAACCCGG CTACGCCGAC GGCGGATACC TACCTGCAGT TGTCGTTCAC TGGTGGAACG
TTGGCCGCTG GTGGGTCGAC GGGTGAGATT CAAAACCGGG TGAATAAGAG TGACTGGTCG
AATTTCACCG AGACGCATGA CTACTCGTAT GGGACGAACA CCGCCTTCCA GGATTGGACG
AAGGTGACGG TGTACGTCAA CGGCGTGTTG GTCTGGGGGA CTGAACCGTC CGGCACCAGC
CCCAGCCCCA CACCATCCCC GAGCCCGAGC CCGAGCCCGA GTGGGGATGT GACGCCGCCG
AGTGTGCCGA CCGGCTTGGT GGTGACCGGG GTGAGTGGGT CGTCGGTGTC GTTGGCGTGG
AATGCGTCGA CGGATAACGT GGGGGTGGCG CATTACAACG TGTACCGCAA CGGGGTGTTG
GTGGGCCAGC CGACGGTGAC CTCGTTCACC GACACGGGTT TGGCCGCGGG AACCGCGTAC
ACCTACACGG TGGCCGCGGT GGACGCTGCG GGCAACACCT CCGCCCCATC CACCCCCGTC
ACCGCCACCA CCACGAGTCC CAGCCCCAGC CCCAGCCTCT CCCCGTTCCC GTCCCCGTCC
CCGTCGCCGA GCCCAAGCCC AAGCCCCACG CCGTCCCCGT CGTCGTCGGG TGTGGGGTGC
CGGGCGACGT ATGTGGTGAA TAGTGATTGG GGTTCTGGGT TTACGGCGAC GGTGACGGTG
ACGAATACCG GGAGCCGGGC GACGAACGGG TGGACGGTGG CGTGGTCGTT TGGTGGGAAT
CAGACGGTCA CGAACTCCTG GAACACCGTG TTGACCCAAT CAGGTAAATC GGTGACGGCG
ACGAACCTGA GCTACAACAA CGTGATCCAA CCCGGTCAGT CCACCACCTT CGGATTCAAC
GCCACCTACA CCGGAACCAA CACCCCACCC ACCCCCACCT GCACCACCAA CTAA
 
Protein sequence
MRLTRAWSRV RRLGALIGVG GVAAATLVAV ASGPVTAAPA ATGEFNYGQA LQDAIYFYDE 
QRAGHIVGSD DRASWKGNSA LNDGADVGLD LSGGFFDAGD YVKFGFPMAF TLTMLSWSVD
DYRSAYQSSG QLPYILNNIK WGTDYLINAN PSPNVLYGQV GDASLDHAWW GPAEVMPMAR
PAYKIDPSCP GSDLAGETAA AMAAASIAFA PTNPSYAQTL LTHAKNLYTF ADTYRGKYSD
CITAAQGYYN SWSGYWDELV WGALWLYRAT GDQSYLNKAI SYYPNLGYQD QAKTVHSYKW
TIAWDDKSYG DYVLLARFTG NQQYIADAER WLDWWTTGYN NNGTIERITY SPGGEAWLDT
WGSLRYAANT AFVALVFSDW LASQGLDPAR VKAYHDFAVQ QINYILGDNP RGGSYIVGFG
KNSPFNIHSR DAHASWANDI NTPANERHLF IGAMVGGPGA ADDQYTDTRS NYQENEPADD
YNAGLTGALA RLYQEYGGQP AANFPPKETP DGPEIYMQAS VNSAGTNYTE IKAYIVNQSA
WPARALDHGS FRYYFTLDGS TTPAQISLSS AYNQCSAPQG PTQYSGNIYY VTISCDGVHI
APIGQSDYRK EIQFRISSSG SWDPTNDWSY QGVATTPGAT PVTVNNIVLY DGTTAIWGSA
PSGSPSPSPS PSASPSPSPS SSPSPSPSPS PRPSPSPSSS PSPSPSPSPS PSRSPSPSAS
PSPSSSPSPS SSPSSSPIPS PSSSPVSGGV KVQYKNNDSA PGDNQIKPGL QVVNTGSSSV
DLSTVTVRYW FTRDGGSSTL VYNCDWAAIG CGNIRASFGS VNPATPTADT YLQLSFTGGT
LAAGGSTGEI QNRVNKSDWS NFTETHDYSY GTNTAFQDWT KVTVYVNGVL VWGTEPSGTS
PSPTPSPSPS PSPSGDVTPP SVPTGLVVTG VSGSSVSLAW NASTDNVGVA HYNVYRNGVL
VGQPTVTSFT DTGLAAGTAY TYTVAAVDAA GNTSAPSTPV TATTTSPSPS PSLSPFPSPS
PSPSPSPSPT PSPSSSGVGC RATYVVNSDW GSGFTATVTV TNTGSRATNG WTVAWSFGGN
QTVTNSWNTV LTQSGKSVTA TNLSYNNVIQ PGQSTTFGFN ATYTGTNTPP TPTCTTN
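
The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch, not part of the original record, of how the blocks might be joined, the GC content computed, and the gene sequence translated. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons; alternative start codons are not handled here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq_text)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate a nucleotide sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record above.
print(translate("ATGCGCTTGA CGCGCGCGTG GAGTCGTGTT"))
```

Applied to the first three blocks of the gene sequence, `translate` yields the first ten residues of the protein sequence listed above (MRLTRAWSRV).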