Gene Acel_2050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2050 
Symbol 
ID4484729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2322145 
End bp2324598 
Gene Length2454 bp 
Protein Length817 aa 
Translation table11 
GC content66% 
IMG OID639730846 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_873808 
Protein GI117929257 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCGAC GTGATACGGA CCAGACGTCG GCCGCGAATG GTTCTCTGCC GATCTGGCGG 
GACCCGACGC GTTCTCCGCG GGAGCGGGTT GCGGATTTGC TGGCCAAGAT GACGCTCGAG
GAAAAGATCG CCCAGTTGTA CGGCGTCTGG GTGGGAGCGG ACGCGTCCGG CGACGGCGTG
GCGCCGCACC AACACGACAT GAGCAACCCG GACCTTGACT GGCACGCGCT GATTTCGCGA
GGCATCGGAC AGTTGACCCG TCCCTTTGGT ACGGCGCCGG TGGATCCTGC CCTCGGCGCA
CGTGCGCTGG CGCGAACTCA GGCGGAAATC GTCGCCGCCA ATCGTCACGG CGTCCCGGCA
CTGGTGCACG AAGAGTGCCT CACGGGTTTC ATGACGTGGG GCGCCACCAC ATACCCCACG
CCGTTGGCCT GGGGGGCGAG TTTCGACCCG GCGCTCGTCG AGGAAATGGG CCGCCAGATC
GGCACTGCCA TGCGGCGCAT GGGTGTGCAC CAAGGACTCG CCCCGGTTCT CGACGTCGTC
CGCGATTACC GCTGGGGCCG GGTGGAGGAG ACCATCGGTG AGGACCCTTA CCTCGTGGGC
ACGATCGGAT CCGCGTACGT GAAGGGGCTC GAATCCGCCG GAATTGTCGC GACCCTCAAA
CACTTCGTCG CATATTCCGG TTCGCGGGCG GGCCGCAATT TCGCACCCGT GGCGACGGGA
CCGCGCGAAC TCGCGGATGT TTTTTTGTTG CCTTTTGAAA TGGCGCTCCG CCTAGGCGGC
GCGCGCTCCG TCATGCATTC GTACAACGAA ATCGACGGCG TACCCGTCGC AGCAAATGCG
GATCTGCTCA CCAACCTTCT TCGCGACCAA TGGGATTTTC ACGGCACGGT CGTCTCGGAC
TACTTCGGCA TCGCTTTCCT GCGTCGCCTG CACCAGGTCG CGGAAGATGA CACCGGTGCC
GCTGTTCTTG CCCTCACGGC CGGCATCGAC GTCGAACTTC CCACCGTGCA CTGCTATGGA
GAGCCGTTGA CCCAAGCGGT TCGAGCCGGC CTGGTCTCCG AGGAGCTCAT TGACCGCGCA
GTCTGCCGGG TACTCGAGCA GAAGTGCGAG CTTGGCCTGC TCGACCCGGA TTGGACACCG
GAGCCGGCGG CTGTCCGCGA ACTTGGCCAC CATGCCGAGA CCGACGCCGA TGACGCCCGC
GGCACGATCA ACCTTGACCC CCCGGAATCC CGCGCCCTCG CCCGGCGCCT GGCGGAGGAA
TCCGTCGTGC TGCTCGCCAA CACCGGTGTG CTGCCCATCA GCAACGTACG ACGGATGGCT
GTCGTCGGCC CGCTGGCTGA CGACCCAGCG GCGATGCTCG GTTGCTACAC CTTCGAGAGT
CACGTCCGTT CCGCCCATCC GCAGGTGCCG CCCGGAATCG ACATTCCGTC CGTTCTCGAG
GCCCTGCGGG CCGAATTCCC GGAGACCAAA ATCGAGCACG CCCGAGGCTG CGACGTCCGC
AATGACGACC GATCCGGCTT CCCGGAGGCC GTGGCCCTTG CGGAGAGCGC AGACCTCTGC
GTGATTGTCG TCGGCGATCG GTCCGGGCTC TTCGGTCGAG GAACCTCCGG GGAAGGCTCC
GACGTCCCGG ACCTGCGGCT ACCCGGCGTG CAGGAAGAAT TCATCCACCG GATCTGTGAT
GCCGGCACAC CGGCGGTCCT CGTTCTCCTC ACCGGACGGC CGTACGCTCT CGGCGGGTTG
GCCGATCGGG TTCAGGCGAT CGTGCAAGCG TTCTTCCCCG GTGAGGAAGG AGGATCGGCG
ATTGCCGGCA TCCTCTCCGG CCGGGTCAGC CCGTCCGGTC GCCTTCCGGT GAGCATCCCG
CGGCACCCCG GTGGCCAGCC GGGTACCTAC CTCACGCCGC GGCTCGGTCA GCGATCGGAC
GTGAGCAGTG TCGACCCGAC GCCGCTCTGG CCGTTCGGTT ACGGCCTGTC CTACACGTCA
TTTGCCTGGG ACGACGTCCG GTTGGACGGC GTGCCGATCA ATGAGCCCGT GGAGACGCCG
GTTGACAGCG TCGTCGAGCT CTCCCTGCGG GTGACGAACA CCGGTCACCG GCTCGGCACC
GACGTCGTCC AGCTCTATCT TCACGATCCG ATCGCCCAAG TGACCCGGCC GACGGTTCAG
CTCATTGGGT ATGCCCGAGT CACCGTCGCT GCAGGGGAGA GCCGCCGGGT CACATTCCGC
ATCCCGACCG ACGTCTTCGG ATTCACCGGG CGCGACGGGC ACCGGATCGT CGAGCCCGGT
GAGATTGAAC TGCGCCTGTC GGAATCCAGC AATTGCCCGC GCTTCTCGGT CCCGGTCCGC
CTTGTCGGTG AGGTACGGCG TCTCGGTCCC GACCGTGCGT TGGTAACGAC GGCGCAGGTG
AGCGAACCCG TTGCCGCCGC CGGACAACAC CAGGGCGAGG GACTAGCCGA ATAA
 
Protein sequence
MLRRDTDQTS AANGSLPIWR DPTRSPRERV ADLLAKMTLE EKIAQLYGVW VGADASGDGV 
APHQHDMSNP DLDWHALISR GIGQLTRPFG TAPVDPALGA RALARTQAEI VAANRHGVPA
LVHEECLTGF MTWGATTYPT PLAWGASFDP ALVEEMGRQI GTAMRRMGVH QGLAPVLDVV
RDYRWGRVEE TIGEDPYLVG TIGSAYVKGL ESAGIVATLK HFVAYSGSRA GRNFAPVATG
PRELADVFLL PFEMALRLGG ARSVMHSYNE IDGVPVAANA DLLTNLLRDQ WDFHGTVVSD
YFGIAFLRRL HQVAEDDTGA AVLALTAGID VELPTVHCYG EPLTQAVRAG LVSEELIDRA
VCRVLEQKCE LGLLDPDWTP EPAAVRELGH HAETDADDAR GTINLDPPES RALARRLAEE
SVVLLANTGV LPISNVRRMA VVGPLADDPA AMLGCYTFES HVRSAHPQVP PGIDIPSVLE
ALRAEFPETK IEHARGCDVR NDDRSGFPEA VALAESADLC VIVVGDRSGL FGRGTSGEGS
DVPDLRLPGV QEEFIHRICD AGTPAVLVLL TGRPYALGGL ADRVQAIVQA FFPGEEGGSA
IAGILSGRVS PSGRLPVSIP RHPGGQPGTY LTPRLGQRSD VSSVDPTPLW PFGYGLSYTS
FAWDDVRLDG VPINEPVETP VDSVVELSLR VTNTGHRLGT DVVQLYLHDP IAQVTRPTVQ
LIGYARVTVA AGESRRVTFR IPTDVFGFTG RDGHRIVEPG EIELRLSESS NCPRFSVPVR
LVGEVRRLGP DRALVTTAQV SEPVAAAGQH QGEGLAE