Gene Acel_0133 details


Gene Information

Locus tag: Acel_0133
Symbol:
ID: 4484621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 135878
End bp: 137314
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 62%
IMG OID: 639728895
Product: beta-galactosidase
Protein accession: YP_871894
Protein GI: 117927343
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
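
The Start bp, End bp, Gene Length and Protein Length values listed above can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The record itself states neither assumption, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 135878
END_BP = 137314
GENE_LENGTH_BP = 1437
PROTEIN_LENGTH_AA = 478


def span_length(start, end):
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Amino-acid count, ASSUMING the CDS ends with exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values in this record under the assumptions above.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```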


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAAA TCGAAGAGCG CGATCAGGTC GAGAGTCGGC CCACGCTACG GTTCCCTGAC 
CGCTTTGTGT GGGGTGTGGC GACGTCCGCC TACCAAATCG AAGGCGCCGT TGCTGAGGAC
GGGCGCGGTC CGTCGATTTG GGATACGTTC AGCCACACGC CGGGAAAGGT GGTCGGCGGC
GATACCGGAG ATGTCGCCGC CGACCACTAC CACCGTTACG TCGGCGACGT CCGGTTGATG
GCGGACCTCG GGGTCACGTC CTACCGGTTC TCGGTGGCGT GGCCGCGTAT CCTGCCCAGC
GGCTCCGGTG CGGTCAATCG AGCCGGACTC GATTTCTACT CCCGTCTGGT CGATGAGCTG
CTGAACCACG GCATCACGCC TGCACTGACG CTTTACCACT GGGACCTCCC GCAGGCGTTG
CAAGACCAGG GCGGGTGGAC GAATCGTGCA ACTGCACAGC GATTCGCTGA ATATGCGGTC
GTCGTCGCCC GCGAATTGGG TGATCGGGTG AATTTCTGGA TTACTCTCAA CGAGCCGTGG
TGCGCGGCGT TCCTCGGTTA CGGGGCGGGC GTTCATGCAC CCGGACACAC CGACAGTGCG
GAAGCCTTGA CGGCGGCGCA TCACCTGCTC CTTGCGCACG GCCTGGCAGT CCAGGCCCTG
GGCTCGGTTC TGCCGCCGGA TTGCCAGATG GCGATCACGT TGAATCCAGC GGTCGCGCGA
CCGGCGAGCC TCGCCGAGGA AGATGTGGCC GCCGCCCGGA AGGTCGACGG ATTACAGAAT
CGGCTCTGGC TGGATCCGCT GTTTCACGGC ACCTATCCGC AGGATGTGGT GAATTTCACG
TCAAAAGTCA CCGACTGGTC GTTCGTCCGT GACAACGACC TCGCAGTGAT TGCGACCCCC
TTCGACATTC TGGGGGTCAA TTACTATAAC CCGGTCATCG TCGGTCACTA TGCCGGCTCC
GGATCGAGGG GACGCGACGG CCACGGTCAG GGAACCGGTG AGACCTGGCC CGGGTGCCCC
GATATTCAGT TTCCCGAGTG GCCGTTCCGG CGGACCGCGA TGGGCTGGCC CATTGACCCC
TCCGGACTCT ACGAACTCCT CATTCGGCTG AACCGCGACT ATCCACGGCC GATCATGATT
ACTGAGAATG GCGCCGCGTT CGATGATGTC GTCACGGACA ACAATCGGGT GCGGGATCCG
GCACGGGCGG CGTACATCCA GGAACATCTT GCCGCCCTCC ACCAAGCGAT TGCCGACGGC
GTGGACGTTC GCGGTTATTA CCTCTGGTCA TTGATCGACA ACTTTGAATG GGCGTACGGA
TACTCACGCC GGTTCGGCAT CGTTTATGTC GATTTCGAGA CTCAGGAGCG GATCATCAAG
GACAGTGGGT ATTTCTACTC GCTGGTCGCA CGGACGAACA CGATCGCGGC GCCCTGA
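
The GC content field of this record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases in the sequence. The record does not say how its value was derived or rounded, and `gc_percent` is a hypothetical helper name. The function ignores the spaces and line breaks used in the listing.

```python
def gc_percent(sequence):
    """Percentage of G/C bases, ignoring whitespace and letter case.

    ASSUMPTION: GC content = 100 * (G + C) / total bases.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, used here only as sample input;
# paste the full listing to recompute the value for the whole gene.
first_row = "ATGACACAAA TCGAAGAGCG CGATCAGGTC GAGAGTCGGC CCACGCTACG GTTCCCTGAC"
print(round(gc_percent(first_row), 1))
```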
 
Protein sequence
MTQIEERDQV ESRPTLRFPD RFVWGVATSA YQIEGAVAED GRGPSIWDTF SHTPGKVVGG 
DTGDVAADHY HRYVGDVRLM ADLGVTSYRF SVAWPRILPS GSGAVNRAGL DFYSRLVDEL
LNHGITPALT LYHWDLPQAL QDQGGWTNRA TAQRFAEYAV VVARELGDRV NFWITLNEPW
CAAFLGYGAG VHAPGHTDSA EALTAAHHLL LAHGLAVQAL GSVLPPDCQM AITLNPAVAR
PASLAEEDVA AARKVDGLQN RLWLDPLFHG TYPQDVVNFT SKVTDWSFVR DNDLAVIATP
FDILGVNYYN PVIVGHYAGS GSRGRDGHGQ GTGETWPGCP DIQFPEWPFR RTAMGWPIDP
SGLYELLIRL NRDYPRPIMI TENGAAFDDV VTDNNRVRDP ARAAYIQEHL AALHQAIADG
VDVRGYYLWS LIDNFEWAYG YSRRFGIVYV DFETQERIIK DSGYFYSLVA RTNTIAAP