Gene Acel_0614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0614 
Symbol 
ID4486396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp653210 
End bp654898 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content62% 
IMG OID639729381 
Productglycoside hydrolase family protein 
Protein accessionYP_872373 
Protein GI117927822 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0151906 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCGCG CATTGCGGCG AGTGCCTGGC TCGCGGGTGA TGCTGCGGGT CGGCGTCGTC 
GTCGCGGTGC TGGCATTGGT TGCCGCACTC GCCAACCTAG CCGTGCCGCG GCCGGCTCGC
GCCGCGGGCG GCGGCTATTG GCACACGAGC GGCCGGGAGA TCCTGGACGC GAACAACGTG
CCGGTACGGA TCGCCGGCAT CAACTGGTTT GGGTTCGAAA CCTGCAATTA CGTCGTGCAC
GGTCTCTGGT CACGCGACTA CCGCAGCATG CTCGACCAGA TAAAGTCGCT CGGCTACAAC
ACAATCCGGC TGCCGTACTC TGACGACATT CTCAAGCCGG GCACCATGCC GAACAGCATC
AATTTTTACC AGATGAATCA GGACCTGCAG GGTCTGACGT CCTTGCAGGT CATGGACAAA
ATCGTCGCGT ACGCCGGTCA GATCGGCCTG CGCATCATTC TTGACCGCCA CCGACCGGAT
TGCAGCGGGC AGTCGGCGCT GTGGTACACG AGCAGCGTCT CGGAGGCTAC GTGGATTTCC
GACCTGCAAG CGCTGGCGCA GCGCTACAAG GGAAACCCGA CGGTCGTCGG CTTTGACTTG
CACAACGAGC CGCATGACCC GGCCTGCTGG GGCTGCGGCG ATCCGAGCAT CGACTGGCGA
TTGGCCGCCG AGCGGGCCGG AAACGCCGTG CTCTCGGTGA ATCCGAACCT GCTCATTTTC
GTCGAAGGTG TGCAGAGCTA CAACGGAGAC TCCTACTGGT GGGGCGGCAA CCTGCAAGGA
GCCGGCCAGT ACCCGGTCGT GCTGAACGTG CCGAACCGCC TGGTGTACTC GGCGCACGAC
TACGCGACGA GCGTCTACCC GCAGACGTGG TTCAGCGATC CGACCTTCCC CAACAACATG
CCCGGCATCT GGAACAAGAA CTGGGGATAC CTCTTCAATC AGAACATTGC ACCGGTATGG
CTGGGCGAAT TCGGTACGAC ACTGCAATCC ACGACCGACC AGACGTGGCT GAAGACGCTC
GTCCAGTACC TACGGCCGAC CGCGCAATAC GGTGCGGACA GCTTCCAGTG GACCTTCTGG
TCCTGGAACC CCGATTCCGG CGACACAGGA GGAATTCTCA AGGATGACTG GCAGACGGTC
GACACAGTAA AAGACGGCTA TCTCGCGCCG ATCAAGTCGT CGATTTTCGA TCCTGTCGGC
GCGTCTGCAT CGCCTAGCAG TCAACCGTCC CCGTCGGTGT CGCCGTCTCC GTCGCCGAGC
CCGTCGGCGA GTCGGACGCC GACGCCTACT CCGACGCCGA CAGCCAGCCC GACGCCAACG
CTGACCCCTA CTGCTACGCC CACGCCCACG GCAAGCCCGA CGCCGTCACC GACGGCAGCC
TCCGGAGCCC GCTGCACCGC GAGTTACCAG GTCAACAGCG ATTGGGGCAA TGGCTTCACG
GTAACGGTGG CCGTGACAAA TTCCGGATCC GTCGCGACCA AGACATGGAC GGTCAGTTGG
ACATTCGGCG GAAATCAGAC GATTACCAAT TCGTGGAATG CAGCGGTCAC GCAGAACGGT
CAGTCGGTAA CGGCTCGGAA TATGAGTTAT AACAACGTGA TTCAGCCTGG TCAGAACACC
ACGTTCGGAT TCCAGGCGAG CTATACCGGA AGCAACGCGG CACCGACAGT CGCCTGCGCA
GCAAGTTAA
 
Protein sequence
MPRALRRVPG SRVMLRVGVV VAVLALVAAL ANLAVPRPAR AAGGGYWHTS GREILDANNV 
PVRIAGINWF GFETCNYVVH GLWSRDYRSM LDQIKSLGYN TIRLPYSDDI LKPGTMPNSI
NFYQMNQDLQ GLTSLQVMDK IVAYAGQIGL RIILDRHRPD CSGQSALWYT SSVSEATWIS
DLQALAQRYK GNPTVVGFDL HNEPHDPACW GCGDPSIDWR LAAERAGNAV LSVNPNLLIF
VEGVQSYNGD SYWWGGNLQG AGQYPVVLNV PNRLVYSAHD YATSVYPQTW FSDPTFPNNM
PGIWNKNWGY LFNQNIAPVW LGEFGTTLQS TTDQTWLKTL VQYLRPTAQY GADSFQWTFW
SWNPDSGDTG GILKDDWQTV DTVKDGYLAP IKSSIFDPVG ASASPSSQPS PSVSPSPSPS
PSASRTPTPT PTPTASPTPT LTPTATPTPT ASPTPSPTAA SGARCTASYQ VNSDWGNGFT
VTVAVTNSGS VATKTWTVSW TFGGNQTITN SWNAAVTQNG QSVTARNMSY NNVIQPGQNT
TFGFQASYTG SNAAPTVACA AS