Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0614 |
Symbol | |
ID | 4486396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 653210 |
End bp | 654898 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639729381 |
Product | glycoside hydrolase family protein |
Protein accession | YP_872373 |
Protein GI | 117927822 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0151906 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCGCG CATTGCGGCG AGTGCCTGGC TCGCGGGTGA TGCTGCGGGT CGGCGTCGTC GTCGCGGTGC TGGCATTGGT TGCCGCACTC GCCAACCTAG CCGTGCCGCG GCCGGCTCGC GCCGCGGGCG GCGGCTATTG GCACACGAGC GGCCGGGAGA TCCTGGACGC GAACAACGTG CCGGTACGGA TCGCCGGCAT CAACTGGTTT GGGTTCGAAA CCTGCAATTA CGTCGTGCAC GGTCTCTGGT CACGCGACTA CCGCAGCATG CTCGACCAGA TAAAGTCGCT CGGCTACAAC ACAATCCGGC TGCCGTACTC TGACGACATT CTCAAGCCGG GCACCATGCC GAACAGCATC AATTTTTACC AGATGAATCA GGACCTGCAG GGTCTGACGT CCTTGCAGGT CATGGACAAA ATCGTCGCGT ACGCCGGTCA GATCGGCCTG CGCATCATTC TTGACCGCCA CCGACCGGAT TGCAGCGGGC AGTCGGCGCT GTGGTACACG AGCAGCGTCT CGGAGGCTAC GTGGATTTCC GACCTGCAAG CGCTGGCGCA GCGCTACAAG GGAAACCCGA CGGTCGTCGG CTTTGACTTG CACAACGAGC CGCATGACCC GGCCTGCTGG GGCTGCGGCG ATCCGAGCAT CGACTGGCGA TTGGCCGCCG AGCGGGCCGG AAACGCCGTG CTCTCGGTGA ATCCGAACCT GCTCATTTTC GTCGAAGGTG TGCAGAGCTA CAACGGAGAC TCCTACTGGT GGGGCGGCAA CCTGCAAGGA GCCGGCCAGT ACCCGGTCGT GCTGAACGTG CCGAACCGCC TGGTGTACTC GGCGCACGAC TACGCGACGA GCGTCTACCC GCAGACGTGG TTCAGCGATC CGACCTTCCC CAACAACATG CCCGGCATCT GGAACAAGAA CTGGGGATAC CTCTTCAATC AGAACATTGC ACCGGTATGG CTGGGCGAAT TCGGTACGAC ACTGCAATCC ACGACCGACC AGACGTGGCT GAAGACGCTC GTCCAGTACC TACGGCCGAC CGCGCAATAC GGTGCGGACA GCTTCCAGTG GACCTTCTGG TCCTGGAACC CCGATTCCGG CGACACAGGA GGAATTCTCA AGGATGACTG GCAGACGGTC GACACAGTAA AAGACGGCTA TCTCGCGCCG ATCAAGTCGT CGATTTTCGA TCCTGTCGGC GCGTCTGCAT CGCCTAGCAG TCAACCGTCC CCGTCGGTGT CGCCGTCTCC GTCGCCGAGC CCGTCGGCGA GTCGGACGCC GACGCCTACT CCGACGCCGA CAGCCAGCCC GACGCCAACG CTGACCCCTA CTGCTACGCC CACGCCCACG GCAAGCCCGA CGCCGTCACC GACGGCAGCC TCCGGAGCCC GCTGCACCGC GAGTTACCAG GTCAACAGCG ATTGGGGCAA TGGCTTCACG GTAACGGTGG CCGTGACAAA TTCCGGATCC GTCGCGACCA AGACATGGAC GGTCAGTTGG ACATTCGGCG GAAATCAGAC GATTACCAAT TCGTGGAATG CAGCGGTCAC GCAGAACGGT CAGTCGGTAA CGGCTCGGAA TATGAGTTAT AACAACGTGA TTCAGCCTGG TCAGAACACC ACGTTCGGAT TCCAGGCGAG CTATACCGGA AGCAACGCGG CACCGACAGT CGCCTGCGCA GCAAGTTAA
|
Protein sequence | MPRALRRVPG SRVMLRVGVV VAVLALVAAL ANLAVPRPAR AAGGGYWHTS GREILDANNV PVRIAGINWF GFETCNYVVH GLWSRDYRSM LDQIKSLGYN TIRLPYSDDI LKPGTMPNSI NFYQMNQDLQ GLTSLQVMDK IVAYAGQIGL RIILDRHRPD CSGQSALWYT SSVSEATWIS DLQALAQRYK GNPTVVGFDL HNEPHDPACW GCGDPSIDWR LAAERAGNAV LSVNPNLLIF VEGVQSYNGD SYWWGGNLQG AGQYPVVLNV PNRLVYSAHD YATSVYPQTW FSDPTFPNNM PGIWNKNWGY LFNQNIAPVW LGEFGTTLQS TTDQTWLKTL VQYLRPTAQY GADSFQWTFW SWNPDSGDTG GILKDDWQTV DTVKDGYLAP IKSSIFDPVG ASASPSSQPS PSVSPSPSPS PSASRTPTPT PTPTASPTPT LTPTATPTPT ASPTPSPTAA SGARCTASYQ VNSDWGNGFT VTVAVTNSGS VATKTWTVSW TFGGNQTITN SWNAAVTQNG QSVTARNMSY NNVIQPGQNT TFGFQASYTG SNAAPTVACA AS
|
| |