Gene Information |
Locus tag | Acel_0603 |
Symbol | |
ID | 4485525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 639808 |
End bp | 641226 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639729370 |
Product | glycoside hydrolase family protein |
Protein accession | YP_872362 |
Protein GI | 117927811 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3325] Chitinase |
TIGRFAM ID | |
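The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming (the record does not state this) that Start bp and End bp are 1-based and inclusive and that the gene length includes the stop codon. The helper names `gene_length` and `protein_length` are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(639808, 641226)
print(length, protein_length(length))  # prints "1419 472"
```

Under these assumptions the computed values agree with the Gene Length (1419 bp) and Protein Length (472 aa) rows.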
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.238448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0453952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCGA GATTCCGTCA CACCCGCTGG ATTACCCGTA CGCAGCGCCA CACCCGCTGG AGTACCCGAA CGCACCGCCG CACCGGGCGG CTTGCCCCGG CGGCGGCCGT CCTCGTCGCC GCGGTCGTCG TCCCATCAAG CGCGGCCGCC CACGAACCGC GCAGCGACGG CCTCGTCCGC GCCGCGTACT ACACACAGTG GTCGGTCTAC AGCGGTTTCA CCGTAGCCGG CGTGGTCGCA AACGGTGACG CCGGCCGCCT CAACCAAATC AACTACGCAT TCATCAATGT CGCGCCGAAC CCTGCCGTGT CCGGCTCACC CATTGAATGC CTCAGCGGCG ACCCGTGGGC CGATTACCAA ATGCCGTTCG GCGGCGCCAC CGGACGCCCA AGTGTCGACG GCACCGCCGA CGACTGGTCC GGACTTCAGG GAAATTTCAA GCAGCTGAGG GAATTGAAGA CGCGTTACCC GAATCTTCGT ATCATCGCCT CCCTCGGCGG ATACTCGTGG TCGGGATACT TCTCCGACGC CGCCCTGACA GCTGAGTCCC GCGCACATCT TGTCGCCTCA TGCATCGACC TGCTCATCAA CGGCAACCTT CCCGGTCTGC CGCCCGGTGC CGCTAAGGGC ATCTTCGACG GCATTGACGT GGACTGGGAA TATCCCGGTG CTGCCGGTGC CACGCTCACC AACGGAAACG GTAATCCCAC CGCACGGCCT GAGGACACCC GGAACTTCAC CTTGCTGCTT GCCGAATTTC GCCGCCAACT GGACAGCGCA GCCCGCGCGA ATCACCACCG CCGGTACCTG CTCACCGCCG CACTGTCAGC GAACCCGACG AAGATTGCCC TCCTCGAGGT GCCGCAGATT TCGCGGCTTC TCGACCAGAT GGATGTCATG GATTACGACT TCCACGGGCC ATGGGAGGCA CATGGCCCGA CGGACTTTCA ATCCGAGCTG TACCCGTCCG CGGCTGAAGT GGCCGTGATC GGCTCGGCGC AGCAATTCAG TGTCGACCAG TCGATCGACG CCTTTCTCCG CGCCGGCGCC GATCGGCACA AGCTCCTCGT CGGTGTGCCG TTCTACGGAC ATGGCTGGGT CGGCGTCCCC GACGGTGGCA CACACGGCTT GTACCAGACG GCGACCGGTC CGTCGTGGCT GAATGGCGGC TCTCCGACCT GGGCGCAACT GGAGGCTCTC GGCTACGCGC CCTACCGCGA TCCGATAACC GGTGGTTATT GGCTCTACGA CCAGGCGAGT GAGACCCTCT ATGTCGTCGA CGACCCGGTA GAAATCGGTC AGAAAATGCA CTACATTCTC CGCCGCGACC TCGGAGGCAC CGCTGCGTGG TCACTGGACG GCGACGACAG CGCGGGTAGT TTGGGAGCAG CCCTCGCACT CGGTCTCATC GATCACTGA
|
Protein sequence | MLSRFRHTRW ITRTQRHTRW STRTHRRTGR LAPAAAVLVA AVVVPSSAAA HEPRSDGLVR AAYYTQWSVY SGFTVAGVVA NGDAGRLNQI NYAFINVAPN PAVSGSPIEC LSGDPWADYQ MPFGGATGRP SVDGTADDWS GLQGNFKQLR ELKTRYPNLR IIASLGGYSW SGYFSDAALT AESRAHLVAS CIDLLINGNL PGLPPGAAKG IFDGIDVDWE YPGAAGATLT NGNGNPTARP EDTRNFTLLL AEFRRQLDSA ARANHHRRYL LTAALSANPT KIALLEVPQI SRLLDQMDVM DYDFHGPWEA HGPTDFQSEL YPSAAEVAVI GSAQQFSVDQ SIDAFLRAGA DRHKLLVGVP FYGHGWVGVP DGGTHGLYQT ATGPSWLNGG SPTWAQLEAL GYAPYRDPIT GGYWLYDQAS ETLYVVDDPV EIGQKMHYIL RRDLGGTAAW SLDGDDSAGS LGAALALGLI DH
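The sequences above are shown in space-separated blocks of ten characters. The sketch below, in plain Python with no external libraries, shows one way to strip that formatting, compute GC content, and translate the gene sequence. It assumes the gene sequence is already given in coding orientation (it starts with ATG even though the strand is listed as "-") and uses the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. The function names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# codon-to-amino-acid mapping as table 1 (it differs in permitted start codons).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean_sequence("ATGTTGTCGA GATTCCGTCA CACCCGCTGG")
print(translate(prefix))  # prints "MLSRFRHTRW"
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions gives a way to check the GC content and Protein Length rows against the sequence itself.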