Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1988 |
Symbol | |
ID | 4486567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 2265320 |
End bp | 2266507 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639730781 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_873746 |
Protein GI | 117929195 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGGCG ATTGGCTCGA CCTCCTCCTG CTCGTCCTCA TCGCCGGCGC GGCCTTCGAT GGGTACCGAG CCGGGTTCGT CGTCGGCGTG CTGTCCTTCG TCGGCTTGGT CGGCGGCGGC GTCGCCGGCG CATTCCTCGC CCCGGTTCTC GCCCGTCATT TCTCCGGAAA TGCGCGCGCC ATTGCCGGCG TCATCACGGT TTTTGTCCTG GCATCCATTG GTCGAGCTCT CGCCGGTGCG CTGGGGGCAT TTTTGCGGGA CCGCGTCCGC GGGCAATCCG GACGGGTCGT CGACGCGATA GCCGGGGCAA TCGTCTCCGT CATCGCGGTT CTGCTCGTCG CATGGTTCAT CGGAAGTTCG CTCGTACGAT CACCGTTCCC TGCCGTCGCC CGCGCGGTGA ACAATTCCCG GATCCTCGCC GCGGTTGACC GAGAAATGCC GCCGGCCGTG GCAGCGTGGT TCGCCAACTT TCGCCGCGTT GTCGTGGACG GCGCGTTGCC CCGGGTCTTC AGCGCACTCG GGGCTGAGCG AATCATTCCG GTCGCGCCTC CGGATCCGGC AATTCTGAGC GACCCGGATG TGCGCCGCGC AGAAGCGAGC GTGGTGAAAA TCACCGGAAT CGCCCGCGCC TGTTCCCGGG ATGTCGAGGG AAGCGGCTTC GTCTTCGCAC CCGGCCGGGT GATGACCAAT GCGCACGTCG TCGCCGGCGT GACCCACCCC GTCGTGCACC TCGCCACGTC CGACGCCCGT TACGCCGCAG TCGTGGTGTA TTACGACCCA CGTGTCGACG TCGCCGTGTT GCGGGTCGAC GGTCTCACCG CGCCGCCACT GCAATTCGAC CAGACACAGG CGGAGACCGG GGATTCCGCG GCCATCGCCG GTTTCCCGGA GAACGGGCCG TACACCGTCG TTCCGGCCCG AATCCGCGGC GCTGAATTCG CCCGCGGGCC GGACATCTAC CAGTCGACAC AAGTGACCCG CGAAGTTTAC GCAATCCGCG GTGACGTGGA GCCGGGCAAT TCCGGCGGCC CGCTTCTCGA CCCGGCGGGC CGCGTGGACG GCGTCATCTT TGGGAAAGCG GTCAACGATC CGCAGACGGG TTACGCGCTC ACGGCCGCGC AAGTCGCCGC TGCGGCGCGC GCCGGCGTCA CGGCGACGCA GCCGGTCTCC ACCCAGGGAT GCGATTAG
|
Protein sequence | MHGDWLDLLL LVLIAGAAFD GYRAGFVVGV LSFVGLVGGG VAGAFLAPVL ARHFSGNARA IAGVITVFVL ASIGRALAGA LGAFLRDRVR GQSGRVVDAI AGAIVSVIAV LLVAWFIGSS LVRSPFPAVA RAVNNSRILA AVDREMPPAV AAWFANFRRV VVDGALPRVF SALGAERIIP VAPPDPAILS DPDVRRAEAS VVKITGIARA CSRDVEGSGF VFAPGRVMTN AHVVAGVTHP VVHLATSDAR YAAVVVYYDP RVDVAVLRVD GLTAPPLQFD QTQAETGDSA AIAGFPENGP YTVVPARIRG AEFARGPDIY QSTQVTREVY AIRGDVEPGN SGGPLLDPAG RVDGVIFGKA VNDPQTGYAL TAAQVAAAAR AGVTATQPVS TQGCD
|
| |