Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1716 |
Symbol | |
ID | 4484871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1931194 |
End bp | 1932036 |
Gene Length | 843 bp |
Protein Length | 280 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639730506 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_873474 |
Protein GI | 117928923 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.56437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.319195 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGCG GGGCAGGAAC CGGAATCATC ATCAGCTCCG ACGGGCTCGT CGTGACGAAC GCGCACGTCG TCAACGGCGC AACGCAAATA AACGTGACGC TGCCCGGGAA CGGCGGTACG CACGCCGCGT CAATCGTCGG CATTGACACG ACGAAGGACC TCGCCGTGCT CAAGGTGTCC GGAGTGTCCG GTCTCGTGCC TGCAACATTT GCGAATTCCT CGACCGTGCA CGTCGGTGAC ACGGTGCTGG CGATTGGGAA TGCGCTCGGC TACGGTGGTC AGCCGACCGT CACCGAAGGC ATCATTTCGG CAACGAACCG AAGCCTGCGG GACAGCAGCG AGAATCTGAC CGGGCTGCTG CAGACGGACG CCGCCATCAA CCCCGGCAAC AGCGGCGGAC CGCTTGTCAA TACCAGCGGC GAGGTCATCG GCATCAACGT GGCCGTGGCG ACCGGAACAC CGAGCGAGCC CGCCCAGAAT ATCGGCTTCG CGATTCCGAG CAATACCGTG ACCGCAGCGC TGCCCGCACT GGAAGCCGGG AAGTCGGCAT CCGGGTCCCC CTCACCAAGC CAGACGACCG CATTTCTCGG GGTTGTGGTG ACGGATGCGC CGAACGGAGC AGCCGTCGTG GAGGTGCAGC CGAGCGGACC GGCAGCTCAA GCCGGCGTGC AGGCGGGTGA CGTCATCACC GCGGTCGGCA ACGAGCAGAC GCCGGATGCA GCGGCGTTGC AAGCGGCGAT CCGCGCGAAG AAACCGGGAA CGACGGTCAC CCTGCACATC ATCCGCGGCG CCCAACAGCT GACGATTCCC GTCACCCTCG GCTCCACCCA GGTCACTTCC TGA
|
Protein sequence | MASGAGTGII ISSDGLVVTN AHVVNGATQI NVTLPGNGGT HAASIVGIDT TKDLAVLKVS GVSGLVPATF ANSSTVHVGD TVLAIGNALG YGGQPTVTEG IISATNRSLR DSSENLTGLL QTDAAINPGN SGGPLVNTSG EVIGINVAVA TGTPSEPAQN IGFAIPSNTV TAALPALEAG KSASGSPSPS QTTAFLGVVV TDAPNGAAVV EVQPSGPAAQ AGVQAGDVIT AVGNEQTPDA AALQAAIRAK KPGTTVTLHI IRGAQQLTIP VTLGSTQVTS
|
| |