Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1593 |
Symbol | |
ID | 4484646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1791774 |
End bp | 1793312 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639730377 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_873351 |
Protein GI | 117928800 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.421913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.822638 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAGC ATCAGCAGCA CTGGCCGGGT CGACCGCAGT CCGAATCACC GCAGCCGTCG GAGAACGCGG CCCGGGAGCC GGCAGGCACG CCGGTTGATA CTCCGGCGAC GGCGAACGCC TGGACGGCCG GCGGTGCACC CGCGGGTGCG GGGAACCAGG CGGAGGGCGC TAGTGGAGGC GCTGCGTGGT GGGCGCCGCC TCCGCCGTCC GGCGCGTCGT GGCCCTCCCC TGCCGGATCG CGGCCGTCGT CCGGCGAAGT GCAGCCGCCC GCCGGTGAGG GTACCGATCC GCTCACGACC ACGATTCCAT TGGACGTGCC GGCCCCGACT CCACCGGCCG CGGGATTCCC TCCTGCTGGG GACGTTGCGC CGGCCGGAGG ATTTCCGCCT CCTGAGTCGA GTGGACGGCG CCGGATCTGG CCCATCGTTG TCGTGGCCGT TCTGCTCGCC CTTGTCGCTG GCGGCGGGTC GGGAGCCGTC GTCGGCCGCT ACGTGGCTCG GCACACGACG ACAGCGTCGC CGTCGTCGAC GGTGTCGAGT ACAACGGGGA ATTCGGCGAC ATCGAGTACA GCGCCGACGG TGCCCCCTGC GCCGGCCGGG TCGATTGCCG CCGTCGCGAA ACAACTCCTT CCCTCGGTTG TCTCCATCAT CGTGACGACC GACCAGGGCG GCGACGAAGG CACGGGCATC GTGCTGAGCA GCGACGGCTA CATTCTCACG AACAATCACG TCGTCGAGGC GGCGGCCGGC GGGGGCATCG TTGCGGTCAC AACAGCGGAC GGCCGATCGA TGCGGGCGCG GATCGTCGGG CGGGATCCCA CATCGGACCT CGCGGTGATT CAGGCTGTCG GCGTGACGAA TCTGCAACCG GCGGCCATCG GGGACGACCG CACCCTTCAG GTCGGCCAAC AGGTCATTGC CATCGGATCG CCCCTCGGCC TGTCGAATAC TGTTACGGCA GGAATTGTGA GCGCGTTAAA CCGCCCGGTC TGTACGCAAA ATTGCTCCGG CGGGGGGACG ACGCCCACCG TGCTCGACGC CATTCAAACC GACGCGGCCA TCAATCCCGG CAATTCCGGC GGCCCGTTGG TTGACATGGC CGGCCGGGTG GTCGGCGTCA ACACAGCGAT TGCGACGCTT GACCAGCAGC CGTTCGGCGG CGGGCAATCG GGGAACATCG GGGTCGGCTT TGCGATTCCG ATTTCGGAAG CGATGCGGGT CGTCAAGGAG TTGGAGGCGA CTGGGCATGC GACGCACGCT GTGCTCGGCG TCGGCGTCCG GGATTCGGTC GACCCGACGC TGCAGACGCC GAACGGCTGC CTGGTCGTCA GCGTGACCGC GGGCGGGCCG GCGGATCGGG CGGGAGTTCG CGTCGGAGAC GTCATCGTGC AATTCGGTGA TCGCACCATC CGGGATTCGG ATTCGCTCAT TGCGGCGACC CACGCCGCGG TGCCGAATTC GACGGTGACT ATTGGATTTG TCCGCAACGG CAGCCGGCAC ACCACGCAAG TCACCCTCGG CTCAGCGAGC TCCGGCTGA
|
Protein sequence | MTEHQQHWPG RPQSESPQPS ENAAREPAGT PVDTPATANA WTAGGAPAGA GNQAEGASGG AAWWAPPPPS GASWPSPAGS RPSSGEVQPP AGEGTDPLTT TIPLDVPAPT PPAAGFPPAG DVAPAGGFPP PESSGRRRIW PIVVVAVLLA LVAGGGSGAV VGRYVARHTT TASPSSTVSS TTGNSATSST APTVPPAPAG SIAAVAKQLL PSVVSIIVTT DQGGDEGTGI VLSSDGYILT NNHVVEAAAG GGIVAVTTAD GRSMRARIVG RDPTSDLAVI QAVGVTNLQP AAIGDDRTLQ VGQQVIAIGS PLGLSNTVTA GIVSALNRPV CTQNCSGGGT TPTVLDAIQT DAAINPGNSG GPLVDMAGRV VGVNTAIATL DQQPFGGGQS GNIGVGFAIP ISEAMRVVKE LEATGHATHA VLGVGVRDSV DPTLQTPNGC LVVSVTAGGP ADRAGVRVGD VIVQFGDRTI RDSDSLIAAT HAAVPNSTVT IGFVRNGSRH TTQVTLGSAS SG
|
| |