Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1663 |
Symbol | |
ID | 4485248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1871723 |
End bp | 1873699 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639730452 |
Product | physarolisin II |
Protein accession | YP_873421 |
Protein GI | 117928870 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.146214 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAACC CAAGACGCTG GGCGAAAAGG ATGGCCACCC CGCGACGCCG GCCGGTGCGA CGTCTGACGA TCGTTGCGGC AGCCAGCTCC CTCGGCGTCG TGATCTCGTC ATTCGCTGCT CCGCTCGCCG CGCACGCCAG CCCCGCGCGG TCGCCGGTCC CGCACAGCCG GCCAGGCTGG CTCACGCATG CCCGGGACCT CGGGCGAGCC CCCGCCACGG CCGCCGTCCA GGCACGCATT TATCTGGCGC CGAACGGCGG GCTTGACGCG TTGGCCGCCC GGGCGCTTGC CGTGTCGACC CCCGGGTCCG CCGACTACGG GCATTTCCTC TCACCAGCGG AATACTTCGC GGAATTCGGC ACCACCGACG CGACCGTCCG GGCGGTCACC CGGTGGCTCA CGAGCTCGGG TCTCACCGTG ACCGGGGTCG AAGCCCACAA CCGTTACCTC GACATCACGG GCTCGGTCGC CGCTGCGGAG CGTGCGTTCG GCGTGCACAT CCACGCCTAC AGCCATGACG GGCAGACGGT GCAGGCACCC GATGACACAC TCACGACGCC GGCGGACCTC GCCGACGACG TCCTCGCCGT CGACGGCATT GACACGACGC CGAACATCGT CCGGCCCGCT CTCGCGGCCG GGGCGGTCCC TTATCGGGGA GCGCCGATCC CGCCTCCCGC CGGCTTCCGC AACGCTGCGC CCTGTTCCTC TTACTACGGG GAGAAACTCG CCACCGATCT CCCATCATTC AACGGGCAGG CCCTGCCGTA CGCCGTCTGC GGATACGTGG GCGCGCAATT CCGCTCGGCG TATGAGGGCG CCACGACGCT GGACGGCACC GGCATCACCG TCGGGATCAC CGACGCGTAC GCATCGCCGA CCATCGCAGC CGACGCGGCC ACGTACGCTG TCCGGAACGG GGACCGGCCC TATGCGCCCG GCCAATTGAC CCAGAGCCTT CCCCGCACGT TCACCAACAC CAAGCTGTGT GATGCATCCG GGTGGTTTGG CGAAGAGACG CTGGACGTCG AAGCGGTCCA CGCCATGGCG CAAGGCGCCA AAATCCGGTA TTACGCATCG TCAAGTTGCC TGGACCGGGA TCTGCTGGAT GCCTTTGCCC GCATCAACGA CGAGGCACGC GTCCAGATCG TGTCGAATTC CTGGGGCGCG GTCGAGCAGA GCGAGCACCG TTCGACGATT CTCGCCTACG AACAGGCGTT TCTCCAAGGC GCTGTCGAGG GCATCAGCTA CGTCTTCTCC TCCGGCGACA GCGGCGATGA GGTCGCGAGC AGCGGTACGA AGCAGACGGA TTACCCCGCA TCGGATCCCT TTGTGACCGC TGTCGGCGGT ACGGCGACGG CTATTGACGC GACGGGCCGG CTCGCCTGGG AAACCGGCTG GGGCACATAC CGGTATGCGC TGGCGCCGGA CGGAACCGCC TGGTCGTCGT CCGGGACATT TCTGTACGGA TCCGGCGGCG GGGAGTCGAC GGTATTCGCG CAACCTATTT ACCAGCAGGG AATAACACCC GCCGGTGCCC GCGGCGTGCC TGATGTGGCG ATGGACGCCG ACGTGACGAC CGGGATGCTG GTCGGCCAGA CGCAAGCCTT CCCGGACGGC ACGTATTACG ACCAGTACCG TATCGGCGGC ACGAGCCTGG CGGCGCCGCT CTTCGCCGGC ATGACGGCGC TCACGTTCCA GCATGCGCGC GGCGGTGTCG GGTTGCTGAA TCCGACTATT TACCGAAACG CCGGCACGGG TGTCTTCACC GACGTCACCG GGCCCGGTCC GGACCCAGGA AACGTTCGCG TGGATTATGC CGATGGTGTC GACGCCGCCC AGGGTGTCGT TGCAACGGTC CGGACCTTCA ACCAGGATTC CAGCCTCGCG GTGGGACCCG GTTGGGATCC GGTGACCGGG CTCGGCAGTG CGAATGCCGG CTGGTTGACG GCTATTCCCC CGCGGGCACG GCGATAA
|
Protein sequence | MGNPRRWAKR MATPRRRPVR RLTIVAAASS LGVVISSFAA PLAAHASPAR SPVPHSRPGW LTHARDLGRA PATAAVQARI YLAPNGGLDA LAARALAVST PGSADYGHFL SPAEYFAEFG TTDATVRAVT RWLTSSGLTV TGVEAHNRYL DITGSVAAAE RAFGVHIHAY SHDGQTVQAP DDTLTTPADL ADDVLAVDGI DTTPNIVRPA LAAGAVPYRG APIPPPAGFR NAAPCSSYYG EKLATDLPSF NGQALPYAVC GYVGAQFRSA YEGATTLDGT GITVGITDAY ASPTIAADAA TYAVRNGDRP YAPGQLTQSL PRTFTNTKLC DASGWFGEET LDVEAVHAMA QGAKIRYYAS SSCLDRDLLD AFARINDEAR VQIVSNSWGA VEQSEHRSTI LAYEQAFLQG AVEGISYVFS SGDSGDEVAS SGTKQTDYPA SDPFVTAVGG TATAIDATGR LAWETGWGTY RYALAPDGTA WSSSGTFLYG SGGGESTVFA QPIYQQGITP AGARGVPDVA MDADVTTGML VGQTQAFPDG TYYDQYRIGG TSLAAPLFAG MTALTFQHAR GGVGLLNPTI YRNAGTGVFT DVTGPGPDPG NVRVDYADGV DAAQGVVATV RTFNQDSSLA VGPGWDPVTG LGSANAGWLT AIPPRARR
|
| |