Gene Acel_1663 details


Gene Information

Locus tag: Acel_1663
Symbol:
ID: 4485248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1871723
End bp: 1873699
Gene Length: 1977 bp
Protein Length: 658 aa
Translation table: 11
GC content: 68%
IMG OID: 639730452
Product: physarolisin II
Protein accession: YP_873421
Protein GI: 117928870
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
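The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(1871723, 1873699)
print(length_bp)                  # 1977
print(protein_length(length_bp))  # 658
```

Both results agree with the Gene Length and Protein Length fields of this record.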


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.146214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGAACC CAAGACGCTG GGCGAAAAGG ATGGCCACCC CGCGACGCCG GCCGGTGCGA 
CGTCTGACGA TCGTTGCGGC AGCCAGCTCC CTCGGCGTCG TGATCTCGTC ATTCGCTGCT
CCGCTCGCCG CGCACGCCAG CCCCGCGCGG TCGCCGGTCC CGCACAGCCG GCCAGGCTGG
CTCACGCATG CCCGGGACCT CGGGCGAGCC CCCGCCACGG CCGCCGTCCA GGCACGCATT
TATCTGGCGC CGAACGGCGG GCTTGACGCG TTGGCCGCCC GGGCGCTTGC CGTGTCGACC
CCCGGGTCCG CCGACTACGG GCATTTCCTC TCACCAGCGG AATACTTCGC GGAATTCGGC
ACCACCGACG CGACCGTCCG GGCGGTCACC CGGTGGCTCA CGAGCTCGGG TCTCACCGTG
ACCGGGGTCG AAGCCCACAA CCGTTACCTC GACATCACGG GCTCGGTCGC CGCTGCGGAG
CGTGCGTTCG GCGTGCACAT CCACGCCTAC AGCCATGACG GGCAGACGGT GCAGGCACCC
GATGACACAC TCACGACGCC GGCGGACCTC GCCGACGACG TCCTCGCCGT CGACGGCATT
GACACGACGC CGAACATCGT CCGGCCCGCT CTCGCGGCCG GGGCGGTCCC TTATCGGGGA
GCGCCGATCC CGCCTCCCGC CGGCTTCCGC AACGCTGCGC CCTGTTCCTC TTACTACGGG
GAGAAACTCG CCACCGATCT CCCATCATTC AACGGGCAGG CCCTGCCGTA CGCCGTCTGC
GGATACGTGG GCGCGCAATT CCGCTCGGCG TATGAGGGCG CCACGACGCT GGACGGCACC
GGCATCACCG TCGGGATCAC CGACGCGTAC GCATCGCCGA CCATCGCAGC CGACGCGGCC
ACGTACGCTG TCCGGAACGG GGACCGGCCC TATGCGCCCG GCCAATTGAC CCAGAGCCTT
CCCCGCACGT TCACCAACAC CAAGCTGTGT GATGCATCCG GGTGGTTTGG CGAAGAGACG
CTGGACGTCG AAGCGGTCCA CGCCATGGCG CAAGGCGCCA AAATCCGGTA TTACGCATCG
TCAAGTTGCC TGGACCGGGA TCTGCTGGAT GCCTTTGCCC GCATCAACGA CGAGGCACGC
GTCCAGATCG TGTCGAATTC CTGGGGCGCG GTCGAGCAGA GCGAGCACCG TTCGACGATT
CTCGCCTACG AACAGGCGTT TCTCCAAGGC GCTGTCGAGG GCATCAGCTA CGTCTTCTCC
TCCGGCGACA GCGGCGATGA GGTCGCGAGC AGCGGTACGA AGCAGACGGA TTACCCCGCA
TCGGATCCCT TTGTGACCGC TGTCGGCGGT ACGGCGACGG CTATTGACGC GACGGGCCGG
CTCGCCTGGG AAACCGGCTG GGGCACATAC CGGTATGCGC TGGCGCCGGA CGGAACCGCC
TGGTCGTCGT CCGGGACATT TCTGTACGGA TCCGGCGGCG GGGAGTCGAC GGTATTCGCG
CAACCTATTT ACCAGCAGGG AATAACACCC GCCGGTGCCC GCGGCGTGCC TGATGTGGCG
ATGGACGCCG ACGTGACGAC CGGGATGCTG GTCGGCCAGA CGCAAGCCTT CCCGGACGGC
ACGTATTACG ACCAGTACCG TATCGGCGGC ACGAGCCTGG CGGCGCCGCT CTTCGCCGGC
ATGACGGCGC TCACGTTCCA GCATGCGCGC GGCGGTGTCG GGTTGCTGAA TCCGACTATT
TACCGAAACG CCGGCACGGG TGTCTTCACC GACGTCACCG GGCCCGGTCC GGACCCAGGA
AACGTTCGCG TGGATTATGC CGATGGTGTC GACGCCGCCC AGGGTGTCGT TGCAACGGTC
CGGACCTTCA ACCAGGATTC CAGCCTCGCG GTGGGACCCG GTTGGGATCC GGTGACCGGG
CTCGGCAGTG CGAATGCCGG CTGGTTGACG GCTATTCCCC CGCGGGCACG GCGATAA
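The gene sequence above is laid out in blocks of 10 nucleotides, so it has to be joined into one string before it can be measured. The sketch below shows one way to do that and to recompute a GC percentage. The helper names are hypothetical, only the first line of the sequence is used as sample input, and no result for the full gene is asserted here.

```python
def join_blocks(text):
    """Remove all whitespace so the 10-character blocks form one string."""
    return "".join(text.split())


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGGGGAACC CAAGACGCTG GGCGAAAAGG ATGGCCACCC CGCGACGCCG GCCGGTGCGA"
fragment = join_blocks(first_line)
print(len(fragment))         # 60
print(gc_percent(fragment))  # GC percentage of this fragment only
```

Passing the complete sequence text through `join_blocks` should give a string whose length matches the Gene Length field.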
 
Protein sequence
MGNPRRWAKR MATPRRRPVR RLTIVAAASS LGVVISSFAA PLAAHASPAR SPVPHSRPGW 
LTHARDLGRA PATAAVQARI YLAPNGGLDA LAARALAVST PGSADYGHFL SPAEYFAEFG
TTDATVRAVT RWLTSSGLTV TGVEAHNRYL DITGSVAAAE RAFGVHIHAY SHDGQTVQAP
DDTLTTPADL ADDVLAVDGI DTTPNIVRPA LAAGAVPYRG APIPPPAGFR NAAPCSSYYG
EKLATDLPSF NGQALPYAVC GYVGAQFRSA YEGATTLDGT GITVGITDAY ASPTIAADAA
TYAVRNGDRP YAPGQLTQSL PRTFTNTKLC DASGWFGEET LDVEAVHAMA QGAKIRYYAS
SSCLDRDLLD AFARINDEAR VQIVSNSWGA VEQSEHRSTI LAYEQAFLQG AVEGISYVFS
SGDSGDEVAS SGTKQTDYPA SDPFVTAVGG TATAIDATGR LAWETGWGTY RYALAPDGTA
WSSSGTFLYG SGGGESTVFA QPIYQQGITP AGARGVPDVA MDADVTTGML VGQTQAFPDG
TYYDQYRIGG TSLAAPLFAG MTALTFQHAR GGVGLLNPTI YRNAGTGVFT DVTGPGPDPG
NVRVDYADGV DAAQGVVATV RTFNQDSSLA VGPGWDPVTG LGSANAGWLT AIPPRARR
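The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the amino-acid string used by NCBI for the standard code, which translation table 11 shares for all sense and stop codons. It is an illustration under two assumptions: the reading frame starts at the first base, and the special handling of alternative start codons in table 11 is not needed because this gene starts with ATG. The names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # also removes the block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the gene sequence, copied as displayed.
print(translate("ATGGGGAACC CAAGACGCTG GGCGAAAAGG"))  # MGNPRRWAKR
```

The output matches the first block of the protein sequence listed above.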