Gene Information |
Locus tag | Acel_1679 |
Symbol | |
ID | 4486036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1888842 |
End bp | 1891955 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639730468 |
Product | protease-like |
Protein accession | YP_873437 |
Protein GI | 117928886 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
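
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the stop codon is counted in Gene Length but not in Protein Length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 1888842
END_BP = 1891955
GENE_LENGTH_BP = 3114
PROTEIN_LENGTH_AA = 1037


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                           # 3114
print(expected_protein_length(length))  # 1037
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.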
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000000116915 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCATGCCA CGCAATGGCG TCGTCGCGTG ACGACAATCG GGCTTCTTGT CGTCGTCCTG GGGTGGACGG CCCACGCCGC GAGCAGCGAT CATGCGGAGC AAACACAACT ACGGCGCGTT GCGACGCCGC CGATCTCTGC CCTTGCCGTG AACGCCCAGC CCAGCGACGC CGCCGTCCCG CTCGTTCTCG ATTCCGCCGC TCCCGCAGCG ATCGCGGGGC CGCCCCAGAT CCCGCAGCCG GACACCGCAA ACGCATCGAC GTCCGGCCGA GTGCCGACAA CAAGACCGCT CGCCGGTGAC ACGCCGGTCC AGGCGACCAT CGCGCTGCGG TCAGCCGTGG TGCCGTCAGC CATTGACGCC GGCATCCACC GTTTGCGCAC CAACGGGCTC AGCGTGACGT TGCTCGGCGA TCCGCCGGTT GCCCTGCTGG TGCATGGGAC CGCCCGCCAG GTCAACCGCG TGTTCCGCAC CTCGGTCGTG AGCTACCGCG GGATCGCCGA TCAGGAGATT CTGACGTTCG CCATGCCGCC CGCGCTCCCG GCCGACCTCG CCGCGGCGAC CGGGACGGCC TTCGTCCGTC GCGGCCAGGC GACGTCCGCC CACCGGCTCA CCGCCGTCCC GGCCGCTTTC TCTACCGCCG GCGGCGCGGT TTCGCCGCAA CCGTGCACCG CCGCAGCAAA CGCCGCCTCC GCCACCGGTG CGCACACGGC CGATCAGATC GCCAGCCACT ACAACATCGG ACCGCTGTAC GCAGCTGCGC CCCAACGCCC AGTCACCGTC GCGCTTGTCG AATTCGAGCC CTTCAATCCG GCTGACATTG CGGCATTCCA GCAGTGCTAC GGCACGCACG CCACGGTGAC CACCGTCCAG GTGGACGGCG GAGCCGGTGC CGGAACGGGG TCCGGCCGGG CGGCCACGGA CATCGAGATG GTGATCGCCG CCGCGCCGGA CGCCAATATC GTCGTCTACC AGGCGCCCGG AGACACCGGA AGCGTCTACG ACACCTATGC GCGGATCGCC GCCGACAACA CCGCCCAGGT GGTCGTCACC AGCTGGGGCA TCTGCGAACC GACGGCGACG GTTTCTTCGC TTCCGACGCT GGAGCGGCCC CTCTTCGAGC AGATGGCACG CAATGGGCAG ACCGTTCTCG CAGCAGCCGG TGACAGCGGA TCGGCCGCTT GCTACGCGCC GCCGTCAGCC ACCGACACCT CCCTCGCCGT GCTCGACCCG GCGAGTCAAC CCACGATCAC CGCAGTAGGC GGTACGTCGT TCGCCGGTGT CAGCGATCCA GACATCTCCT GGCACACGGC CGGCGGTGCC GGCGGCGGGG GAATCTCCCA CATCTGGCCG ATGCCGCGCT ACCAAGCGGG CGCGACCACG ACGCAGAATT CCCCGGCTCT GTGCAACGCG CCAACCGGAT CGGCGTGCCG GCAGGTGCCC GACATCAGCA TGCTTGCCGA CCCGACGCAC GGATACGTGG CCTACGTCGG CGGCACGTGG CGGGCGGTCG GCGGCACCGG AGCGGCAACT GCGACATTCG CCGGCATTCT CGCCCTGATC GACGAAAGTT GCGTCGCGGG TCCGGTTGGA TTGATCAATC CGGCGTTGTA CCGGCTGGCC GGCACCTCCG CGGTGGTGGA TGTCACCCAG GGCCCGAACA CCGACCTGAC CGGAACGAAC GGCGGGGCGT ACCCGCCGGC GACCGGCGTG GACCTGGCCA CCGGGCTGGG CCGACCCGAT GCCGCGGCCC TCGCCGCCGC ACTCTGCCCG CCGACAGGAG CCGCAGGCTC AGGCACGATC 
ACGGTCGATC CGAACCTGGT CGTGACCAAC AGCTCGACGT CCCTCACCTT CCGCTACACG CCGGCGAGCG GCACCGGGAT GGTCAACGGG GAACTCGACA TCACCGTGCC GGGAACGTGG TCGCTGCCGA CAACCACTTC CGGTCAGCCG GGTTACACGA CCGCCGATGC AGGTGTTCTT ACGGTCAGCG GCAACACCAT CGTGCTTCGA TCGATCACTC TGCCGGCGAA CTCGACGGTG ACCGTGACGT TCGGTGACAC CAGCGGCGGC CCCGGCGCGC GGACGCCGTC CGCCGCGCAA ATCACCACGT TCGCGACGGC CAGCGCGCCG GCGTCCGCCG GTGGAGCGGC CGGACTGGCC CGCAATCCCG CTGTCCGGGT ACTCACCCCG GGTGGCAGCC AAGCCGGGCA GGGCACGTTG CTGCGGATCG CAGGAGCCGA TCGTATCGGT ACCGCCATCG CGGCTTCCCA ATTGCGGTTC ACCACCGGTG GAGCGAGCGC GGTCGTGCTT GCCCGGGCGG ACATTTTCCC CGACGCGCTC GCCGGAGTGC CGTTGGCGGC GCAGGTGCAC GGGCCGCTTC TCCTCACCCC GCCGTCCTCG TTACCGATTG CCGTGCTCAA TGAAATCCAG CGGGTTCTTC CGGTCGGCGG ACCGGTCTTC CTCCTCGGCG GAACCGCAGC TCTCTCCGCA ACCGTCGAGC AGCAACTGGT GACGCTCGGC TATCTGCCGC ACCGGATTTC CGGGATGGAT CGTTTTGATA CCGCCGTTCA AATCGCCCAC GCTCTCGGGG ATCCGACAAC AATCCTCGAA TGCAGTGGTC TTGATTTTCC CGACGCGCTC TCGGCTGGAC CTGCCGCGGT CATCACCCAC GGCGCGGTCC TGCTCACCGC CGGCCCAGAC CAGGCGGCGG CCACTGCGGC GTACCTCACC GTCCACCCGC GCGTGACGCG GTACGCGATC GGCGGACCGG CGGCCCACGC AGATCCAGGC GCCATACCAC TGGTCGGTGC GGATCGTTAC GCGACGTCCG TGCTCGTCGC CCAGCAGTTC TTCACCGCGC CGTCCGGGAT TGGTCTCGCG AGCGGTGCGG CGTTCCCTGA CGCCCTGGCC GGCGGCCCGG CGACAGCGGA GGCTGGGGGT CCGTTGCTGC TCGTGCCGCC AAGCGGCGCA CTGCCCACCG GGACGGCGAA CTACTTCAGC GCCGTCGCCA GCAGCGTGCT GACCGGTTGG CTCTTCGGCG GAACCGCTGC GGTCGGTACG GATATCGCTT CCGAGACCGC TCAGGCGCTT GTCCTCGTCC CACCGCCAAG CTGA
|
Protein sequence | MHATQWRRRV TTIGLLVVVL GWTAHAASSD HAEQTQLRRV ATPPISALAV NAQPSDAAVP LVLDSAAPAA IAGPPQIPQP DTANASTSGR VPTTRPLAGD TPVQATIALR SAVVPSAIDA GIHRLRTNGL SVTLLGDPPV ALLVHGTARQ VNRVFRTSVV SYRGIADQEI LTFAMPPALP ADLAAATGTA FVRRGQATSA HRLTAVPAAF STAGGAVSPQ PCTAAANAAS ATGAHTADQI ASHYNIGPLY AAAPQRPVTV ALVEFEPFNP ADIAAFQQCY GTHATVTTVQ VDGGAGAGTG SGRAATDIEM VIAAAPDANI VVYQAPGDTG SVYDTYARIA ADNTAQVVVT SWGICEPTAT VSSLPTLERP LFEQMARNGQ TVLAAAGDSG SAACYAPPSA TDTSLAVLDP ASQPTITAVG GTSFAGVSDP DISWHTAGGA GGGGISHIWP MPRYQAGATT TQNSPALCNA PTGSACRQVP DISMLADPTH GYVAYVGGTW RAVGGTGAAT ATFAGILALI DESCVAGPVG LINPALYRLA GTSAVVDVTQ GPNTDLTGTN GGAYPPATGV DLATGLGRPD AAALAAALCP PTGAAGSGTI TVDPNLVVTN SSTSLTFRYT PASGTGMVNG ELDITVPGTW SLPTTTSGQP GYTTADAGVL TVSGNTIVLR SITLPANSTV TVTFGDTSGG PGARTPSAAQ ITTFATASAP ASAGGAAGLA RNPAVRVLTP GGSQAGQGTL LRIAGADRIG TAIAASQLRF TTGGASAVVL ARADIFPDAL AGVPLAAQVH GPLLLTPPSS LPIAVLNEIQ RVLPVGGPVF LLGGTAALSA TVEQQLVTLG YLPHRISGMD RFDTAVQIAH ALGDPTTILE CSGLDFPDAL SAGPAAVITH GAVLLTAGPD QAAATAAYLT VHPRVTRYAI GGPAAHADPG AIPLVGADRY ATSVLVAQQF FTAPSGIGLA SGAAFPDALA GGPATAEAGG PLLLVPPSGA LPTGTANYFS AVASSVLTGW LFGGTAAVGT DIASETAQAL VLVPPPS
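
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted string could be cleaned and its GC percentage computed. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace between blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "GTGCATGCCA CGCAATGGCG"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 65.0
```

Applying the same two functions to the full gene sequence would allow a comparison against the GC content field. Note that the gene begins with GTG, an alternative start codon under translation table 11 that is read as methionine at initiation, which matches the leading M of the protein sequence.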