Gene Acel_1679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1679 
Symbol 
ID4486036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1888842 
End bp1891955 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content70% 
IMG OID639730468 
Productprotease-like 
Protein accessionYP_873437 
Protein GI117928886 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000116915 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCATGCCA CGCAATGGCG TCGTCGCGTG ACGACAATCG GGCTTCTTGT CGTCGTCCTG 
GGGTGGACGG CCCACGCCGC GAGCAGCGAT CATGCGGAGC AAACACAACT ACGGCGCGTT
GCGACGCCGC CGATCTCTGC CCTTGCCGTG AACGCCCAGC CCAGCGACGC CGCCGTCCCG
CTCGTTCTCG ATTCCGCCGC TCCCGCAGCG ATCGCGGGGC CGCCCCAGAT CCCGCAGCCG
GACACCGCAA ACGCATCGAC GTCCGGCCGA GTGCCGACAA CAAGACCGCT CGCCGGTGAC
ACGCCGGTCC AGGCGACCAT CGCGCTGCGG TCAGCCGTGG TGCCGTCAGC CATTGACGCC
GGCATCCACC GTTTGCGCAC CAACGGGCTC AGCGTGACGT TGCTCGGCGA TCCGCCGGTT
GCCCTGCTGG TGCATGGGAC CGCCCGCCAG GTCAACCGCG TGTTCCGCAC CTCGGTCGTG
AGCTACCGCG GGATCGCCGA TCAGGAGATT CTGACGTTCG CCATGCCGCC CGCGCTCCCG
GCCGACCTCG CCGCGGCGAC CGGGACGGCC TTCGTCCGTC GCGGCCAGGC GACGTCCGCC
CACCGGCTCA CCGCCGTCCC GGCCGCTTTC TCTACCGCCG GCGGCGCGGT TTCGCCGCAA
CCGTGCACCG CCGCAGCAAA CGCCGCCTCC GCCACCGGTG CGCACACGGC CGATCAGATC
GCCAGCCACT ACAACATCGG ACCGCTGTAC GCAGCTGCGC CCCAACGCCC AGTCACCGTC
GCGCTTGTCG AATTCGAGCC CTTCAATCCG GCTGACATTG CGGCATTCCA GCAGTGCTAC
GGCACGCACG CCACGGTGAC CACCGTCCAG GTGGACGGCG GAGCCGGTGC CGGAACGGGG
TCCGGCCGGG CGGCCACGGA CATCGAGATG GTGATCGCCG CCGCGCCGGA CGCCAATATC
GTCGTCTACC AGGCGCCCGG AGACACCGGA AGCGTCTACG ACACCTATGC GCGGATCGCC
GCCGACAACA CCGCCCAGGT GGTCGTCACC AGCTGGGGCA TCTGCGAACC GACGGCGACG
GTTTCTTCGC TTCCGACGCT GGAGCGGCCC CTCTTCGAGC AGATGGCACG CAATGGGCAG
ACCGTTCTCG CAGCAGCCGG TGACAGCGGA TCGGCCGCTT GCTACGCGCC GCCGTCAGCC
ACCGACACCT CCCTCGCCGT GCTCGACCCG GCGAGTCAAC CCACGATCAC CGCAGTAGGC
GGTACGTCGT TCGCCGGTGT CAGCGATCCA GACATCTCCT GGCACACGGC CGGCGGTGCC
GGCGGCGGGG GAATCTCCCA CATCTGGCCG ATGCCGCGCT ACCAAGCGGG CGCGACCACG
ACGCAGAATT CCCCGGCTCT GTGCAACGCG CCAACCGGAT CGGCGTGCCG GCAGGTGCCC
GACATCAGCA TGCTTGCCGA CCCGACGCAC GGATACGTGG CCTACGTCGG CGGCACGTGG
CGGGCGGTCG GCGGCACCGG AGCGGCAACT GCGACATTCG CCGGCATTCT CGCCCTGATC
GACGAAAGTT GCGTCGCGGG TCCGGTTGGA TTGATCAATC CGGCGTTGTA CCGGCTGGCC
GGCACCTCCG CGGTGGTGGA TGTCACCCAG GGCCCGAACA CCGACCTGAC CGGAACGAAC
GGCGGGGCGT ACCCGCCGGC GACCGGCGTG GACCTGGCCA CCGGGCTGGG CCGACCCGAT
GCCGCGGCCC TCGCCGCCGC ACTCTGCCCG CCGACAGGAG CCGCAGGCTC AGGCACGATC
ACGGTCGATC CGAACCTGGT CGTGACCAAC AGCTCGACGT CCCTCACCTT CCGCTACACG
CCGGCGAGCG GCACCGGGAT GGTCAACGGG GAACTCGACA TCACCGTGCC GGGAACGTGG
TCGCTGCCGA CAACCACTTC CGGTCAGCCG GGTTACACGA CCGCCGATGC AGGTGTTCTT
ACGGTCAGCG GCAACACCAT CGTGCTTCGA TCGATCACTC TGCCGGCGAA CTCGACGGTG
ACCGTGACGT TCGGTGACAC CAGCGGCGGC CCCGGCGCGC GGACGCCGTC CGCCGCGCAA
ATCACCACGT TCGCGACGGC CAGCGCGCCG GCGTCCGCCG GTGGAGCGGC CGGACTGGCC
CGCAATCCCG CTGTCCGGGT ACTCACCCCG GGTGGCAGCC AAGCCGGGCA GGGCACGTTG
CTGCGGATCG CAGGAGCCGA TCGTATCGGT ACCGCCATCG CGGCTTCCCA ATTGCGGTTC
ACCACCGGTG GAGCGAGCGC GGTCGTGCTT GCCCGGGCGG ACATTTTCCC CGACGCGCTC
GCCGGAGTGC CGTTGGCGGC GCAGGTGCAC GGGCCGCTTC TCCTCACCCC GCCGTCCTCG
TTACCGATTG CCGTGCTCAA TGAAATCCAG CGGGTTCTTC CGGTCGGCGG ACCGGTCTTC
CTCCTCGGCG GAACCGCAGC TCTCTCCGCA ACCGTCGAGC AGCAACTGGT GACGCTCGGC
TATCTGCCGC ACCGGATTTC CGGGATGGAT CGTTTTGATA CCGCCGTTCA AATCGCCCAC
GCTCTCGGGG ATCCGACAAC AATCCTCGAA TGCAGTGGTC TTGATTTTCC CGACGCGCTC
TCGGCTGGAC CTGCCGCGGT CATCACCCAC GGCGCGGTCC TGCTCACCGC CGGCCCAGAC
CAGGCGGCGG CCACTGCGGC GTACCTCACC GTCCACCCGC GCGTGACGCG GTACGCGATC
GGCGGACCGG CGGCCCACGC AGATCCAGGC GCCATACCAC TGGTCGGTGC GGATCGTTAC
GCGACGTCCG TGCTCGTCGC CCAGCAGTTC TTCACCGCGC CGTCCGGGAT TGGTCTCGCG
AGCGGTGCGG CGTTCCCTGA CGCCCTGGCC GGCGGCCCGG CGACAGCGGA GGCTGGGGGT
CCGTTGCTGC TCGTGCCGCC AAGCGGCGCA CTGCCCACCG GGACGGCGAA CTACTTCAGC
GCCGTCGCCA GCAGCGTGCT GACCGGTTGG CTCTTCGGCG GAACCGCTGC GGTCGGTACG
GATATCGCTT CCGAGACCGC TCAGGCGCTT GTCCTCGTCC CACCGCCAAG CTGA
 
Protein sequence
MHATQWRRRV TTIGLLVVVL GWTAHAASSD HAEQTQLRRV ATPPISALAV NAQPSDAAVP 
LVLDSAAPAA IAGPPQIPQP DTANASTSGR VPTTRPLAGD TPVQATIALR SAVVPSAIDA
GIHRLRTNGL SVTLLGDPPV ALLVHGTARQ VNRVFRTSVV SYRGIADQEI LTFAMPPALP
ADLAAATGTA FVRRGQATSA HRLTAVPAAF STAGGAVSPQ PCTAAANAAS ATGAHTADQI
ASHYNIGPLY AAAPQRPVTV ALVEFEPFNP ADIAAFQQCY GTHATVTTVQ VDGGAGAGTG
SGRAATDIEM VIAAAPDANI VVYQAPGDTG SVYDTYARIA ADNTAQVVVT SWGICEPTAT
VSSLPTLERP LFEQMARNGQ TVLAAAGDSG SAACYAPPSA TDTSLAVLDP ASQPTITAVG
GTSFAGVSDP DISWHTAGGA GGGGISHIWP MPRYQAGATT TQNSPALCNA PTGSACRQVP
DISMLADPTH GYVAYVGGTW RAVGGTGAAT ATFAGILALI DESCVAGPVG LINPALYRLA
GTSAVVDVTQ GPNTDLTGTN GGAYPPATGV DLATGLGRPD AAALAAALCP PTGAAGSGTI
TVDPNLVVTN SSTSLTFRYT PASGTGMVNG ELDITVPGTW SLPTTTSGQP GYTTADAGVL
TVSGNTIVLR SITLPANSTV TVTFGDTSGG PGARTPSAAQ ITTFATASAP ASAGGAAGLA
RNPAVRVLTP GGSQAGQGTL LRIAGADRIG TAIAASQLRF TTGGASAVVL ARADIFPDAL
AGVPLAAQVH GPLLLTPPSS LPIAVLNEIQ RVLPVGGPVF LLGGTAALSA TVEQQLVTLG
YLPHRISGMD RFDTAVQIAH ALGDPTTILE CSGLDFPDAL SAGPAAVITH GAVLLTAGPD
QAAATAAYLT VHPRVTRYAI GGPAAHADPG AIPLVGADRY ATSVLVAQQF FTAPSGIGLA
SGAAFPDALA GGPATAEAGG PLLLVPPSGA LPTGTANYFS AVASSVLTGW LFGGTAAVGT
DIASETAQAL VLVPPPS