Gene Acel_0204 details


Gene Information

Locus tag: Acel_0204
Symbol:
ID: 4485290
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 217339
End bp: 219303
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 64%
IMG OID: 639728967
Product: Mername-AA223 peptidase
Protein accession: YP_871964
Protein GI: 117927413
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
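
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 217339
end_bp = 219303
gene_length_bp = 1965
protein_length_aa = 654

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1965 bases form 655 codons, and 655 minus the stop codon gives 654 residues.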


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.176031
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.159439
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCATAC TGCTGCTCGT GCTGTTCCGC GGCCTGTGGA GCGGCGGCAG CGACTACAAG 
TCGGTGGACA CCTCCCAGGT GGTCGCCGCC GTCGACACCG GTCAGATCAC AAATGCGATC
GTCCACGACA AAGAGCAGAC CATCCAGGTC ACGCTGAAGG ATGGCTCCAA GCTGCAGGCG
TCTTTTGCGA CTGATCAAGA GGCCGCGCTC GTCAGCCAAT TGCAGAAGCT TGTCGACCAA
GGGAAGCTGC CGGCCGACGG CGGATACACC GTCAAGGTCA GCCACGGCAA CATCTTCCTG
GAGATCCTCC TCAACGCCCT GCCGATTCTG CTGCTTGTCG GCCTCATGGT CTTCATGCTG
TCGCAGATGC AAGGCGGCGG TTCCCGGGTG ATGAATTTCG GCAAGTCGCG GGCCAAGCTC
ATCACGAAGG ACACCCCGAA GACGACCTTC GCGGACGTCG CCGGCGCGGA TGAGGCCATC
GAAGAACTCA TGGAAATCAA AGAGTTCCTC GAGAATCCAG CGAAATTCCA GGCGATCGGC
GCGAAGATTC CGAAGGGTGT CTTGCTGTAC GGCCCGCCTG GTACCGGGAA GACCCTTCTC
GCCCGGGCGG TTGCCGGTGA GGCCGGCGTC CCCTTCTACT CGATTTCCGG GTCTGACTTC
GTTGAAATGT TCGTCGGAGT CGGAGCGTCA CGGGTCCGGG ACCTCTTCCA GCAGGCCAAG
GAGAACGCGC CCGCGATTGT CTTCATCGAC GAAATTGACG CGGTCGGGCG CCACCGCGGC
GCCGGTCTCG GCGGTGGGCA TGACGAACGC GAACAGACAC TGAATCAGCT TCTGGTCGAG
ATGGACGGAT TCGACGTCAA GGGCGGGGTC ATTCTGATCG CCGCGACGAA CCGCCCGGAC
ATCCTCGACC CGGCCCTCTT GCGGCCCGGC CGTTTCGACC GGCACATCGT CGTTGACCGT
CCCGATTTGG AAGGCCGCAA AGGCATTCTG CGGGTGCACG CGAAGGGCAA GCCGTTCGCT
CCGGACGTCG ACCTGGACGT CATCGCCCGC CGCACCCCCG GCTTCACCGG AGCCGACCTG
GCGAACGTCA TTAACGAGGC GGCGTTGCTC ACGGCGCGGG CCAATCAGAA GCAGATCACC
ATGGCCACGC TGGAAGAGAG CATCGACCGC GTCATGGCCG GTCCCGAGCG CAAGAGCCGG
ATAATGTCCG ACAAAGAAAA GAAAATCATC GCGTATCACG AGGGGGGTCA CGCTCTCGTC
GGCCATGCCT TGCCGAACGC CGACCCGGTG CACAAGGTCA CCATTCTGCC GCGCGGCCGC
GCGCTGGGTT ACACCTTGGC ATTGCCGACG GAGGACAAAT TCCTTGTGAC GCGGGCCGAA
CTCATGGATC AGCTCGCGAT GCTCCTCGGC GGTCGGACCG CGGAAGAACT GGTTTTCCAC
GAACCGACGA CCGGCGCCGC CAATGACATC GAGAAAGCGA CCGCCATCGC GCGGAACATG
GTCACCCAGT ACGGGATGAG CGAACGGCTC GGTGCGCGGA AATTCGGGCA ATCCGACGGG
GAGGTTTTCC TCGGCCGCGA GATGGGCCAC CAGCGCGACT ACTCCGAGGA GGTCGCCGCG
ACCATCGACG AGGAGGTACG CCGGCTCATT GAGAACGCGC ATGACGAAGC GTGGGAAATC
CTCGTCGAAT ACCGCGACGT GCTCGACGCG CTCGTCCTCG AACTGATGGA GAAAGAGACC
CTGCAGAAGG AGGAAGTGCT GCGGATTTTC GCCCCGGTGC GCAAACGTCC GGCCCGCGGC
ACCTACACCG GGTACGGCAA GCGCCTGCCG TCCGACAAGG CGCCGGTCCT GACACCGCGC
GAACTCGCCC TGCTGGCCGG GGAATCCGGT GACGGCGCGG GCGGCACGAA CGGGAGCCGG
GCCGCCAAAC CGGCGTCCGG CGAGAACCAA CCGTCCGGAA GCTGA
 
Protein sequence
MIILLLVLFR GLWSGGSDYK SVDTSQVVAA VDTGQITNAI VHDKEQTIQV TLKDGSKLQA 
SFATDQEAAL VSQLQKLVDQ GKLPADGGYT VKVSHGNIFL EILLNALPIL LLVGLMVFML
SQMQGGGSRV MNFGKSRAKL ITKDTPKTTF ADVAGADEAI EELMEIKEFL ENPAKFQAIG
AKIPKGVLLY GPPGTGKTLL ARAVAGEAGV PFYSISGSDF VEMFVGVGAS RVRDLFQQAK
ENAPAIVFID EIDAVGRHRG AGLGGGHDER EQTLNQLLVE MDGFDVKGGV ILIAATNRPD
ILDPALLRPG RFDRHIVVDR PDLEGRKGIL RVHAKGKPFA PDVDLDVIAR RTPGFTGADL
ANVINEAALL TARANQKQIT MATLEESIDR VMAGPERKSR IMSDKEKKII AYHEGGHALV
GHALPNADPV HKVTILPRGR ALGYTLALPT EDKFLVTRAE LMDQLAMLLG GRTAEELVFH
EPTTGAANDI EKATAIARNM VTQYGMSERL GARKFGQSDG EVFLGREMGH QRDYSEEVAA
TIDEEVRRLI ENAHDEAWEI LVEYRDVLDA LVLELMEKET LQKEEVLRIF APVRKRPARG
TYTGYGKRLP SDKAPVLTPR ELALLAGESG DGAGGTNGSR AAKPASGENQ PSGS
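
The relationship between the two sequence blocks can be illustrated with a short script. This is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the translation assumes that the first codon (GTG here) is read as Met when it initiates translation, as translation table 11 permits for bacterial start codons. Only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon-to-amino-acid map in NCBI order (bases T, C, A, G). Translation
# table 11 assigns the same amino acids as the standard code; it differs
# in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Join a sequence block printed in 10-character groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at '*'."""
    # Assumption: the first codon is a valid start codon (e.g. GTG),
    # which is read as Met rather than its usual amino acid.
    protein = ["M"]
    for i in range(3, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGATCATAC TGCTGCTCGT GCTGTTCCGC GGCCTGTGGA GCGGCGGCAG CGACTACAAG"
print(translate(clean(first_line)))  # MIILLLVLFRGLWSGGSDYK
```

The printed residues match the first two groups of the protein sequence (MIILLLVLFR GLWSGGSDYK). Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.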