Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0204 |
Symbol | |
ID | 4485290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 217339 |
End bp | 219303 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639728967 |
Product | Mername-AA223 peptidase |
Protein accession | YP_871964 |
Protein GI | 117927413 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.176031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.159439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCATAC TGCTGCTCGT GCTGTTCCGC GGCCTGTGGA GCGGCGGCAG CGACTACAAG TCGGTGGACA CCTCCCAGGT GGTCGCCGCC GTCGACACCG GTCAGATCAC AAATGCGATC GTCCACGACA AAGAGCAGAC CATCCAGGTC ACGCTGAAGG ATGGCTCCAA GCTGCAGGCG TCTTTTGCGA CTGATCAAGA GGCCGCGCTC GTCAGCCAAT TGCAGAAGCT TGTCGACCAA GGGAAGCTGC CGGCCGACGG CGGATACACC GTCAAGGTCA GCCACGGCAA CATCTTCCTG GAGATCCTCC TCAACGCCCT GCCGATTCTG CTGCTTGTCG GCCTCATGGT CTTCATGCTG TCGCAGATGC AAGGCGGCGG TTCCCGGGTG ATGAATTTCG GCAAGTCGCG GGCCAAGCTC ATCACGAAGG ACACCCCGAA GACGACCTTC GCGGACGTCG CCGGCGCGGA TGAGGCCATC GAAGAACTCA TGGAAATCAA AGAGTTCCTC GAGAATCCAG CGAAATTCCA GGCGATCGGC GCGAAGATTC CGAAGGGTGT CTTGCTGTAC GGCCCGCCTG GTACCGGGAA GACCCTTCTC GCCCGGGCGG TTGCCGGTGA GGCCGGCGTC CCCTTCTACT CGATTTCCGG GTCTGACTTC GTTGAAATGT TCGTCGGAGT CGGAGCGTCA CGGGTCCGGG ACCTCTTCCA GCAGGCCAAG GAGAACGCGC CCGCGATTGT CTTCATCGAC GAAATTGACG CGGTCGGGCG CCACCGCGGC GCCGGTCTCG GCGGTGGGCA TGACGAACGC GAACAGACAC TGAATCAGCT TCTGGTCGAG ATGGACGGAT TCGACGTCAA GGGCGGGGTC ATTCTGATCG CCGCGACGAA CCGCCCGGAC ATCCTCGACC CGGCCCTCTT GCGGCCCGGC CGTTTCGACC GGCACATCGT CGTTGACCGT CCCGATTTGG AAGGCCGCAA AGGCATTCTG CGGGTGCACG CGAAGGGCAA GCCGTTCGCT CCGGACGTCG ACCTGGACGT CATCGCCCGC CGCACCCCCG GCTTCACCGG AGCCGACCTG GCGAACGTCA TTAACGAGGC GGCGTTGCTC ACGGCGCGGG CCAATCAGAA GCAGATCACC ATGGCCACGC TGGAAGAGAG CATCGACCGC GTCATGGCCG GTCCCGAGCG CAAGAGCCGG ATAATGTCCG ACAAAGAAAA GAAAATCATC GCGTATCACG AGGGGGGTCA CGCTCTCGTC GGCCATGCCT TGCCGAACGC CGACCCGGTG CACAAGGTCA CCATTCTGCC GCGCGGCCGC GCGCTGGGTT ACACCTTGGC ATTGCCGACG GAGGACAAAT TCCTTGTGAC GCGGGCCGAA CTCATGGATC AGCTCGCGAT GCTCCTCGGC GGTCGGACCG CGGAAGAACT GGTTTTCCAC GAACCGACGA CCGGCGCCGC CAATGACATC GAGAAAGCGA CCGCCATCGC GCGGAACATG GTCACCCAGT ACGGGATGAG CGAACGGCTC GGTGCGCGGA AATTCGGGCA ATCCGACGGG GAGGTTTTCC TCGGCCGCGA GATGGGCCAC CAGCGCGACT ACTCCGAGGA GGTCGCCGCG ACCATCGACG AGGAGGTACG CCGGCTCATT GAGAACGCGC ATGACGAAGC GTGGGAAATC CTCGTCGAAT ACCGCGACGT GCTCGACGCG CTCGTCCTCG AACTGATGGA GAAAGAGACC CTGCAGAAGG AGGAAGTGCT GCGGATTTTC GCCCCGGTGC GCAAACGTCC GGCCCGCGGC ACCTACACCG GGTACGGCAA GCGCCTGCCG TCCGACAAGG CGCCGGTCCT GACACCGCGC GAACTCGCCC TGCTGGCCGG GGAATCCGGT GACGGCGCGG GCGGCACGAA CGGGAGCCGG GCCGCCAAAC CGGCGTCCGG CGAGAACCAA CCGTCCGGAA GCTGA
|
Protein sequence | MIILLLVLFR GLWSGGSDYK SVDTSQVVAA VDTGQITNAI VHDKEQTIQV TLKDGSKLQA SFATDQEAAL VSQLQKLVDQ GKLPADGGYT VKVSHGNIFL EILLNALPIL LLVGLMVFML SQMQGGGSRV MNFGKSRAKL ITKDTPKTTF ADVAGADEAI EELMEIKEFL ENPAKFQAIG AKIPKGVLLY GPPGTGKTLL ARAVAGEAGV PFYSISGSDF VEMFVGVGAS RVRDLFQQAK ENAPAIVFID EIDAVGRHRG AGLGGGHDER EQTLNQLLVE MDGFDVKGGV ILIAATNRPD ILDPALLRPG RFDRHIVVDR PDLEGRKGIL RVHAKGKPFA PDVDLDVIAR RTPGFTGADL ANVINEAALL TARANQKQIT MATLEESIDR VMAGPERKSR IMSDKEKKII AYHEGGHALV GHALPNADPV HKVTILPRGR ALGYTLALPT EDKFLVTRAE LMDQLAMLLG GRTAEELVFH EPTTGAANDI EKATAIARNM VTQYGMSERL GARKFGQSDG EVFLGREMGH QRDYSEEVAA TIDEEVRRLI ENAHDEAWEI LVEYRDVLDA LVLELMEKET LQKEEVLRIF APVRKRPARG TYTGYGKRLP SDKAPVLTPR ELALLAGESG DGAGGTNGSR AAKPASGENQ PSGS
|
| |