Gene Information |
Locus tag | Acel_1604 |
Symbol | |
ID | 4486508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1804686 |
End bp | 1806857 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639730390 |
Product | prolyl oligopeptidase |
Protein accession | YP_873362 |
Protein GI | 117928811 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.115584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCCA TCGACCCGGC TCGTTCGGCG ACCGTCCCGC TCAGCTATCC GCCGTCCCGG CGACTTGACC TCGTCGAGGT CCTGCCGGCC GCCAATCCGA CCCACCCGGT CGCCGATCCG TACCGGTGGC TGGAGAACGC CGACGATCCG GAGGTCCGAA CCTGGATCGA CGCCCAACAC GCCTTGTGCC GCAGGTATCT CGACGCTCTG CCCGGGCGGG ATCGGCTGCG CCGCCGGCTC ACCGAACTGC TGGGCGCGGG TGTGGTGAGC GCGCCCGTCT GGCGCCGGGG TCGGCAATTC TTCCTCCGTC GCCAGGCCGA TCAGGAGCAC GCCGTGCTCT TCACCGTCGA TCCGGACGGT ACGGAACGGG TTCTCCTCGA TCCGATGGCG GTTGACCCGA CCGGCAGGAC GACGCTCGAC ACCTGGCAGC CGTCGAAGGA AGGGCGTTTC CTCGCCTACC AGCTGTCGAC CGGCGGGGAC GAGGAGTCCG TGCTGCGCGT CATGGATGTC GAAACCGGGG AAATCGTCGA TGGCCCGATC GACCGATGCC GGTATTCGCC GGTCGGGTGG TTGCCCGGCG GCGCGGCTTT TTATTACGTC CGGCGGTTGC CGCCCGGTGA CGTGCCGCCG GACGAGACCG CGTTCCACCG GAGAGTCTGG CTGCACCGTC TGGGCACGAG CGCGGACGAC GACGTGCTCA TTTTCGGCGA CGGCATGGAC AAAACGACGT TCTTCTCCGC ATCGGTCAGT CTGGACGGCC GCTGGCTCAT TGTGTCGGCA AGCCCCGGCA CCGCACCGCG CAATGACGTC TGGATCGCGG ATTTGTCGGA CGGCGATCCT GCGGCACCGG TCCTCCGGCC GGTTCAGGTC GGCGTGGACG CGCAGCTCGT GGTGCACATC GGACGGGACG GCCGGGCCTA CCTGTACACC GACCGCGACG CGCCCCGCGG CCGGCTGGCG GTCGCCGATC CGACGGAGTT GCCGGCCGAG AAATGGCGAG ACCTGCTGCC GGAGGATCCG GAAGCGGTGC TGGTCGATTA CGCCATCCTG GATGGCCCGC AACTGGATCG ACCGGTCCTG GTGGCGGCAT GGACACGGCA CGCGTTGAGT GAGCTCTCGG TCCACGATCT GGAGACTGGT GAACGGCTCG GCGCCGTCAC CTTGCCCGGA CTCGGCACGG TGACCGGCCT TTCCGAACAA CCGGAGGGCG GCCACCAGTG CTGGTTCGGG TACACCGATT ACGCCACGCC GCCAAGCGTT TTCTGTTTTG ACGCGCTGAC GAATGCCACC ACGGTATGGG CTCGGCCTCC CGGTCAGGCG CCCGTACCCC CGGTGCATAC GACGCAGGTG GTGTACGAAT CGCGCGACGG CACGCCGGTT CGCATGATGC TCATTGCGCC GCCGGTCGAA CCGGCCCGCC CGCGCCCGAC AATCCTCACC GGGTACGGCG GATTCGGCAC GTCCCTCACC CCGGGCTATT CGGCGGGAAT TCTGGCCTGG GTCGAAGCCG GCGGCGTGTA CGCCGTCGCG AACCTGCGCG GCGGCGGGGA AGAGGGCGAG CAGTGGCATC GCGCCGGAAT GCGCGGAAAT AAACAGAATG TCTTCGACGA TTTCCACGCC GCCGCCGACT GGCTGATTGC CAACGGGTGG ACGACGCCCG GCCAGCTCGG CATTTCCGGC GGGAGCAACG GCGGACTGCT CGTCGGCGCG GCAATGACCC AGGCTCCGGA AAAATATGCC GCGGTCGTCT GCTCCGCACC GTTGCTCGAC ATGGCGCGGT ACGAGAAATT CGGGCTCGGT 
CCGTTGTGGC GGGAAGAATA CGGCACCGCT GAGAATCCCG AGGAATTAGC GGTATTGCTC GCGTATTCCC CGTATCACAA CATGCGCCCG GGAACGCCGT ACCCGGCGGT GCTCTTCACC GTCTTCGATT CCGATACCCG GGTCGACCCG ATGCACGCCC GCAAAATGTG CGCCGCGCTG CAAGCGGCGT CCACGTCCGG CAAGCCGGTG CTGCTGCGCC GGGAGTCGGA CGTCGGGCAC GGGGCGCGGG CCCTCAGCCG CAGCATCGAG CTGTCTGTCG ACACCTTGGC ATTCCTTGCC GCGCACACCG GCCTCGACCT CGAACAGCCC GACAGGACCG CCGAACAGCC CGACCGGACC GCCGGAGGGT GA
|
Protein sequence | MASIDPARSA TVPLSYPPSR RLDLVEVLPA ANPTHPVADP YRWLENADDP EVRTWIDAQH ALCRRYLDAL PGRDRLRRRL TELLGAGVVS APVWRRGRQF FLRRQADQEH AVLFTVDPDG TERVLLDPMA VDPTGRTTLD TWQPSKEGRF LAYQLSTGGD EESVLRVMDV ETGEIVDGPI DRCRYSPVGW LPGGAAFYYV RRLPPGDVPP DETAFHRRVW LHRLGTSADD DVLIFGDGMD KTTFFSASVS LDGRWLIVSA SPGTAPRNDV WIADLSDGDP AAPVLRPVQV GVDAQLVVHI GRDGRAYLYT DRDAPRGRLA VADPTELPAE KWRDLLPEDP EAVLVDYAIL DGPQLDRPVL VAAWTRHALS ELSVHDLETG ERLGAVTLPG LGTVTGLSEQ PEGGHQCWFG YTDYATPPSV FCFDALTNAT TVWARPPGQA PVPPVHTTQV VYESRDGTPV RMMLIAPPVE PARPRPTILT GYGGFGTSLT PGYSAGILAW VEAGGVYAVA NLRGGGEEGE QWHRAGMRGN KQNVFDDFHA AADWLIANGW TTPGQLGISG GSNGGLLVGA AMTQAPEKYA AVVCSAPLLD MARYEKFGLG PLWREEYGTA ENPEELAVLL AYSPYHNMRP GTPYPAVLFT VFDSDTRVDP MHARKMCAAL QAASTSGKPV LLRRESDVGH GARALSRSIE LSVDTLAFLA AHTGLDLEQP DRTAEQPDRT AGG
|
| |
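The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `clean_sequence`) are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1804686
END_BP = 1806857


def gene_length(start_bp: int, end_bp: int) -> int:
    """Gene length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count for an unspliced CDS.

    Assumes the last codon is a stop codon, so it is subtracted
    from the total number of codons.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into 10-letter blocks."""
    return "".join(raw.split())


length = gene_length(START_BP, END_BP)
print(length)                  # 2172
print(protein_length(length))  # 723
```

Both results agree with the Gene Length (2172 bp) and Protein Length (723 aa) fields. `clean_sequence` can be applied to the Gene sequence or Protein sequence field to obtain a contiguous string before further analysis.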