Gene Acel_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1604 
Symbol 
ID4486508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1804686 
End bp1806857 
Gene Length2172 bp 
Protein Length723 aa 
Translation table11 
GC content68% 
IMG OID639730390 
Productprolyl oligopeptidase 
Protein accessionYP_873362 
Protein GI117928811 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.115584 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCCA TCGACCCGGC TCGTTCGGCG ACCGTCCCGC TCAGCTATCC GCCGTCCCGG 
CGACTTGACC TCGTCGAGGT CCTGCCGGCC GCCAATCCGA CCCACCCGGT CGCCGATCCG
TACCGGTGGC TGGAGAACGC CGACGATCCG GAGGTCCGAA CCTGGATCGA CGCCCAACAC
GCCTTGTGCC GCAGGTATCT CGACGCTCTG CCCGGGCGGG ATCGGCTGCG CCGCCGGCTC
ACCGAACTGC TGGGCGCGGG TGTGGTGAGC GCGCCCGTCT GGCGCCGGGG TCGGCAATTC
TTCCTCCGTC GCCAGGCCGA TCAGGAGCAC GCCGTGCTCT TCACCGTCGA TCCGGACGGT
ACGGAACGGG TTCTCCTCGA TCCGATGGCG GTTGACCCGA CCGGCAGGAC GACGCTCGAC
ACCTGGCAGC CGTCGAAGGA AGGGCGTTTC CTCGCCTACC AGCTGTCGAC CGGCGGGGAC
GAGGAGTCCG TGCTGCGCGT CATGGATGTC GAAACCGGGG AAATCGTCGA TGGCCCGATC
GACCGATGCC GGTATTCGCC GGTCGGGTGG TTGCCCGGCG GCGCGGCTTT TTATTACGTC
CGGCGGTTGC CGCCCGGTGA CGTGCCGCCG GACGAGACCG CGTTCCACCG GAGAGTCTGG
CTGCACCGTC TGGGCACGAG CGCGGACGAC GACGTGCTCA TTTTCGGCGA CGGCATGGAC
AAAACGACGT TCTTCTCCGC ATCGGTCAGT CTGGACGGCC GCTGGCTCAT TGTGTCGGCA
AGCCCCGGCA CCGCACCGCG CAATGACGTC TGGATCGCGG ATTTGTCGGA CGGCGATCCT
GCGGCACCGG TCCTCCGGCC GGTTCAGGTC GGCGTGGACG CGCAGCTCGT GGTGCACATC
GGACGGGACG GCCGGGCCTA CCTGTACACC GACCGCGACG CGCCCCGCGG CCGGCTGGCG
GTCGCCGATC CGACGGAGTT GCCGGCCGAG AAATGGCGAG ACCTGCTGCC GGAGGATCCG
GAAGCGGTGC TGGTCGATTA CGCCATCCTG GATGGCCCGC AACTGGATCG ACCGGTCCTG
GTGGCGGCAT GGACACGGCA CGCGTTGAGT GAGCTCTCGG TCCACGATCT GGAGACTGGT
GAACGGCTCG GCGCCGTCAC CTTGCCCGGA CTCGGCACGG TGACCGGCCT TTCCGAACAA
CCGGAGGGCG GCCACCAGTG CTGGTTCGGG TACACCGATT ACGCCACGCC GCCAAGCGTT
TTCTGTTTTG ACGCGCTGAC GAATGCCACC ACGGTATGGG CTCGGCCTCC CGGTCAGGCG
CCCGTACCCC CGGTGCATAC GACGCAGGTG GTGTACGAAT CGCGCGACGG CACGCCGGTT
CGCATGATGC TCATTGCGCC GCCGGTCGAA CCGGCCCGCC CGCGCCCGAC AATCCTCACC
GGGTACGGCG GATTCGGCAC GTCCCTCACC CCGGGCTATT CGGCGGGAAT TCTGGCCTGG
GTCGAAGCCG GCGGCGTGTA CGCCGTCGCG AACCTGCGCG GCGGCGGGGA AGAGGGCGAG
CAGTGGCATC GCGCCGGAAT GCGCGGAAAT AAACAGAATG TCTTCGACGA TTTCCACGCC
GCCGCCGACT GGCTGATTGC CAACGGGTGG ACGACGCCCG GCCAGCTCGG CATTTCCGGC
GGGAGCAACG GCGGACTGCT CGTCGGCGCG GCAATGACCC AGGCTCCGGA AAAATATGCC
GCGGTCGTCT GCTCCGCACC GTTGCTCGAC ATGGCGCGGT ACGAGAAATT CGGGCTCGGT
CCGTTGTGGC GGGAAGAATA CGGCACCGCT GAGAATCCCG AGGAATTAGC GGTATTGCTC
GCGTATTCCC CGTATCACAA CATGCGCCCG GGAACGCCGT ACCCGGCGGT GCTCTTCACC
GTCTTCGATT CCGATACCCG GGTCGACCCG ATGCACGCCC GCAAAATGTG CGCCGCGCTG
CAAGCGGCGT CCACGTCCGG CAAGCCGGTG CTGCTGCGCC GGGAGTCGGA CGTCGGGCAC
GGGGCGCGGG CCCTCAGCCG CAGCATCGAG CTGTCTGTCG ACACCTTGGC ATTCCTTGCC
GCGCACACCG GCCTCGACCT CGAACAGCCC GACAGGACCG CCGAACAGCC CGACCGGACC
GCCGGAGGGT GA
 
Protein sequence
MASIDPARSA TVPLSYPPSR RLDLVEVLPA ANPTHPVADP YRWLENADDP EVRTWIDAQH 
ALCRRYLDAL PGRDRLRRRL TELLGAGVVS APVWRRGRQF FLRRQADQEH AVLFTVDPDG
TERVLLDPMA VDPTGRTTLD TWQPSKEGRF LAYQLSTGGD EESVLRVMDV ETGEIVDGPI
DRCRYSPVGW LPGGAAFYYV RRLPPGDVPP DETAFHRRVW LHRLGTSADD DVLIFGDGMD
KTTFFSASVS LDGRWLIVSA SPGTAPRNDV WIADLSDGDP AAPVLRPVQV GVDAQLVVHI
GRDGRAYLYT DRDAPRGRLA VADPTELPAE KWRDLLPEDP EAVLVDYAIL DGPQLDRPVL
VAAWTRHALS ELSVHDLETG ERLGAVTLPG LGTVTGLSEQ PEGGHQCWFG YTDYATPPSV
FCFDALTNAT TVWARPPGQA PVPPVHTTQV VYESRDGTPV RMMLIAPPVE PARPRPTILT
GYGGFGTSLT PGYSAGILAW VEAGGVYAVA NLRGGGEEGE QWHRAGMRGN KQNVFDDFHA
AADWLIANGW TTPGQLGISG GSNGGLLVGA AMTQAPEKYA AVVCSAPLLD MARYEKFGLG
PLWREEYGTA ENPEELAVLL AYSPYHNMRP GTPYPAVLFT VFDSDTRVDP MHARKMCAAL
QAASTSGKPV LLRRESDVGH GARALSRSIE LSVDTLAFLA AHTGLDLEQP DRTAEQPDRT
AGG