Gene Information |
Locus tag | Acel_1489 |
Symbol | |
ID | 4485406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1675569 |
End bp | 1677509 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639730273 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_873247 |
Protein GI | 117928696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
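The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1675569
END_BP = 1677509
GENE_LENGTH_BP = 1941
PROTEIN_LENGTH_AA = 646


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the values reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the interval spans 1941 bp, which corresponds to 647 codons, or 646 residues once the stop codon is excluded.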
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.475976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGAGA TTGCTCCGTA CGGCAGTTGG GTGTCCCCGA TTTCCGCAGC CGACGTCGCG CGGGGCGGCG TTCGGCTTGG ATTTCCGAGC CTCGTCGCCG GCGACGTCTG GTGGTTGGAA GGCCGCCCGA CCGAGCAGGG CCGGCAGGTC GTCGTCTCCG CCCAGCGAGG TGACCTTCTC GGGCCGCCGT GGAACGCCCG TACCCGGGTC CATGAGTACG GCGGCCGAAG TTTCTGGCCG ATAGCCGCGG ACTCGTTTGT CTTCAGCGAA TGGACCGACC AGCGGCTCTA CCTCGTCACC GGGGAGGAGG ACCCGCGTCC CCTGACACCT GCTCCCGCCG TGCCGAGCGG TTGGCGATAC GCCGACGCCA CGGTCGGCCC GGACGGGCAG GTGTGGTGCG TGCGGGAGAC GGTCACGGCC GCCGGGGCAT TGGACGTCGT CCGGGACATT GTCGCCGTTC CCTTGGACGG GTCGGCGGCC GATGATCCGC GACGCATCCG GGTTGTCGTG GGCGGCAGCC GGTTCTTCGC GTATCCGACG GTGAGCCCGG ACGGCGGCCG CCTTGCCTGG GTCGCGTGGG ATCACCCGCA GATGCCGTGG GACGGCACCG AATTACGGAT CGGCGCGCTC CGCGACGGCA TCGTCGACAC CTGGACGACT GTCGCGGGCG GGCCGGCGGA ATCCATTCTG CAGCCGACGT GGGCGACGGA CGGTTCGCTG TATTTCCTCT CCGACCGCAG CGGGTGGTGG AATCTGTACC GCTGGGACGG CGCGGCGGTT CACGCCCTTG CACCGCGGGA TGAAGAATGC GGCGGTCCGC TGTGGCAACT GGGCATGCGG TGGTTCGCAC CGCTCGCCGA CGGCCGGATC GCTGTCATTC ACGGCGGACA TCTCGGTGTG CTGAACCCCG ATTCCGGTGA GGTCGTGGAT GTGGCAATTC CCCTGTCGTA TGTGGATGCG TCGGTCACCG CCGACGGCTC CGAGGTTGTC GTCGTCGGTG CGAGTCCGAC GCACCCGCTG TCGGTTGCGC GCGTGGACGT TTCTACCGGC CGGTACGCAG TGGTGCGCCG GTCGCTGGAG ACCCTTCCCC CTGAGGAATA CTTGCCGGTG CCGGTCACGC GTATTTTCCG GTCCGACGAC GGGCATGACG TGCACGCGCA CGTTTATCCG CCGCGGAATC CGGATTTCCG TGCGCCGGAT GGTGAGCGCC CGCCATACAT TGTCGTCGCG CACGGCGGTC CGACAGCATC GTCGCCGCCG ATATTCCGGT TGGAATACGC GTACTTCACC AACCGCGGCA TCGGAATTCT GGACGTCGAC TACGGCGGGT CGTCGGGGTA CGGCCGCGCC TACCGGGAAC GTTTGCGTGG ACAGTGGGGT GTCGTCGATG TCGCTGATTG TGTGACGGCG GTTCGCGCAT TAGCGGCGTC CGGCGAAGCC GACCCGAACC GCGTGGCGAT CCGCGGCGGC AGTGCCGGCG GCTGGACGGT CCTCTGCGCC GTCACGCGCA CGGACGTTTT CGCCGCGGGC ACCTCGTACT TCGGTGTCGC CGATCCCGAG CAGCTCGCGG CGGAGACCCA TGACTTTGAA TCGCACTACC TTGACGGGCT GCTCGGTCCC CTGCCGGAGG CGCGTGACGT CTACCGCGAG CGGGCGCCGA TCCGCCGCGT CGACGCGGTA CGCTGCCCGG TGCTGTTGCT GCAAGGCGCG CAGGACCCCA TCGTTCCGCC GTCCCAAGCG GAATTGTTCC GCGACGCGCT GGCGGCCAAG GGAATTCCGC ATGCGTATCT GTTGTTTGAG 
GGCGAGCAGC ATGGGTTCCG ACAGGCGGAG AACATCGTCC GCGCGTTGGA AGCGGAATTG TCGTTCTACG GCCAGCTCTT CGGCTTTTCG CCGCCGGGAA TTCCCGTGCT TCCGCTCACG CCGGCGCCGC CGGCTCCTTA G
|
Protein sequence | MPEIAPYGSW VSPISAADVA RGGVRLGFPS LVAGDVWWLE GRPTEQGRQV VVSAQRGDLL GPPWNARTRV HEYGGRSFWP IAADSFVFSE WTDQRLYLVT GEEDPRPLTP APAVPSGWRY ADATVGPDGQ VWCVRETVTA AGALDVVRDI VAVPLDGSAA DDPRRIRVVV GGSRFFAYPT VSPDGGRLAW VAWDHPQMPW DGTELRIGAL RDGIVDTWTT VAGGPAESIL QPTWATDGSL YFLSDRSGWW NLYRWDGAAV HALAPRDEEC GGPLWQLGMR WFAPLADGRI AVIHGGHLGV LNPDSGEVVD VAIPLSYVDA SVTADGSEVV VVGASPTHPL SVARVDVSTG RYAVVRRSLE TLPPEEYLPV PVTRIFRSDD GHDVHAHVYP PRNPDFRAPD GERPPYIVVA HGGPTASSPP IFRLEYAYFT NRGIGILDVD YGGSSGYGRA YRERLRGQWG VVDVADCVTA VRALAASGEA DPNRVAIRGG SAGGWTVLCA VTRTDVFAAG TSYFGVADPE QLAAETHDFE SHYLDGLLGP LPEARDVYRE RAPIRRVDAV RCPVLLLQGA QDPIVPPSQA ELFRDALAAK GIPHAYLLFE GEQHGFRQAE NIVRALEAEL SFYGQLFGFS PPGIPVLPLT PAPPAP