Gene Information |
Locus tag | Acel_0360 |
Symbol | |
ID | 4485899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 371430 |
End bp | 372461 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639729127 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_872120 |
Protein GI | 117927569 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.195424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGACG AACCGTTGGT CCTGGGCATC GAGACGTCAT GCGACGAAAC GGGGGTCGGT GTCGTTCGCG GCCGTACCCT GCTCGCCAAC GAAATTGCCT CCAGCGTCGA TCTTCACGCC CGCTTCGGCG GCGTCGTCCC GGAGGTGGCG AGTCGGGCGC ATCTGGAGGC ATTGGTGCCG ACGATGCACC GGGCGCTGGA GAAAGCCGGC CTCCGGCTCG CTGACGTCGA CGCGATCGCG GTGACCGCGG GACCCGGGCT CGCCGGCACC CTGCTCGTCG GCGTGGCGGC CGCCAAAGCG TATGCCCTGG CTCTCGGCAA GCCGCTGTTC GGCGTCAACC ACCTGGCGGC GCACGTTGCC GTCGACATCC TGGAGCATGG TCCGTTGCCG CGTCCATGCG TCGCGCTCTT GGTCTCCGGT GGCCACTCGT CGCTGCTGCT GGTCGAGGAC GTGACCGGTA CGGTGCGGCC GCTTGGTTCG ACGGTGGACG ACGCGGCCGG GGAGGCGTTC GACAAGGTAG CGCGCGTCCT TGGCCTGCCG TTTCCCGGCG GGCCGCCGAT CGACCGCGCC GCACAAGAAG GGGATCCGCA GTTCGTGGCG TTCCCCCGGG GCAAGGCCGA CGACGGCACC TTCGATTTTT CGTTTGCCGG CTTGAAGACC GCGGTGGCCC GATGGGTGGA GAAGCGGGAG CGCGACGGTG AGCCGGTGCC GGTGGCCGAT GTCGCTGCGG CGTTTCAAGA GGCGGTGGCG GACGTCCTGA CGGCGAAGGC GGTCGCAGCG TGCCGGACGT ACGGGGTCGG GGATCTCCTC ATCGGCGGTG GGGTCGCGGC AAATTCACGG CTCCGGTCGC TCGCGGCCGA GCGGTGCGAG GCGGCGGGCA TTCGTCTTCG GGTCCCGCGG CCGGGGCTCT GCACGGACAA TGGCGCGATG GTCGCGGCGC TTGGTGCATG CTTGATCAAA GCGGGGCGGA CGCCGTCCGA GCCGGAGTTT CCGGCCGACT CCTCGCTCCC GATCACTGAG GTTCTCGTGT GA
|
Protein sequence | MTDEPLVLGI ETSCDETGVG VVRGRTLLAN EIASSVDLHA RFGGVVPEVA SRAHLEALVP TMHRALEKAG LRLADVDAIA VTAGPGLAGT LLVGVAAAKA YALALGKPLF GVNHLAAHVA VDILEHGPLP RPCVALLVSG GHSSLLLVED VTGTVRPLGS TVDDAAGEAF DKVARVLGLP FPGGPPIDRA AQEGDPQFVA FPRGKADDGT FDFSFAGLKT AVARWVEKRE RDGEPVPVAD VAAAFQEAVA DVLTAKAVAA CRTYGVGDLL IGGGVAANSR LRSLAAERCE AAGIRLRVPR PGLCTDNGAM VAALGACLIK AGRTPSEPEF PADSSLPITE VLV
|
| |
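The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the values listed: 371430 to 372461 giving 1032 bp) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T; spaces and other characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above.
length_bp = gene_length(371430, 372461)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Because `gc_fraction` skips whitespace, the gene sequence can be pasted in with its 10-base spacing intact to compare against the reported GC content.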