Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2157 |
Symbol | |
ID | 3833006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2261054 |
End bp | 2262082 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637830079 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_430989 |
Protein GI | 83590980 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0185524 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAGG AATTGGCGAC AGAAACGAAT ATCCTGGCCA TCGAGAGTTC CTGTGATGAG ACGGCGGCGG CCATTGTCAG CGACGGCACC AGGGTCCGGG CCAACATCAT CGCCTCCCAG ATCGCCGTTC ACCGCCGCTT TGGCGGCGTG GTGCCGGAAA TAGCTTCCCG CCACCATATG GAGAATATAG TACCGGTGGT ATCGGAGGCC CTGGCTACAG CCGGCCTGGC CTTTAGCGAT GTGGACGCCG TGGCGGTGAC CTATGGTCCC GGACTGGTAG GGGCCCTGCT GGTGGGTGTC GCTTACGCCA AGAGCCTGGC CTACGCCCTG GGTAAGCCCC TCATCGGTGT CCACCACCTC CTGGGGCATA TCTATGCCGG TTTTCTGGCC TACCCTGGCC TGCCCTTGCC GGCGGTCTCC CTGGTGGTCT CGGGCGGGCA TACCAACCTG GTCTACCTGG AGGATCACAC CACCCGTCGT ATCCTGGGGT CAACCCGGGA TGACGCCGCC GGGGAAGCCT TCGACAAGGT GGCCAGGGTC CTGGGGTTGC CCTATCCGGG CGGGCCGGAG CTGGAAAAAC TGGCCCGGGA AGGCAATCCC CGGGCCATTC CTTTCCCCCG GGCCTGGCTG GAGGAAAACA GCCTTGATTT CAGCTTTAGC GGCCTGAAAT CTGCGGTCAT CAACTACCTG CACCACGCCC GCCAGGTGGG CCAGGAGGTT AACCGGGCCG ACGTGGCTGC CAGTTTCCAG GCGGCGGTGG CCGAGGTCCT GGTGACCAAG ACCCTGCTGG CGGCTACCAG CTACCGGGCC AGGTCTATCC TTCTCGCCGG TGGGGTGGCG GCCAATTCGG TCCTGCGCCG GGAACTTCGT TCAGCCGGGG AGCAGGCGGG CCTCCCGGTC TTTTTTCCAC CGCGGGAACT CTGCACCGAC AACGCGGCCA TGATCGGCTG TGCCGCTTAT TACCAGTACC TGCGCCGGGA TTTTGCCCCT TTAAGCCTCA ACGCTATCCC CGATTTACCC CTTAATTGA
|
Protein sequence | MVKELATETN ILAIESSCDE TAAAIVSDGT RVRANIIASQ IAVHRRFGGV VPEIASRHHM ENIVPVVSEA LATAGLAFSD VDAVAVTYGP GLVGALLVGV AYAKSLAYAL GKPLIGVHHL LGHIYAGFLA YPGLPLPAVS LVVSGGHTNL VYLEDHTTRR ILGSTRDDAA GEAFDKVARV LGLPYPGGPE LEKLAREGNP RAIPFPRAWL EENSLDFSFS GLKSAVINYL HHARQVGQEV NRADVAASFQ AAVAEVLVTK TLLAATSYRA RSILLAGGVA ANSVLRRELR SAGEQAGLPV FFPPRELCTD NAAMIGCAAY YQYLRRDFAP LSLNAIPDLP LN
|
| |