Gene Information |
Locus tag | Moth_2417 |
Symbol | |
ID | 3832168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2540004 |
End bp | 2541182 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637830336 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_431242 |
Protein GI | 83591233 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
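The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Moth_2417.
length = gene_length(2540004, 2541182)
print(length)                  # 1179
print(protein_length(length))  # 392
```

Because the gene is reported as unspliced and not a pseudogene, the simple "codons minus one" relation is expected to hold; it would not necessarily hold for spliced or frameshifted features.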
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0358901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGGCTG TCGCTCCCCG AGGAAACAAG GGGGTTGCCT TCTTGAAGCG CTTTTTTCCC GGCGGTTTGA GGCGCTACCG GATACTCCTT CTGGTCTCAT TAGTTTTTTT GAGTTCTATC CTGGGGGCGG GAGCCGCCCT CTATTTCGCA CCGCGATTGA TGTTCCGCCC CCTGCCACCA GCGTCCCCAC CCCAAGGCTT GATTAACCAA CCCGTAGGCT TGCCGGCCCA GTATACCGCC GATTCACCGG TAGTCACTAT TGCCCGGCAG GTGGGCCCTT CCGTCGTGGG GGTTCAGGCC ATGACCGGTA CCAATTATAC TGGCGACGGC GTGGTAAAGC AGGGTTCGGG AGTAATCTTT GATACCACTA ACGGCTATAT TGTTACCAAT AACCACGTTA TTGCTGGCGC CGGTCGGATA ACAGTCAGCC TGGACCGGGA GCAAACCTAT CCGGCGACCC TGGTAGGTGC CGATGAGCGT AGCGATCTGG CGGTATTAAA GGTCCAGGGG CCCAATCTCC CCCAGGCGCG CCTGGGGGAT TCCAGCACCC TGCAGGTAGG GGAAACTGTA GTAGCCATTG GTAACCCCCT GGGACGGGAG TTTGCCCGGT CGGTAACCGT AGGGGTAATC AGCGCTTTGA ACCGGGAGGT AACAGTTCCC GGGTCCCGGG GTGTGGAGAT AACCCTCCGT GTCCTCCAGA CCGATGCCCC AATTAACCCT GGCAACAGCG GCGGCGCCCT GGTTAACCTG CGAGGTGAGA TTATCGGCAT CAACAGCGTC AAGATTGCCG CCAGTGGTGT CGAGGGAATG GGTTTCGCCA TTCCCATAAA CGACGTCCGG CCCATTATTG ACCAGATAAT CACCCGCGGC TATGTCACCC ACCCCTTCCT GGGAGTTTAT AACCTCCAGG AGATTACCCC GGAGATGGCC CAGTGGTATA ATATACCTGT AGGCGTCTAT GTCGGGGGTG TCTTCAAGGA TGGCCCGGCA GCCAAGGCCG GCCTGCAGGT AGGAGATGTC ATCACCGCCG TAGAGAACCA GAAAGTCGCC ACCTATGATG ACATCCAGCG CCTGATCAAT AAGAAATCCC CGGGGGATCA GGTGACGGTG ACTATCCGGC GCCTCAAGTC GCCAAACCCG GTCAACTACA CTATAACCCT GGGGGAATTA CCTAAATAG
Protein sequence | MAAVAPRGNK GVAFLKRFFP GGLRRYRILL LVSLVFLSSI LGAGAALYFA PRLMFRPLPP ASPPQGLINQ PVGLPAQYTA DSPVVTIARQ VGPSVVGVQA MTGTNYTGDG VVKQGSGVIF DTTNGYIVTN NHVIAGAGRI TVSLDREQTY PATLVGADER SDLAVLKVQG PNLPQARLGD SSTLQVGETV VAIGNPLGRE FARSVTVGVI SALNREVTVP GSRGVEITLR VLQTDAPINP GNSGGALVNL RGEIIGINSV KIAASGVEGM GFAIPINDVR PIIDQIITRG YVTHPFLGVY NLQEITPEMA QWYNIPVGVY VGGVFKDGPA AKAGLQVGDV ITAVENQKVA TYDDIQRLIN KKSPGDQVTV TIRRLKSPNP VNYTITLGEL PK
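The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in plain Python, assuming the gene sequence is already given in the coding orientation (5' to 3', starting at ATG), as it appears in this record. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard codon table is used here. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGGCGGCTG TCGCTCCCCG AGGAAACAAG"
print(translate(prefix))  # MAAVAPRGNK
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a quick consistency check, and `gc_content` on the full sequence can be compared with the GC content field.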