Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0500 |
Symbol | |
ID | 3832823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 517103 |
End bp | 518146 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637828434 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_429373 |
Protein GI | 83589364 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000000058985 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCAA TCAATGTGAA CACTGTGCTG GCCGTTATGC TGAGCGGTTT GTTAATCATG GCCGGGGGTT TTGGTCTGGC CAGGATTTAC GATAACATGC ATAATCAAGT GGTTGCAGAG GCATCTAAAC TCGGTCCTAT CGAGGAGAAC CACTCTGAGT CGGCCAAGTC CCAGGATTTA AAAAGCATTA TTGCCGAGAA TCAAAAGCTA GTAGTGTGTC TAGAGGTTGA GCTTGATTAC GGTAAGGTAC AGGGTTCCGG ATTTCTTTAT AACCGGCAGG GGGATGTAAT AACCAATGCC CACGTGGTGG GTGACGCCAG AACTTGCCGG GTCAAATTGT CCGATGGCAC AGTTTATCAA GGCACAGTTA TCGGGCGTGG GGAACAAATT GATGTGGCTC TGGTCAGGGT GCCGGAACTG GCTGGTAAGG AACCAATGAA AATTGCCCGG GATAGAATGG CAGAAGTGGG AGATGAAGTG ATCGCCCTGG GCAGTCCCCT TGGATTACAA AATACCGCCA CCACAGGAAT CATCAGCGGT GTCAACCGGG ATTTAGATAT CGGGGATTAT CATTATAAGG GGCTATACCA AATTTCTGCC CCCATCTCGC ATGGCAGTAG CGGAGGACCG TTACTGGATC GACATACCGG CGAGGTGCTG GGTGTTAATT CCGCCGGGGC CGAGGGAGAA AATATCGGCT TTAGCATACC CATCACCCAG GTTTTACCCC TGGTGGAAAA CTGGTCAAAA AACCCCAGTA CAGCACCGGT GCAGCCAACC ACCGGTACTG TCGGGAAAAT TTCCGAGAAA GAATTAGCCA CTGCTGCATC CTACCTGGTT GAGGACTTTT ACACTTGTGT CAATAACGGG GATTATGTAG GGGCCTATGC TCTGCTGGGA AGTGACTGGC AGGCCAAACA GCCCTATGAG AAGTTCCGGG CCGGCTATTT AAATACACTT TCAGTCTCAG ACGCATTTTT AAAACCAGGC CGCCATTCGG CAACTCCTCA ATTACTTGAC TGGAATGCCA GTGTCGCTCC TTGA
|
Protein sequence | MKSINVNTVL AVMLSGLLIM AGGFGLARIY DNMHNQVVAE ASKLGPIEEN HSESAKSQDL KSIIAENQKL VVCLEVELDY GKVQGSGFLY NRQGDVITNA HVVGDARTCR VKLSDGTVYQ GTVIGRGEQI DVALVRVPEL AGKEPMKIAR DRMAEVGDEV IALGSPLGLQ NTATTGIISG VNRDLDIGDY HYKGLYQISA PISHGSSGGP LLDRHTGEVL GVNSAGAEGE NIGFSIPITQ VLPLVENWSK NPSTAPVQPT TGTVGKISEK ELATAASYLV EDFYTCVNNG DYVGAYALLG SDWQAKQPYE KFRAGYLNTL SVSDAFLKPG RHSATPQLLD WNASVAP
|
| |