Gene Information |
Locus tag | Moth_1059 |
Symbol | |
ID | 3833323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1090805 |
End bp | 1092070 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637828987 |
Product | peptidase M16-like |
Protein accession | YP_429916 |
Protein GI | 83589907 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
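The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record for Moth_1059.
length = gene_length_bp(1090805, 1092070)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Under these assumptions the computed values agree with the Gene Length (1266 bp) and Protein Length (421 aa) fields.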
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0076042 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00221195 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGTACCAGA AAGAAATCCT GGAGAACGGT ATCCGGATAG TCTCCGAGGA AATCCCCTTT GTTAATTCCG TAGCCCTGGG CGTATGGGTA CGGACCGGGT CCCGTAACGA GGATGAGGAT AACCAGGGCG TTTCCCACTT TCTGGAGCAC CTTTTGTTTA AGGGGACTAC CAGGCGTACT GCCAGGCAGA TAGCGGAGGA ACTGGAAGCA GTCGGCGGGG TCATCAATGC CTTCACAACT AAAGAATATA CCTGCTTTTA CAGCCGGGTC CTGGCGGAAC ACCTGGACCT GGCTATCGAT GTTTTAAGCG ATATGTTTTT TAATTCCCTC CTGGCCCCGG AAGATATCGA GAAGGAAAAA AGGGTGATCC TGGAGGAAAT TAAAATGTAC GAGGACTCCC CGGATGAATT GATCCACGAC CTTTTTGCCC GGACCATCTG GCCGGGCCAT CCCCTGGGAA GGGCTATCCT GGGGACCTAT GAAACAGTGG CTGCTTTAAA CCGGGATCTC ATATACCGCT ACTACCAGGA ACAGTATAAT TGCGCCAATA TTGTCCTGGC TGCTGCCGGT AAGTTTAATA CCTCCGAACT GATAGTCAAA CTGGAAGCAT CCTTCGGCCG GCAAAGGCGT CCGGGCAAGG CAGCCCAATT CCACCCGCCT GTAAACCGGG CGGCCACCAG TATGCAGGTC AAGGATACGG AACAGGTCCA GATCTGCCTG GGTGTGCCGG GACTGGCCCA GGATGATCCA GCTATTTATG CTGTCCAGGC CTTAAACAAT ATCCTGGGGG GCGGCCTGAG TTCCCGCCTT TTCCAGCTTA TCCGGGAAGA ACGCGCCCTG GCCTATTCTG TCTATTCCTA CCATGCCGGT TTCGGCGACA GCGGCCTCTT TACTGTTTAC GCCGGTACCA GCCCGGATAA TTACCGGCAG GTGGTGCAAC TGGTTCTGGA AGAACTGGCG TCCCTGAAGA ACAACGGCGT TACCGAAGAG GAGTTAAAAA GGACCAAGGA CCAGATCCGG GGCAATCTTC TCCTGGGTCA GGAGAGTGTC AGCCAGCGTA TGAGCCGCCT GGGGAAAACG GAGGTTTCTT TTGGCCGGGT AATCACGGCC GAAGAAGTAA TCGAGCGCCT GAACCAGGTT ACGAGGGATG ACGTCCAGGC CCTGGCGCAG CGGCTTTTCC GGCCGGAATA CCTATCCCTG ACCGCCCTGG GGCCGGAGGT AGAACCCCTG GACTTGGCCG CCATCGCCAG CTCCGCAGGG TTGTAG
Protein sequence | MYQKEILENG IRIVSEEIPF VNSVALGVWV RTGSRNEDED NQGVSHFLEH LLFKGTTRRT ARQIAEELEA VGGVINAFTT KEYTCFYSRV LAEHLDLAID VLSDMFFNSL LAPEDIEKEK RVILEEIKMY EDSPDELIHD LFARTIWPGH PLGRAILGTY ETVAALNRDL IYRYYQEQYN CANIVLAAAG KFNTSELIVK LEASFGRQRR PGKAAQFHPP VNRAATSMQV KDTEQVQICL GVPGLAQDDP AIYAVQALNN ILGGGLSSRL FQLIREERAL AYSVYSYHAG FGDSGLFTVY AGTSPDNYRQ VVQLVLEELA SLKNNGVTEE ELKRTKDQIR GNLLLGQESV SQRMSRLGKT EVSFGRVITA EEVIERLNQV TRDDVQALAQ RLFRPEYLSL TALGPEVEPL DLAAIASSAG L
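The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the codon-to-amino-acid assignments of translation table 11 (which match the standard code; the tables differ in their permitted start codons), strips the spaces that group the sequence into blocks of ten, and uses the hypothetical helper names `clean`, `translate`, and `gc_fraction`.

```python
from itertools import product

# NCBI ordering: first, second, then third base cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of ten) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGTACCAGA AAGAAATCCT GGAGAACGGT"))  # MYQKEILENG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above once its spaces are removed, and `gc_fraction` on the same input can be compared with the reported GC content of 55%.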