Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0201 |
Symbol | |
ID | 3832274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 197705 |
End bp | 199084 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828137 |
Product | peptidase |
Protein accession | YP_429079 |
Protein GI | 83589070 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02889] germination protein YpeB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACACGCA AGCTTTCAAC CATTCTCCTT TCCCTGGCCC TCCTGCTGGC TATCGGCTGG GGACTGTGGG AGAGGGCCAA CCGGCTAACC CTGGCCAACG CCGTCGAGGC CGGCGGCCAA CGTGATTTTT ATAATCTCCT GAACTATGTC GAGCAGGCCC AGGTAAGTAT GGGCAAAACC CTGGCCAGCA GTTCCCCCCG TCAACAGGCC GTCCACCTGA CGGAAGCCTG GAACCAGGCG GCTGCGGCGC AACACTCCCT GACCCAGCTC CCCACCCCCG GTTTTAAACC GGTGAATACC AGCAAATTTC TGTCCCAGAC CAGCGATTAC AGTAACTACC TGGCGCAAAA ACTGGCCCGG GGCGAAGAAA TGACCCCCCA GGAAAGGCAG CAACTGGCCA GCCTGCGGGA AGAAATGGGG CGTCTGGCCG CCGATTTGCA CCAGACAGAA GGCCAGGTGG CCGGCAAGAC TTTGCGCTGG AGCAGCTTTT ACGGCTTCAA GATGCCGTCC CTGCCGCGGA CCATAGCCGG CCGGGCCCTT CCGGTCGAGG CGCGACCCGG CCCCCTGGAT GGTTTCGTCA ATACCGACCG GCGCCTGCAG ACCCTCCCCA GCCTGAACTA TGACGGCCCC TTTTCCGATC ACCTGGAGAA ACAGCGACCC CTGGGGTTGG GTGGTGGCGA GGTAACCCAG GCCGAGGCTG AACGACGGGC GATGAACTTC AGCAACGCCG CCAGTAACAG CAATTACCGG GTCCAGGCCA CTCGCACCAG CAACGGCCGC ATCCCCACCT TTTCCCTCCG GCTGCTGGAT AGCAGGCGGC CCAATGTGAC GACCCATATA GACGTCAGCA AGCAGGGCGG CCAGATCGTC TCCCTGTTAA ACACCCGTCC TGTAGGAGCT CCTACCCTGG ACGCCGCCGC GGCCCTGGAA AAGGCCAGGG CTTTTCTCCA GGCCCAGGGC TTCACCGGGA TGCAGCCGAC CTACACGGTA CGCACCGATA ACAACCAGGT CATCACCTTT GCCGCCAAGG AAGGGGATGT CATCCTCTAC CCGGATCAGG TGAAGGTGAA GGTCGCCCTG GACAACGGCG AAATAACCGG CTGGGACGCC ACACCCTATT ATATGTCCCA CCACAAACGG GATCTGCCCC GGCCGAAGCT GACACCAGAG CAGGCCCGGG CCAAAATAAA CCCCGGGATC AAGGTAGAAG GCGTCAGGCT GGCCCTCATA CCCTTGCCCG GGGGGCAGGA GAAGTTGACC TACGAGGTTA AAACCAAAAT GGACAACACT TATTACCTCA ACTATATTAA TGCCTTGACC GGTGAGGAAG AAAAGGTCTT GCAGATAATC GACGTACCCG GCGGCCAGCT CACCATGTAG
|
Protein sequence | MTRKLSTILL SLALLLAIGW GLWERANRLT LANAVEAGGQ RDFYNLLNYV EQAQVSMGKT LASSSPRQQA VHLTEAWNQA AAAQHSLTQL PTPGFKPVNT SKFLSQTSDY SNYLAQKLAR GEEMTPQERQ QLASLREEMG RLAADLHQTE GQVAGKTLRW SSFYGFKMPS LPRTIAGRAL PVEARPGPLD GFVNTDRRLQ TLPSLNYDGP FSDHLEKQRP LGLGGGEVTQ AEAERRAMNF SNAASNSNYR VQATRTSNGR IPTFSLRLLD SRRPNVTTHI DVSKQGGQIV SLLNTRPVGA PTLDAAAALE KARAFLQAQG FTGMQPTYTV RTDNNQVITF AAKEGDVILY PDQVKVKVAL DNGEITGWDA TPYYMSHHKR DLPRPKLTPE QARAKINPGI KVEGVRLALI PLPGGQEKLT YEVKTKMDNT YYLNYINALT GEEEKVLQII DVPGGQLTM
|
| |