Gene Information |
Locus tag | Moth_1536 |
Symbol | |
ID | 3831922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1579958 |
End bp | 1581010 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637829468 |
Product | hypothetical protein |
Protein accession | YP_430388 |
Protein GI | 83590379 |
COG category | [S] Function unknown |
COG ID | [COG3854] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02858] stage III sporulation protein AA |
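The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 1579958
END_BP = 1581010
GENE_LENGTH_BP = 1053
PROTEIN_LENGTH_AA = 350


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values listed in this record.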
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.127679 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACCG TACACCTGGT ACAGGTTGAA ACCGCCAATC AGCCTCCTGG ACAGGCACCC CGCCGGGATG GAGACCCCCT GGCAGGGATC GAGGAGTTGC TACCGCCGAA TATCAAGGCC GCTGTGGAAA GTCTTCCCGC CGGAATTAGG GATAATCTGG AAGAGATTCG CCTGCGCCGG GAGCGTCCCC TTCAGGTTCG CTGGAGCGGC GGCGAAGGCT GGGTCGCAGC CAGCGGCGGC CTGGCCGCCG GCCCGGATGG CGCTTATAAA GTAACTGCTG CCGATCTGGG GCGGACCATT GAGGCCCTGA CCAGGAGTTC CCTTTACGCC CTGGAAGAGG AGCTGCGTTC CGGTTATATC ACCATCAGCG GCGGCCACCG GGTAGGCCTG GTAGGGGAGG CGGTTGTACT CCAGGGGGAA ATCCGCACCT TGAAAAACTT TGCCGGACTC AACCTGCGCC TGGCGAGGGA TATCCCCGGT TGCGCCAGGA GCCTCATTCC TTACCTTCTG GAGGGAGGGC GCCCCCTGCA TACCCTGATC CTCTCACCGC CGCGGTGCGG CAAGACAACC CTCCTGCGGG ATCTCATCCG CCTCCTAAGT ACCGGCGTAC CAGAGCTTAA GTTTTCCGGT GTCAATGTGG GTGTGGTGGA CGAGCGGTCG GAAATTGCCG GCTGCTGGCT GGGGGTACCG CAGCTCGAGG TAGGCCCGCG GACGGATGTC CTGGACCGCT GCCCCAAAGC GGCAGGGATG CTCATGCTCC TGCGGTCTAT GGGACCGGAA GTCATTGCCA CCGACGAGAT CGGCCGGCCG GAGGAACTGG CGGCCCTGCA GGATGTTCTC CACGCCGGCG TCACCATGCT GGCCAGCGTC CACGCCGGTA GCCTGGAAGA GCTGCAACAC CGCCCGGGCT GGGGCCCCCT GCTTAAGCAG GGCTTCTGGC AGCGCCTGGT GCTCCTGGGG CGCACCCTGG GTCCGGGAAC TATTGAAGGC GTTTTTTCCG GGGATCACCG TACCCTGAAG CGGGGTCCCT GGCGGGGGGA GGCCCGACCG TGA
|
Protein sequence | MATVHLVQVE TANQPPGQAP RRDGDPLAGI EELLPPNIKA AVESLPAGIR DNLEEIRLRR ERPLQVRWSG GEGWVAASGG LAAGPDGAYK VTAADLGRTI EALTRSSLYA LEEELRSGYI TISGGHRVGL VGEAVVLQGE IRTLKNFAGL NLRLARDIPG CARSLIPYLL EGGRPLHTLI LSPPRCGKTT LLRDLIRLLS TGVPELKFSG VNVGVVDERS EIAGCWLGVP QLEVGPRTDV LDRCPKAAGM LMLLRSMGPE VIATDEIGRP EELAALQDVL HAGVTMLASV HAGSLEELQH RPGWGPLLKQ GFWQRLVLLG RTLGPGTIEG VFSGDHRTLK RGPWRGEARP
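The gene and protein sequences above can be related with a small translation routine. This is a minimal sketch, not the database's own tooling: it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to the 64 codons as the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate coding-strand DNA, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence (an excerpt, not the full CDS).
    excerpt = "ATGGCTACCG TACACCTGGT ACAGGTTGAA"
    print(translate(excerpt))  # MATVHLVQVE
    print(f"GC fraction of excerpt: {gc_fraction(excerpt):.2f}")
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the reported GC content.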