Gene Information |
Locus tag | Moth_2506 |
Symbol | |
ID | 3832778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2612099 |
End bp | 2613085 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637830429 |
Product | spore germination protein-like |
Protein accession | YP_431331 |
Protein GI | 83591322 |
COG category | [R] General function prediction only |
COG ID | [COG5401] Spore germination protein |
TIGRFAM ID | |
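The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2612099
END_BP = 2613085
GENE_LENGTH_BP = 987
PROTEIN_LENGTH_AA = 328


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 2613085 - 2612099 + 1 = 987 bp, and 987 / 3 - 1 = 328 aa, which agrees with the Gene Length and Protein Length rows.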
| |
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0523043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACTTT TCACAGTCCT GCTGTTAAGC CTGGTTTTAG TTATTAATGG TTGTGGTAGG AAAGCTCAAG CTCCTGCAGC AATTAATCCT GGTGCCAGCG AAGGTCCCAA GGAAATCAGA CCGTTGCTCT ATGACCGGGA GCTGGACCGG GTCATCGTCT ATTACCTGAC CGGAGATGGT CGCTACCTGG TACCGGTGAC CGTTAATTTT AATCCGACCA GGGAAGTTGC CAAGATAGCA GTAGAGAAGT TATTGGCCGG CCCCCAGGGT GACGGGCTGA AACCGGTCTT TCCTGAGGGT GTCAAGCTTC AAGATATCTA TCTTTTAAAC AACCAGCAAA CTGTCTATGT AAATTTGACC AGGGAATTTC TGGATATCAA AGATGCCAGG CAGGCGGACC TAGCCGTTAA AGCCCTGGTT CTAACGATGA CGAACCTCAC CAACGTCAAG GAAGTTCAGA TCCTGGTAGA GGGCAATAAA GTACCTGAGG TAGCAGGGGT AAAATTGGAT GCCCCTCTAC ACCGCCCTGA CAGTGTTAAT AGCCTGCTGA AGGACGGCAA TCAAAAGGGG GTTCAGGTGT TTTTCAACGA TGCTGATGCC CGCTTTTTCG TCCCGGTAAC AGTTGCTATG CCGCCGGGGT CCAGTGCAGA TAATCTACCC CGGGCGGCAG TTCTGGCCCT CTTGGCCGGT CCTCCTGCAG ATAGCGGTCT TATTCGGACT ATCTGGCCCG GGACCAGGCT CCTGGACTTT AAGGTTGAGG GAGGCCTGGC CACGGTTAAT TTCAGCCGCC AGGTCACCGG TTACGGCGGA GGCAGTGCCG CCGAGACCGC CCTGCTAAAA TCCCTCCTCT TTACCCTGAC CCAGTTCCCA GATATTGACC GGGTACAGAT TCTTATTGAC GGGAAGAAGA AGGAGTATTT ACCTGAAGGT ACGGCTATCG ATAAACCCCT GTCCAGGCCG GAACTCCTCA ATCCCCTTAA TCACTAA
Protein sequence | MVLFTVLLLS LVLVINGCGR KAQAPAAINP GASEGPKEIR PLLYDRELDR VIVYYLTGDG RYLVPVTVNF NPTREVAKIA VEKLLAGPQG DGLKPVFPEG VKLQDIYLLN NQQTVYVNLT REFLDIKDAR QADLAVKALV LTMTNLTNVK EVQILVEGNK VPEVAGVKLD APLHRPDSVN SLLKDGNQKG VQVFFNDADA RFFVPVTVAM PPGSSADNLP RAAVLALLAG PPADSGLIRT IWPGTRLLDF KVEGGLATVN FSRQVTGYGG GSAAETALLK SLLFTLTQFP DIDRVQILID GKKKEYLPEG TAIDKPLSRP ELLNPLNH
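The GC content field could be recomputed from the gene sequence shown in this section. The sketch below assumes that the spaces in the sequence are only display formatting (blocks of 10 bases) and that GC content is the percentage of G and C bases over the full gene length. The helper names are hypothetical.

```python
def clean_sequence(raw):
    """Remove the whitespace that splits the sequence into 10-base blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Short demo on the first three blocks of the gene sequence in this record.
# Paste the full Gene sequence field here to recompute the reported value.
demo = "ATGGTACTTT TCACAGTCCT GCTGTTAAGC"
print(len(clean_sequence(demo)), "bases")
print(round(gc_percent(demo), 1), "% GC")
```

Running `gc_percent` on the complete Gene sequence field, rather than the short demo string, gives a value that can be compared with the GC content row.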