Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1462 |
Symbol | |
ID | 3831348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1511892 |
End bp | 1513142 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829395 |
Product | spore germination B3 GerAC like |
Protein accession | YP_430315 |
Protein GI | 83590306 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02887] germination protein, Ger(x)C family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAAT TGAAGAGGTT GGTGCTCTTT ACATCTATCC TGGGGGCATT TGCGCTGGCG TTTTCTGCCG GCGGCTGCTG GGATCGCCGG GAGATAAACG AGCTCGCTTT CCTTTCCTGC GCAGCCTTTG ACCTGGAGGG CGGTAATCGC GTCCTGACAT CTGAGTTCAT CCGGCCCTCC GCAGCCGGTG GGGGAGAAAG GGGCGGTGGG GAGACCTTGC CCCAGCGACA GGCCCTCATA GTGAGTAACC GGAACAAGAC CTTCCTGGCC ATCGGGCGGG AAAAAGCCCT GGGGCTGCCC CGTCGGGCCT ACCTGGCCCA TACTGCCGCC GTCCTGGTGG GGGAGGAGAT GGCCCGGTAC GGCATAAAAG AAGTCCTGGA TTTTGTCGAC CGGAACCCGG AGATACGCCG CACCACCCTG ATTTTACTTA CCCGCGGCCC GGCGCGGGAG GTGCTGGTCA GGGCCCAGAG CGGCTTGGAA AAAACCCTGG GAAGGGAAAT AACCGGCCTT CATAAATGGG TCCAGGTCAG CGGTTACGGA TATATCCCCA ACATTAACGA TATTTTTTTC GATTTGTCCG GTGATGCGGG AACAACCGTC CTGCCGGTGC TGGAATTAAG TCCCCAGCCG TTCCCGCCCA TTCTCGGCCC CGCTACCGCG ACGGGCGGCG GCATTCCCGC CGGGAGGGCT GGAGAACCGG AAACGCTAAT GACGGCGCGC CTGAACGGCG CCGGGCTGTT CTACCATGAC AAATTGGTGG CGTGGCTGGA TCAAGAACAA ACCCGCGGCT GGGCCTGGGT ACGCAATAAA GTTAAAAGTG CCATGCTGGC TTTACCCCGC CAGGAAAACA GCCTGGTATC CGTAAATATC ATCTCTTCCC GGGCCGAGGC CGCTATCGAC ATGCAAGGAG GCCGGCCCCA GGGCAAAATC AAGATCAAGG TGGAGGGCGA TCTCCTGGAA GAGCAGTGCT ACCAGGACTT TACCAAAGAA GAAGCTGTTA AATCGCTGGA AAGCCGGATG GCAGCCCAGA TCACGTCTGA AATCAGCAGC GCCCTCAATC AGGCCAAGAT GGCTGGTACG GATGTTTTTG GCTTCGGCGG CGCCCTCCAC CGCCGGTACC CAGAAGTGTG GCGCCAGTTA GAAGGACGTT GGAACGAGGA ATTCAAAAAG TTGCCTGTTA CCATTAGCGT CGAAGCCAAA CTACGACGTA CCGGGATGAC TGGACGTCCC TGGCAGCCCG GGGCGCGCTA G
|
Protein sequence | MPKLKRLVLF TSILGAFALA FSAGGCWDRR EINELAFLSC AAFDLEGGNR VLTSEFIRPS AAGGGERGGG ETLPQRQALI VSNRNKTFLA IGREKALGLP RRAYLAHTAA VLVGEEMARY GIKEVLDFVD RNPEIRRTTL ILLTRGPARE VLVRAQSGLE KTLGREITGL HKWVQVSGYG YIPNINDIFF DLSGDAGTTV LPVLELSPQP FPPILGPATA TGGGIPAGRA GEPETLMTAR LNGAGLFYHD KLVAWLDQEQ TRGWAWVRNK VKSAMLALPR QENSLVSVNI ISSRAEAAID MQGGRPQGKI KIKVEGDLLE EQCYQDFTKE EAVKSLESRM AAQITSEISS ALNQAKMAGT DVFGFGGALH RRYPEVWRQL EGRWNEEFKK LPVTISVEAK LRRTGMTGRP WQPGAR
|
| |