Gene Information |
Locus tag | Moth_2041 |
Symbol | |
ID | 3831187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2130204 |
End bp | 2131301 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829970 |
Product | spore germination protein |
Protein accession | YP_430880 |
Protein GI | 83590871 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00912] spore germination protein (amino acid permease) |
|
|
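The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2130204, 2131301)
print(length)                           # 1098, matches "Gene Length"
print(expected_protein_length(length))  # 365, matches "Protein Length"
```

If either check failed for a record of this kind, the usual suspects would be a different coordinate convention or a partial or spliced feature.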
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGGGAAG AGGGTTATAT CAGCCCCCGC CAGGCAAGCG TTCTGCTATG GTTTGCCATC CTGCCCACGG CCATCCTCTT TTTGCCGGCC CTTCTGGCTT TACGTGCCCG TCAGGACGCC TGGTTGGCGG TTATCCTGGC TACAATGGCC GCCGGAGTGC CGGCACTGGG TATTTACCTG CTGGCCCAGC GCTTTCCCCA TTATACCCTG TTTCAATACT GCGAGTTGAT TTTAGGACGC TTGGCCGGCA AAATGGTGGC CATGGTGTTT GTCCTGGGTT TTTTCTTACT CAACGCCGTG GTCATCCGGG AGTTCAGCGA GTTTCTAACG ACCGCGGTAA TGCCGGAGAC GCCGTGTTTA TTCTTTGCCT TCAGTATTGT CGCCGTGGCT ATTTATGCCG CCCGCAACGG CCTGGAGGTT ATCGCCCGCT GCACAGATTT TATCATGCCC CTGCTGGTCG CTTTTATTTT TGTGATCCTT CTCTTTTCGA CTCCGGAAAT GTCTTTAAGG AATATCTTTC CTTTTCTAGA AAACGGCATC CGGCCGGTCC TGGGAGGAAC CCTAATCACC TGGCCCTTCC TGGGCCAGGT GATTATCCTG GCCACTTATG GTGCCTTCGT CAACCCACCC CGGTCCCTGG GGTGGAGCCT CATTAGCGGT CTGGTGGGGA TCGCCTTTTT CCTCGGTCTA GTTACTATGG GCGTAATCCT CGTCTTTGGT GCCTGCGAAG CCAGCAGCCT GACCTTTGCC GGTTATAACC TGGCCAGGGT TGTCTCCCTG GGCCAGTTTT TGGAGAGGAT AGAGGTCCTT TTCCTGGCTA TCTGGGTCGC CGGTGTCTTT ATTAAAATAA CCTTGAACTT CTATGTGGTC GCCCTGGGCC TGGCTACTGT TACCGGTCTG AAAGAGTACC GGCCCCTGGT GGCGCCCCTA GGGGCCTTAA ATACGGCTCT AGCAGCATTT ATGTATAGAA ACATCAGCGA GATCCGGCAG GATTTATTGA TGGTAGAACC TGGTTGGACC TTGACCTGGC AATTTATTCT ACCCCTGCTG TTGCTGCTGG TAGCCTGGGT GCGAGGGCAA AGGAGGACTG GCGCTTGA
|
Protein sequence | MREEGYISPR QASVLLWFAI LPTAILFLPA LLALRARQDA WLAVILATMA AGVPALGIYL LAQRFPHYTL FQYCELILGR LAGKMVAMVF VLGFFLLNAV VIREFSEFLT TAVMPETPCL FFAFSIVAVA IYAARNGLEV IARCTDFIMP LLVAFIFVIL LFSTPEMSLR NIFPFLENGI RPVLGGTLIT WPFLGQVIIL ATYGAFVNPP RSLGWSLISG LVGIAFFLGL VTMGVILVFG ACEASSLTFA GYNLARVVSL GQFLERIEVL FLAIWVAGVF IKITLNFYVV ALGLATVTGL KEYRPLVAPL GALNTALAAF MYRNISEIRQ DLLMVEPGWT LTWQFILPLL LLLVAWVRGQ RRTGA
|
| |
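The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text might be normalized and sanity-checked against the record's other fields (GC content, translation table 11). It assumes the gene sequence is shown in coding orientation, which seems likely here because it begins with TTG and ends with TGA even though the gene lies on the minus strand. The helper names are hypothetical, and the start-codon set is a deliberate simplification: it lists only the three most common bacterial start codons, whereas table 11 permits a few more.

```python
# Common bacterial start codons (a simplified subset of table 11 initiators).
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from block-formatted sequence text and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the sequence starts with a
    common start codon and ends with a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Usage with the first three blocks of the gene sequence above; paste the
# full sequence in the same way to compare against the reported GC content.
fragment = clean_sequence("TTGCGGGAAG AGGGTTATAT CAGCCCCCGC")
print(len(fragment))                      # 30
print(fragment[:3] in COMMON_START_CODONS)  # True (TTG start)
print(round(gc_fraction(fragment), 2))
```

Under table 11 a TTG initiation codon is translated as methionine, which is why the protein sequence begins with M although TTG normally encodes leucine.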