Gene Information |
Locus tag | Moth_0067 |
Symbol | |
ID | 3832676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 68473 |
End bp | 69375 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637827999 |
Product | extracellular solute-binding protein |
Protein accession | YP_428949 |
Protein GI | 83588940 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2998] ABC-type tungstate transport system, permease component |
TIGRFAM ID | |
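
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(68473, 69375)
print(length)                     # 903
print(protein_length_aa(length))  # 300
```

Both values agree with the Gene Length and Protein Length fields listed for Moth_0067.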
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000233722 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.143778 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAAA TAAGAAGAAT GTGGCTGGTT TTATTGGCGA CCCTAATCCT GTTATTAACG TTTACTGGAT GCGGCCAGGG GACAAAAACA GGGGACAAGC AGCAGACGCC GCTAGCCGCC GGACCTGTCA ATAAAAACCT TATCCTGGCC ACTACCACCA GTACTATGGA CAGTGGCTTA CTCGACGTCC TTATTCCCAT GTTTGAAGAA AAGAGCGGCT ATATCGTTAA ACCAAATGCC GTAGGTACCG GGCAAGCCCT GGCCATGGGC GATCAGGGTA ACGCCGATGT CTTGCTGGTC CATGCCCCGG CCGACGAGGT CAAACTGGTA CAAAAAGGAA CAGTCATTAA CCGCCAGCTG GTCATGCATA ATGACTTTAT CATCGTCGGC CCGCCAAGCG ATCCGGCCGG GATCAGAGGG GTAAAAAAAG CGGCCGACGC TTTTAAAAAG ATTGCCGCCA AGCAGGCCAT CTTTGTCTCC CGGGGAGACG ATTCCGGCAC CCATAAGAAG GAAAAGGATA TTTGGAAAGA AGCCGGGATC AATCCCCAAG GCAAGTGGTA CCAGGAAGCC GGTGCCGGCA TGGGCCAGAC CTTAAATATA GCCTCGGAAA AAGGCGGCTA TACCCTGACG GACCGTGGCA CCTACCTGGC ATTAAAAAAG AACCTTAACC TGGATATTAT GTTAGAAGGG GAAAAGACCC TGCTCAATAT CTATCATGTT ATGCAGGTCA ACCCGGAGAA GTTCCCCGGA ATGAAGATCA ACAGCGAGGG AGCGAAGGCC TTTGTAGATT TCATGGTGGC TCCGGAGACC CAGAAGGTTA TCGGTGATTT TGGTAAGGAT AAATTCGGTC AGTCCCTCTT CTTCCCCGAC GCCGGCAAGG ATGAGAATAC CCTGGGGCAG TAA
|
Protein sequence | MAKIRRMWLV LLATLILLLT FTGCGQGTKT GDKQQTPLAA GPVNKNLILA TTTSTMDSGL LDVLIPMFEE KSGYIVKPNA VGTGQALAMG DQGNADVLLV HAPADEVKLV QKGTVINRQL VMHNDFIIVG PPSDPAGIRG VKKAADAFKK IAAKQAIFVS RGDDSGTHKK EKDIWKEAGI NPQGKWYQEA GAGMGQTLNI ASEKGGYTLT DRGTYLALKK NLNLDIMLEG EKTLLNIYHV MQVNPEKFPG MKINSEGAKA FVDFMVAPET QKVIGDFGKD KFGQSLFFPD AGKDENTLGQ
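
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a sketch under stated assumptions, not the method the database itself uses: the helper names are hypothetical, the spaces between 10-letter blocks are treated as formatting only, and the codon table is written out by hand in the usual TCAG ordering. Translation table 11 assigns the same amino acids to elongation codons as the standard code and differs in which start codons it permits, so a standard-code table is used here.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
fragment = "ATGGCGAAAA TAAGAAGAAT GTGGCTGGTT"
print(translate(fragment))  # MAKIRRMWLV
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field, or passing it to `gc_fraction` and comparing with the GC content field, would be the natural consistency checks.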